Normally the test runners run in 'forking mode'. When a lot of messages are logged to stdout, the child process can still be writing data when the parent writes 'ok' or 'not ok'. This can confuse the tap-driver.sh process.