This saves three cycles per character and reduces the size of the function by eight bytes (the loop is unrolled eight times). Thanks to davidg for pointing this out.