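Unicode U+FFFD ('\ufffd') is used to represent a UTF-8 decoding error when ranging over a string, but it is itself a valid UTF-8 character, so seeing it in the loop does not by itself mean the input was invalid.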
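
A minimal sketch of how the two cases can be told apart, assuming the note refers to Go's `for ... range` over a string. The helper name `countDecodeErrors` is hypothetical. It relies on `utf8.DecodeRuneInString`, which returns `utf8.RuneError` with a width of 1 for an invalid byte, while a genuine U+FFFD in the input decodes with a width of 3.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// countDecodeErrors (hypothetical helper) counts the positions in s where
// UTF-8 decoding fails. A decoding error is reported as utf8.RuneError with
// width 1; a real, validly encoded U+FFFD has width 3 and is not counted.
func countDecodeErrors(s string) int {
	n := 0
	for i := 0; i < len(s); {
		r, size := utf8.DecodeRuneInString(s[i:])
		if r == utf8.RuneError && size == 1 {
			n++
		}
		i += size
	}
	return n
}

func main() {
	bad := "a\xffb"     // contains one invalid byte
	good := "a\ufffdb"  // contains a validly encoded U+FFFD

	// Ranging over either string yields the same rune in the middle.
	for _, r := range bad {
		fmt.Printf("%U ", r)
	}
	fmt.Println()
	for _, r := range good {
		fmt.Printf("%U ", r)
	}
	fmt.Println()

	// Checking the decoded width separates the two cases.
	fmt.Println(countDecodeErrors(bad), utf8.ValidString(bad))
	fmt.Println(countDecodeErrors(good), utf8.ValidString(good))
}
```

If only a yes/no answer is needed, `utf8.ValidString` alone is enough. The width check is useful when the position or number of bad bytes matters.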