rename scanner_test.go to lexer_test.go make the code more robust when unicode.ReplacementChar appears