Commit Graph

45 Commits

Author SHA1 Message Date
5af13e8a9f *: improve import (#31887)
ref pingcap/tidb#31716
2022-01-22 09:15:46 +08:00
0c19f92741 encoding: refactor transform to avoid the misuse buffer (#31480)
close pingcap/tidb#31488
2022-01-10 12:13:41 +08:00
cfbe70846a config: remove new charset config (#31291)
close pingcap/tidb#31236
2022-01-06 13:26:37 +08:00
6e75367ad0 collation: make gbk default collation to gbk_bin if new collation is not enabled (#31208)
close pingcap/tidb#31221
2022-01-04 15:48:35 +08:00
17927d489a parser: decode the binary literal value in enum or set (#31231)
close pingcap/tidb#30740
2022-01-04 14:14:35 +08:00
9ffeff2857 types: convert to new charset before inserting to blob or json column (#31031)
close pingcap/tidb#30690
2021-12-30 21:35:50 +08:00
6c0c442a32 parser: add custom_gbk encoding to deal with 0x80 (#31123)
close pingcap/tidb#30581
2021-12-30 13:39:50 +08:00
4dec85cce3 parser: convert identifier to utf8mb4 charset (#31108)
close pingcap/tidb#31113
2021-12-29 16:07:50 +08:00
7647c2b9b0 parser: parse gbk encoding SQL token by token instead of the whole SQL (#31039)
close pingcap/tidb#30765
2021-12-28 22:07:49 +08:00
61d13b561c encoding: skip utf8 charset validation in some cases (#31061)
close pingcap/tidb#31014
2021-12-28 17:51:49 +08:00
d543b60464 parser: add IsValid() to Encoding to speed up string validation for UTF-8 (#30937)
close pingcap/tidb#30936
2021-12-22 18:19:48 +08:00
ab35db14a6 *: refactor encoding and uniform usages (#30288) 2021-12-20 15:27:46 +08:00
1e5e86920d test: merge serial tests in bindinfo, expression, parser and statistics (#30749) 2021-12-16 18:40:37 +08:00
b4df7ca4dc test: make all the tests run in serial (#30692) 2021-12-14 18:06:36 +08:00
6898975a7d parser: fix missing charset and collation of handle column (#30320) 2021-12-02 17:05:54 +08:00
a0f7643e3f *: forbid set TiFlash Replica for a table with unsupport charset (#30162) 2021-11-30 15:55:53 +08:00
e2e6e5561b encoding: refactor the character encoding (#29935) 2021-11-22 16:59:49 +08:00
c835349972 table, parser: check for invalid GBK characters before insertion (#28814) 2021-11-22 13:29:49 +08:00
9ec688ebe8 expression, parser: support GBK for builtin function ord (#29527) 2021-11-18 14:33:47 +08:00
1d179552f4 mockstore: set FieldType.charset in coprocessor (#29646) 2021-11-11 17:53:06 +08:00
ad85341857 charset: support builtin function convert (#28943) 2021-11-01 16:56:51 +08:00
11b61738f3 parser: migrate test-infra to testify (#29191) 2021-10-28 12:30:49 +08:00
412dd4f763 parser: refactor Parse() interface to make it extensible (#28975) 2021-10-21 10:56:45 +08:00
9f8c053d06 parser: refactor encode and decode (#28917)
* refactor encoding

* add a simple test case for encoding

Co-authored-by: tangenta <tangenta@126.com>
Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>
2021-10-19 15:50:33 +08:00
42001503cb expression, parser: support GBK charset for builtin function lower() (#28869) 2021-10-15 18:02:41 +08:00
ac9dcbb53f expression: support GBK charset for builtin function upper() (#28817) 2021-10-15 15:29:27 +08:00
xhe
94e30df8e2 parser: fix compiling, add unit tests (#28700) 2021-10-11 23:15:25 +08:00
bd299b990e [parser] charset: fix Encoding.Encode() and add some tests (#1344) 2021-10-09 14:53:23 +08:00
ba105bbd10 [parser] support parsing SQL with encodings other than utf8 (#1312) 2021-10-09 14:53:23 +08:00
34646f4acb [parser] charset: sort the charset by its name (#1307) 2021-10-09 14:53:23 +08:00
b266e09b30 [parser] charset: clean up some code about charset (#1306) 2021-10-09 14:53:23 +08:00
763fc87ceb [parser] support missing charset (#1140) (#1301) 2021-10-09 14:53:23 +08:00
9beeac38b3 [parser] collation: add new pinyin collation utf8mb4_zh_pinyin_tidb_as_cs (#1058) 2021-10-09 14:53:23 +08:00
e3daabacc0 [parser] *: redact arguments for Error (#1051) 2021-10-09 14:53:23 +08:00
6eb2909901 [parser] change all dot import, according to staticcheck (#730)
* change all dot import, according to staticcheck

* make staticcheck happy
2021-10-09 14:53:23 +08:00
989870f109 [parser] *: remove testleak (#676) 2021-10-09 14:53:23 +08:00
94eb38bc74 [parser] clean up error code and message for parser (#618) 2021-10-09 14:53:23 +08:00
1fa4d04572 [parser] *: fix upper-cased charset and collation name (#301) 2021-10-09 14:53:23 +08:00
64b24c11d0 [parser] charset: add GetSupportedCollations to get only supported collations (#295) 2021-10-09 14:53:23 +08:00
35f6c88433 [parser] suspend unknown collation id 255 error (#294) 2021-10-09 14:53:23 +08:00
H
1e6b681e72 [parser] improve:GetCharsetDesc (#256) 2021-10-09 14:53:23 +08:00
bb08c990a4 [parser] improve: GetAllCharsets and GetCharsetInfoByID (#247) 2021-10-09 14:53:23 +08:00
55967eb284 [parser] charsert: make collation case insensitive (#66) 2021-10-09 14:53:23 +08:00
6f2c7111c4 [parser] *: change default charset from utf8 to utf8mb4 (#13) 2021-10-09 14:53:23 +08:00
c270f55405 [parser] *: move github.com/pingcap/tidb/parser here (#1) 2021-10-09 14:53:23 +08:00