mirror of https://git.postgresql.org/git/postgresql.git synced 2026-02-22 22:37:01 +08:00

Go to file

Heikki Linnakangas 54598670fe Remove 'charlen' argument from make_trigrams()

The function assumed that if charlen == bytelen, there are no
multibyte characters in the string. That's sensible, but the callers
were a little careless in how they calculated the lengths. The callers
converted the string to lowercase before calling make_trigram(), and
the 'charlen' value was calculated *before* the conversion to
lowercase while 'bytelen' was calculated after the conversion. If the
lowercased string had a different number of characters than the
original, make_trigram() might incorrectly apply the fastpath and
treat all the bytes as single-byte characters, or fail to apply the
fastpath (which is harmless), or it might hit the "Assert(bytelen ==
charlen)" assertion. I'm not aware of any locale / character
combinations where you could hit that assertion in practice,
i.e. where a string converted to lowercase would have fewer characters
than the original, but it seems best to avoid making that assumption.

To fix, remove the 'charlen' argument. To keep the performance when
there are no multibyte characters, always try the fast path first, but
check the input for multibyte characters as we go. The check on each
byte adds some overhead, but it's close enough. And to compensate, the
find_word() function no longer needs to count the characters.

This fixes one small bug in make_trigrams(): in the multibyte
codepath, it peeked at the byte just after the end of the input
string. When compiled with IGNORECASE, that was harmless because there
is always a NUL byte or blank after the input string. But with
!IGNORECASE, the call from generate_wildcard_trgm() doesn't guarantee
that.

Backpatch to v18, but no further. In previous versions lower-casing was
done character by character, and thus the assumption that lower-casing
doesn't change the character length was valid. That was changed in v18,
commit fb1a18810f.

Security: CVE-2026-2007
Reviewed-by: Noah Misch <noah@leadboat.com>

2026-02-09 12:08:58 +13:00

.github

Add CODE_OF_CONDUCT.md, CONTRIBUTING.md, and SECURITY.md.

2024-07-02 13:03:58 -05:00

config

Revert "Change copyObject() to use typeof_unqual"

2026-02-07 10:08:38 +01:00

contrib

Remove 'charlen' argument from make_trigrams()

2026-02-09 12:08:58 +13:00

doc

libpq: Prepare for protocol grease during 19beta

2026-02-06 10:31:45 -08:00

src

Replace some hard-wired OID constants with corresponding macros.

2026-02-07 23:15:20 -05:00

.cirrus.star

ci: Simplify ci-os-only handling

2025-08-14 12:09:34 -04:00

.cirrus.tasks.yml

ci: Configure g++ with 32-bit for 32-bit build

2026-01-09 08:58:50 +01:00

.cirrus.yml

ci: Per-repo configuration for manually trigger tasks

2025-08-14 11:54:03 -04:00

.dir-locals.el

Make Emacs perl-mode indent more like perltidy.

2019-01-13 11:32:31 -08:00

.editorconfig

Update .editorconfig and .gitattributes for postgresql.conf.sample.

2025-11-18 10:28:36 -06:00

.git-blame-ignore-revs

Add a couple of recent commits to .git-blame-ignore-revs.

2026-01-28 15:56:48 -06:00

.gitattributes

Update .editorconfig and .gitattributes for postgresql.conf.sample.

2025-11-18 10:28:36 -06:00

.gitignore

Update top-level .gitignore.

2022-12-04 15:23:00 -05:00

.mailmap

Add a Git .mailmap file

2024-11-05 13:56:02 +01:00

aclocal.m4

autoconf: Move export_dynamic determination to configure

2022-12-06 18:55:28 -08:00

configure

Revert "Change copyObject() to use typeof_unqual"

2026-02-07 10:08:38 +01:00

configure.ac

Revert "Change copyObject() to use typeof_unqual"

2026-02-07 10:08:38 +01:00

Update copyright for 2026

2026-01-01 13:24:10 -05:00

GNUmakefile.in

Allow selecting the git revision to be packaged by "make dist".

2024-05-03 11:08:50 -04:00

HISTORY

Canonicalize some URLs

2020-02-10 20:47:50 +01:00

Makefile

Remove AIX support

2024-02-28 15:17:23 +04:00

meson_options.txt

2026-01-01 13:24:10 -05:00

meson.build

meson: host_system value for Solaris is 'sunos' not 'solaris'.

2026-02-07 20:05:52 -05:00

README.md

Revise the style of a paragraph in README.md.

2024-03-21 10:16:41 -05:00

README.md

PostgreSQL Database Management System

This directory contains the source code distribution of the PostgreSQL database management system.

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions. This distribution also contains C language bindings.

General documentation about this version of PostgreSQL can be found at https://www.postgresql.org/docs/devel/. In particular, information about building PostgreSQL from the source code can be found at https://www.postgresql.org/docs/devel/installation.html.

The latest version of this software, and related software, may be obtained at https://www.postgresql.org/download/. For more information look at our web site located at https://www.postgresql.org/.

Languages

C 84.8%

PLpgSQL 6.1%

Perl 4.7%

Yacc 1.2%

Meson 0.7%

Other 2.4%