This is a discussion on BUG #3525: Lithuanian characters doesn't work in queries with regular expressions within the pgsql Bugs forums, part of the PostgreSQL category; --> The following bug has been logged online: Bug reference: 3525 Logged by: Rolandas Rudomanskis Email address: rolandasr@gmail.com PostgreSQL version: ...
| |||||||
| Register | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
| ||||
| The following bug has been logged online: Bug reference: 3525 Logged by: Rolandas Rudomanskis Email address: rolandasr@gmail.com PostgreSQL version: 8.2.3 Operating system: Debian Etch Description: Lithuanian characters doesn't work in queries with regular expressions Details: The database is set to unicode. Debian locale is utf-8. All lithuanian characters characters, except ė, ų, namely, ą, č, ę, į, š, ū, ž in non-case sensitive queries specifying word beginning, are ignored. SQL example: select * from table1 where title ~* '\\m" + mytext + "';" ---------------------------(end of broadcast)--------------------------- TIP 5: don't forget to increase your free space map settings |
| ||||
| "Rolandas Rudomanskis" <rolandasr@gmail.com> writes: > Description: Lithuanian characters doesn't work in queries with > regular expressions > The database is set to unicode. Debian locale is utf-8. Yeah, the regex code doesn't handle localization issues (such as case-insensitive matching) correctly for multibyte characters. Can you use a single-byte encoding? Alternatively, I believe case-insensitive LIKE matching works for this. regards, tom lane ---------------------------(end of broadcast)--------------------------- TIP 4: Have you searched our list archives? http://archives.postgresql.org |