Re: UTF8 or Unicode (pgsql Hackers forum, PostgreSQL category)
Bruce Momjian <pgman@candle.pha.pa.us> writes:
> Peter Eisentraut wrote:
>> I think this is what we should do:
>>
>> UNICODE => UTF8
>> ALT => WIN866
>> WIN => WIN1251
>> TCVN => WIN1258

> OK, but what about latin1?

I think LATIN1 is fine as-is. It's a reasonably popular name for the character set, and despite Tatsuo's complaint, it's not going to confuse anyone in practice --- the 7-bit version of that standard has no traction.

The reason UNICODE is a bad name for UTF8 is exactly that there are multiple physical encodings of Unicode that are in common use.

regards, tom lane

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to majordomo@postgresql.org
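To illustrate the point about multiple physical encodings, here is a minimal Python sketch (not part of the original thread). It encodes one Unicode code point with several of Python's built-in codecs and shows that the resulting bytes differ. The choice of character and the list of codecs are assumptions made for the illustration.

```python
# One Unicode code point, several physical byte encodings.
# The character and the codec list are illustrative choices only.
ch = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE

encodings = ["utf-8", "utf-16-le", "utf-16-be", "utf-32-le"]

# Map each codec name to the bytes it produces for the same character.
encoded = {name: ch.encode(name) for name in encodings}

for name, raw in encoded.items():
    print(f"{name:10} {raw.hex()}")
```

All four byte strings represent the same character, so a label such as "UNICODE" does not say which byte layout is meant, whereas "UTF8" does.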
On Friday, 25 February 2005 16:26, Bruce Momjian wrote:
> OK, but what about latin1?

The following character set names are specified in the SQL standard and therefore somewhat non-negotiable:

SQL_CHARACTER
GRAPHIC_IRV
LATIN1
ISO8BIT
UTF16
UTF8
UCS2
SQL_TEXT
SQL_IDENTIFIER

So we have to use LATIN1, even though it creates an inconsistency. We discussed this a while ago during the last great renaming, I think.

Btw., I think ISO8BIT is the correct name for what we call SQL_ASCII, but I haven't analyzed that in detail, yet.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/
Peter Eisentraut wrote:
> On Friday, 25 February 2005 16:26, Bruce Momjian wrote:
> > OK, but what about latin1?
>
> The following character set names are specified in the SQL standard and
> therefore somewhat non-negotiable:
>
> SQL_CHARACTER
> GRAPHIC_IRV
> LATIN1
> ISO8BIT
> UTF16
> UTF8
> UCS2
> SQL_TEXT
> SQL_IDENTIFIER
>
> So we have to use LATIN1, even though it creates an inconsistency. We
> discussed this a while ago during the last great renaming, I think.
>

Oh, UTF8 and not UTF-8? I thought UTF-8 was the standard name, but if ANSI uses UTF8 we will have to use that.

> Btw., I think ISO8BIT is the correct name for what we call SQL_ASCII, but I
> haven't analyzed that in detail, yet.

OK, please let us know.

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073
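For readers who want the renaming proposed in this thread in one place, here is a hedged Python sketch of it as a lookup table. The table contents come from Peter's proposal quoted above. The names `RENAMES` and `canonical_encoding_name` are hypothetical and do not correspond to anything in the PostgreSQL source.

```python
# Hypothetical alias table built from the renaming proposed in this thread.
# This is an illustration only, not PostgreSQL's actual implementation.
RENAMES = {
    "UNICODE": "UTF8",
    "ALT": "WIN866",
    "WIN": "WIN1251",
    "TCVN": "WIN1258",
}


def canonical_encoding_name(name: str) -> str:
    """Return the proposed new name for an old encoding name.

    Lookup is case-insensitive. Names that are not in the table
    (LATIN1, for example) are returned upper-cased but otherwise unchanged.
    """
    key = name.strip().upper()
    return RENAMES.get(key, key)


for old in ["unicode", "ALT", "win", "TCVN", "latin1"]:
    print(old, "=>", canonical_encoding_name(old))
```

LATIN1 passes through untouched, which matches the conclusion in the thread that the name stays as it is.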