Unicode, unicode, unicode - it's not working!

Started by ajf, 2012 Oct 06, 17:24:53

previous topic - next topic

0 Members and 1 Guest are viewing this topic.

Go Down

ajf

Hi.

I tried to use the mathematical set membership symbol, but it was replaced with a ? in my signature (and indeed, this post, before editing).

Is this a filter of some sort? A bug? On the other SMF forum I use, non-ASCII code points show up as html entities sometimes, but usually correctly. Are you sure the MySQL table for signatures is set to the correct charset? Make sure it's the one that accepts non-BMP symbols... (I can help you with this if you like, although I doubt I'll need to...)

EDIT: You have UTF-8 page encoding but not UTF-8 support? I've seen SMF forums set to Windows-1252 page encoding with better Unicode support!  :I

Thanks!

EDIT: Examples:

??????? (doesn't work, should be mandarin chinese)

"hello" (works, should be smart quotes)

£ (works, should be pound GBP symbol)

€ (works, should be euro EUR symbol)

- hmm, looks like latin-1(edit: Windows-1252) or possibly whole bmp is fine, but not beyond that

let's try some wikipedia examples:

U+007A   z   LATIN SMALL LETTER Z (works!)

U+6C34   ?   CJK UNIFIED IDEOGRAPH-6C34 (water)

U+10000   ????   LINEAR B SYLLABLE B008 A (first non-BMP code point) - FAILS!

U+1D11E   ????   MUSICAL SYMBOL G CLEF - FAILES!

U+10FFFD   ????   PRIVATE USE CHARACTER-10FFFD (last Unicode code point) - FAILS!


OK, looks like it's anything non-latin-1(edit: Windows-1252), not non-bmp, that's broken.

:(

Liska



Liska

I gave it a shot. Meh. You don't need them. Use English.

Go Up