Mantis - Squeak
Viewing Issue Advanced Details
1792 Multilingual minor always 09-13-05 20:01 07-21-13 02:48
bert  
bert  
normal  
assigned 3.8  
open  
none    
none  
0001792: UTF8TextConverter incorrectly reads malformed multi-byte sequences
In an UTF8 multi-byte sequence, the second to last byte need to be of the form "10xxxxxx". This is not checked for by the UTF8TextConverter. It just reads those bytes. However, it must not interpret those bytes as multi-byte sequence, but rather start a new character there. Otherwise, valid characters are skipped.

Notes
(0014393)
tim   
07-21-13 02:48   
Does it still do this?