Sophie

Sophie

distrib > Mandriva > 2008.1 > x86_64 > by-pkgid > f236c7084c3c16765567cb3ebdeb3327 > files > 8

lib64unicode0-0.4.gnome-11mdv2008.1.x86_64.rpm

more work on test suite

Real bugs:
1. unicode_iconv needs to save the shift state when backing up
2. the init function for a converter has no idea if it is initializing
   for read or write.  should have two init functions

Treat malformed (and overlong, except maybe c0 80) UTF-8 characters
like 0xfffd.  Define for 0xfffd in converter header.

we need some more functions:
  a way to map character number to byte number
    (so we can continue to use strncmp)
  a strchr and strrchr using a unicode_char_t argument
  strcasecmp and strncasecmp

intermediate buffer:
  * how to size it?
  * should be circular for efficiency
  * should be bypassed when converting to/from UCS-4

more charsets
  Japanese charsets like JIS are very important
  These must be converted from Unicode tables using a script

name charsets appropriately
names of existing charsets should be changed.
eg ucs2 -> utf16
look at IANA registry

look at Solaris and glibc 2.1:
  make sure we use the same names for the same encodings
  see if we have to make it so our aliases apply even when
    using the system iconv (necessary when we want to use a fixed
    charset but existing systems don't agree on what it should be
    named)

if fromcode==tocode, then just copy through

merge with FriBidi

use gtk-doc

ucs readers which understand the feff convention

turn converters into shared objects of their own and dynamically load
them as requested

unicode_canonical_decomposition has a bad interface
Should let user give a buffer


================================================================

more properties:

* mirrored.  use 2-level table