*** empty log message ***
[bpt/emacs.git] / README.unicode
CommitLineData
af301c44
DL
1 -*-text-*-
2
3Problems, fixmes and other issues in the emacs-unicode branch
8cfdb48d 4-------------------------------------------------------------
af301c44 5
63d6cc62
DL
6Notes by fx to record various things of variable importance. handa
7needs to check them -- don't take too seriously, especially with
8regard to completeness.
af301c44 9
8cfdb48d
DL
10_Do take seriously that you don't want this branch unless you're
11actually working on it; you risk your data by actually using it._ If
12you just want to edit Unicode and/or unify iso-8859 et al, see the
13existing support and the extra stuff at
14<URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk.
15(Editing support is mostly orthogonal to the internal representation.)
af301c44 16
8cfdb48d
DL
17 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has
18 undesirable effects.
af301c44
DL
19
20 * Rationalize character syntax and its relationship to the Unicode
21 database. Specifically, the latin-N.el files aren't consistent for
8cfdb48d
DL
22 common characters (and obviously have redundancies except in
23 unibyte mode).
af301c44 24
8cfdb48d
DL
25 * Fontset handling and customization needs work. We want to relate
26 fonts to scripts, probably based on the Unicode blocks. The
27 presence of small-repertoire 10646-encoded fonts in XFree 4 is a
28 pain, not currently worked round.
af301c44 29
8cfdb48d 30 * Work is also needed on charset and coding system priorities.
af301c44
DL
31
32 * The relevant bits of latin1-disp.el need porting (and probably
33 re-naming/updating). See also cyril-util.el.
34
8cfdb48d 35 * Quail files need more work now the encoding is irrelevant.
af301c44
DL
36
37 * What to do with the old coding categories stuff?
38
8cfdb48d 39 * Syntax for symbols &c in characters.el needs looking at.
af301c44
DL
40
41 * The preferred-coding-system property of charsets should probably be
42 junked unless it can be made more useful now.
43
8cfdb48d 44 * find-coding-systems-for-charsets needs re-writing or removing.
af301c44
DL
45
46 * find-multibyte-characters needs looking at.
47
48 * Implement Korean cp949/UHC and any other important missing
49 charsets.
50
8cfdb48d 51 * Check up on definitions of tcvn and alternativnj.
af301c44
DL
52
53 * Lazy-load tables for unify-charset somehow?
54
63d6cc62 55 * Translation tables for {en,de}code currently aren't supported.
af301c44
DL
56
57 * Defining CCL coding systems currently doesn't work.
58
59 * iso-2022 charsets get unified on i/o.
60
61 * Revisit locale processing: look at treating the language and
62 charset parts separately. (Language should affect things like
63 speling and calendar, but that's not a Unicode issue.)
64
65 * Handle Unicode combining characters usefully, e.g. diacritics, and
8cfdb48d 66