Commit | Line | Data |
---|---|---|
af301c44 DL |
1 | -*-text-*- |
2 | ||
3 | Problems, fixmes and other issues in the emacs-unicode branch | |
8cfdb48d | 4 | ------------------------------------------------------------- |
af301c44 | 5 | |
63d6cc62 DL |
6 | Notes by fx to record various things of variable importance. handa |
7 | needs to check them -- don't take too seriously, especially with | |
8 | regard to completeness. | |
af301c44 | 9 | |
8cfdb48d DL |
10 | _Do take seriously that you don't want this branch unless you're |
11 | actually working on it; you risk your data by actually using it._ If | |
12 | you just want to edit Unicode and/or unify iso-8859 et al, see the | |
13 | existing support and the extra stuff at | |
14 | <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk. | |
15 | (Editing support is mostly orthogonal to the internal representation.) | |
af301c44 | 16 | |
8cfdb48d DL |
17 | * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has |
18 | undesirable effects. | |
af301c44 DL |
19 | |
20 | * Rationalize character syntax and its relationship to the Unicode | |
21 | database. Specifically, the latin-N.el files aren't consistent for | |
8cfdb48d DL |
22 | common characters (and obviously have redundancies except in |
23 | unibyte mode). | |
af301c44 | 24 | |
8cfdb48d DL |
25 | * Fontset handling and customization needs work. We want to relate |
26 | fonts to scripts, probably based on the Unicode blocks. The | |
27 | presence of small-repertoire 10646-encoded fonts in XFree 4 is a | |
28 | pain, not currently worked round. | |
af301c44 | 29 | |
8cfdb48d | 30 | * Work is also needed on charset and coding system priorities. |
af301c44 DL |
31 | |
32 | * The relevant bits of latin1-disp.el need porting (and probably | |
33 | re-naming/updating). See also cyril-util.el. | |
34 | ||
8cfdb48d | 35 | * Quail files need more work now the encoding is irrelevant. |
af301c44 DL |
36 | |
37 | * What to do with the old coding categories stuff? | |
38 | ||
8cfdb48d | 39 | * Syntax for symbols &c in characters.el needs looking at. |
af301c44 DL |
40 | |
41 | * The preferred-coding-system property of charsets should probably be | |
42 | junked unless it can be made more useful now. | |
43 | ||
8cfdb48d | 44 | * find-coding-systems-for-charsets needs re-writing or removing. |
af301c44 DL |
45 | |
46 | * find-multibyte-characters needs looking at. | |
47 | ||
48 | * Implement Korean cp949/UHC and any other important missing | |
49 | charsets. | |
50 | ||
8cfdb48d | 51 | * Check up on definitions of tcvn and alternativnj. |
af301c44 DL |
52 | |
53 | * Lazy-load tables for unify-charset somehow? | |
54 | ||
63d6cc62 | 55 | * Translation tables for {en,de}code currently aren't supported. |
af301c44 DL |
56 | |
57 | * Defining CCL coding systems currently doesn't work. | |
58 | ||
59 | * iso-2022 charsets get unified on i/o. | |
60 | ||
61 | * Revisit locale processing: look at treating the language and | |
62 | charset parts separately. (Language should affect things like | |
63 | speling and calendar, but that's not a Unicode issue.) | |
64 | ||
65 | * Handle Unicode combining characters usefully, e.g. diacritics, and | |
8cfdb48d | 66 |