Commit | Line | Data |
---|---|---|
af301c44 DL |
1 | -*-text-*- |
2 | ||
3 | Problems, fixmes and other issues in the emacs-unicode branch | |
8cfdb48d | 4 | ------------------------------------------------------------- |
af301c44 | 5 | |
63d6cc62 DL |
6 | Notes by fx to record various things of variable importance. handa |
7 | needs to check them -- don't take too seriously, especially with | |
8 | regard to completeness. | |
af301c44 | 9 | |
8cfdb48d DL |
10 | _Do take seriously that you don't want this branch unless you're |
11 | actually working on it; you risk your data by actually using it._ If | |
12 | you just want to edit Unicode and/or unify iso-8859 et al, see the | |
13 | existing support and the extra stuff at | |
14 | <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk. | |
15 | (Editing support is mostly orthogonal to the internal representation.) | |
af301c44 | 16 | |
8cfdb48d DL |
17 | * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has |
18 | undesirable effects. | |
af301c44 DL |
19 | |
20 | * Rationalize character syntax and its relationship to the Unicode | |
21 | database. Specifically, the latin-N.el files aren't consistent for | |
8cfdb48d DL |
22 | common characters (and obviously have redundancies except in |
23 | unibyte mode). | |
af301c44 | 24 | |
8cfdb48d DL |
25 | * Fontset handling and customization needs work. We want to relate |
26 | fonts to scripts, probably based on the Unicode blocks. The | |
27 | presence of small-repertoire 10646-encoded fonts in XFree 4 is a | |
28 | pain, not currently worked round. | |
af301c44 | 29 | |
aeae5b21 KH |
30 | With the change on 2002-07-26, multiple fonts can be |
31 | specified in a fontset for a specific range of characters. | |
32 | Each range can also be specified by script. Before using | |
33 | ISO10646 fonts, Emacs checks their repertories to avoid such | |
34 | fonts that don't have a glyph for a specific character. | |
97a04413 | 35 | |
8cfdb48d | 36 | * Work is also needed on charset and coding system priorities. |
af301c44 DL |
37 | |
38 | * The relevant bits of latin1-disp.el need porting (and probably | |
39 | re-naming/updating). See also cyril-util.el. | |
40 | ||
8cfdb48d | 41 | * Quail files need more work now the encoding is irrelevant. |
af301c44 DL |
42 | |
43 | * What to do with the old coding categories stuff? | |
44 | ||
8cfdb48d | 45 | * Syntax for symbols &c in characters.el needs looking at. |
af301c44 DL |
46 | |
47 | * The preferred-coding-system property of charsets should probably be | |
48 | junked unless it can be made more useful now. | |
49 | ||
8cfdb48d | 50 | * find-coding-systems-for-charsets needs re-writing or removing. |
af301c44 DL |
51 | |
52 | * find-multibyte-characters needs looking at. | |
53 | ||
54 | * Implement Korean cp949/UHC and any other important missing | |
55 | charsets. | |
56 | ||
8cfdb48d | 57 | * Check up on definitions of tcvn and alternativnj. |
af301c44 DL |
58 | |
59 | * Lazy-load tables for unify-charset somehow? | |
60 | ||
aeae5b21 KH |
61 | Actually, Emacs clear out all charset maps and unify-map just |
62 | before dumping, and their are loaded again on demand the | |
63 | dumped emacs. But, those maps (char tables) generated while | |
64 | temacs is running can't be get rid of from the dumped emacs. | |
65 | ||
63d6cc62 | 66 | * Translation tables for {en,de}code currently aren't supported. |
af301c44 DL |
67 | |
68 | * Defining CCL coding systems currently doesn't work. | |
69 | ||
70 | * iso-2022 charsets get unified on i/o. | |
71 | ||
72 | * Revisit locale processing: look at treating the language and | |
73 | charset parts separately. (Language should affect things like | |
74 | speling and calendar, but that's not a Unicode issue.) | |
75 | ||
76 | * Handle Unicode combining characters usefully, e.g. diacritics, and | |
8cfdb48d | 77 |