[Docs] Clarify Zadig usage in FAQ Docs (#6360)
[jackhill/qmk/firmware.git] / docs / feature_unicode.md
CommitLineData
7b0356d1 1# Unicode Support
995c3141 2
ecf0612c 3Unicode characters can be input straight from your keyboard! There are some limitations, however.
995c3141 4
ecf0612c 5QMK has three different methods for enabling Unicode input and defining keycodes:
995c3141 6
ecf0612c 7## Basic Unicode
995c3141 8
ecf0612c 9This method supports Unicode code points up to `0x7FFF`. This covers characters for most modern languages, as well as symbols, but it doesn't cover emoji.
995c3141 10
ecf0612c 11Add the following to your `rules.mk`:
f86c0c26 12
ecf0612c 13```make
14UNICODE_ENABLE = yes
15```
16
17Then add `UC(c)` keycodes to your keymap, where _c_ is the code point (preferably in hexadecimal, up to 4 digits long). For example: `UC(0x45B)`, `UC(0x30C4)`.
18
19## Unicode Map
20
21This method supports all possible code points (up to `0x10FFFF`); however, you need to maintain a separate mapping table in your keymap file, which may contain at most 16384 entries.
22
23Add the following to your `rules.mk`:
24
25```make
26UNICODEMAP_ENABLE = yes
27```
28
29Then add `X(i)` keycodes to your keymap, where _i_ is an array index into the mapping table:
f86c0c26
DJ
30
31```c
fdeec296 32enum unicode_names {
ecf0612c 33 BANG,
34 IRONY,
35 SNEK
f86c0c26
DJ
36};
37
38const uint32_t PROGMEM unicode_map[] = {
ecf0612c 39 [BANG] = 0x203D, // ‽
40 [IRONY] = 0x2E2E, // ⸮
41 [SNEK] = 0x1F40D, // 🐍
ca356475 42};
f86c0c26
DJ
43```
44
bdc8d89e
45Then you can use `X(BANG)`, `X(SNEK)` etc. in your keymap.
46
47### Lower and Upper Case
48
ecf0612c 49Characters often come in lower and upper case pairs, such as å and Å. To make inputting these characters easier, you can use `XP(i, j)` in your keymap, where _i_ and _j_ are the mapping table indices of the lower and upper case character, respectively. If you're holding down Shift or have Caps Lock turned on when you press the key, the second (upper case) character will be inserted; otherwise, the first (lower case) version will appear.
bdc8d89e 50
ecf0612c 51This is most useful when creating a keymap for an international layout with special characters. Instead of having to put the lower and upper case versions of a character on separate keys, you can have them both on the same key by using `XP()`. This helps blend Unicode keys in with regular alphas.
bdc8d89e
52
53Due to keycode size constraints, _i_ and _j_ can each only refer to one of the first 128 characters in your `unicode_map`. In other words, 0 ≤ _i_ ≤ 127 and 0 ≤ _j_ ≤ 127. This is enough for most use cases, but if you'd like to customize the index calculation, you can override the [`unicodemap_index()`](https://github.com/qmk/qmk_firmware/blob/71f640d47ee12c862c798e1f56392853c7b1c1a8/quantum/process_keycode/process_unicodemap.c#L40) function. This also allows you to, say, check Ctrl instead of Shift/Caps.
995c3141 54
ecf0612c 55## UCIS
56
57This method also supports all possible code points. As with the Unicode Map method, you need to maintain a mapping table in your keymap file. However, there are no built-in keycodes for this feature — you have to create a custom keycode or function that invokes this functionality.
995c3141 58
ecf0612c 59Add the following to your `rules.mk`:
60
61```make
62UCIS_ENABLE = yes
63```
f86c0c26 64
ecf0612c 65Then define a table like this in your keymap file:
f86c0c26
DJ
66
67```c
cd9262d7 68const qk_ucis_symbol_t ucis_symbol_table[] = UCIS_TABLE(
ecf0612c 69 UCIS_SYM("poop", 0x1F4A9), // 💩
70 UCIS_SYM("rofl", 0x1F923), // 🤣
71 UCIS_SYM("kiss", 0x1F619) // 😙
f86c0c26
DJ
72);
73```
74
ecf0612c 75To use it, call `qk_ucis_start()`. Then, type the mnemonic for the character (such as "rofl"), and hit Space or Enter. QMK should erase the "rofl" text and insert the laughing emoji.
f86c0c26
DJ
76
77### Customization
78
cd9262d7 79There are several functions that you can define in your keymap to customize the functionality of this feature.
f86c0c26 80
cd9262d7
81* `void qk_ucis_start_user(void)` – This runs when you call the "start" function, and can be used to provide feedback. By default, it types out a keyboard emoji.
82* `void qk_ucis_success(uint8_t symbol_index)` – This runs when the input has matched something and has completed. By default, it doesn't do anything.
83* `void qk_ucis_symbol_fallback (void)` – This runs when the input doesn't match anything. By default, it falls back to trying that input as a Unicode code.
f86c0c26 84
fdeec296
DJ
85You can find the default implementations of these functions in [`process_ucis.c`](https://github.com/qmk/qmk_firmware/blob/master/quantum/process_keycode/process_ucis.c).
86
87## Input Modes
88
89Unicode input in QMK works by inputting a sequence of characters to the OS, sort of like a macro. Unfortunately, the way this is done differs for each platform. Specifically, each platform requires a different combination of keys to trigger Unicode input. Therefore, a corresponding input mode has to be set in QMK.
90
91The following input modes are available:
92
ecf0612c 93* **`UC_OSX`**: macOS built-in Unicode hex input. Supports code points up to `0xFFFF` (`0x10FFFF` with Unicode Map).
fdeec296
DJ
94
95 To enable, go to _System Preferences > Keyboard > Input Sources_, add _Unicode Hex Input_ to the list (it's under _Other_), then activate it from the input dropdown in the Menu Bar.
bdc8d89e 96 By default, this mode uses the left Option key (`KC_LALT`) for Unicode input, but this can be changed by defining [`UNICODE_KEY_OSX`](#input-key-configuration) with another keycode.
fdeec296 97
bdc8d89e 98 !> Using the _Unicode Hex Input_ input source may disable some Option based shortcuts, such as Option + Left Arrow and Option + Right Arrow.
9b82a929 99
cd9262d7 100* **`UC_LNX`**: Linux built-in IBus Unicode input. Supports code points up to `0x10FFFF` (all possible code points).
fdeec296
DJ
101
102 Enabled by default and works almost anywhere on IBus-enabled distros. Without IBus, this mode works under GTK apps, but rarely anywhere else.
bdc8d89e 103 By default, this mode uses Ctrl+Shift+U (`LCTL(LSFT(KC_U))`) to start Unicode input, but this can be changed by defining [`UNICODE_KEY_LNX`](#input-key-configuration) with another keycode. This might be required for IBus versions ≥1.5.15, where Ctrl+Shift+U behavior is consolidated into Ctrl+Shift+E.
fdeec296
DJ
104
105* **`UC_WIN`**: _(not recommended)_ Windows built-in hex numpad Unicode input. Supports code points up to `0xFFFF`.
106
bdc8d89e 107 To enable, create a registry key under `HKEY_CURRENT_USER\Control Panel\Input Method\EnableHexNumpad` of type `REG_SZ` called `EnableHexNumpad` and set its value to `1`. This can be done from the Command Prompt by running `reg add "HKCU\Control Panel\Input Method" -v EnableHexNumpad -t REG_SZ -d 1` with administrator privileges. Reboot afterwards.
fdeec296
DJ
108 This mode is not recommended because of reliability and compatibility issues; use the `UC_WINC` mode instead.
109
110* **`UC_BSD`**: _(non implemented)_ Unicode input under BSD. Not implemented at this time. If you're a BSD user and want to help add support for it, please [open an issue on GitHub](https://github.com/qmk/qmk_firmware/issues).
111
bdc8d89e 112* **`UC_WINC`**: Windows Unicode input using [WinCompose](https://github.com/samhocevar/wincompose). As of v0.9.0, supports code points up to `0x10FFFF` (all possible code points).
fdeec296
DJ
113
114 To enable, install the [latest release](https://github.com/samhocevar/wincompose/releases/latest). Once installed, WinCompose will automatically run on startup. Works reliably under all version of Windows supported by the app.
bdc8d89e 115 By default, this mode uses right Alt (`KC_RALT`) as the Compose key, but this can be changed in the WinCompose settings and by defining [`UNICODE_KEY_WINC`](#input-key-configuration) with another keycode.
fdeec296
DJ
116
117### Switching Input Modes
118
119There are two ways to set the input mode for Unicode: by keycode or by function. Keep in mind that both methods write to persistent storage (EEPROM), and are loaded each time the keyboard starts. So once you've set it the first time, you don't need to set it again unless you want to change it, or you've reset the EEPROM settings.
120
121You can switch the input mode at any time by using one of the following keycodes. The easiest way is to add the ones you use to your keymap.
122
bdc8d89e
123|Keycode |Alias |Input Mode |Description |
124|----------------------|---------|------------|--------------------------------------------------------------|
125|`UNICODE_MODE_FORWARD`|`UC_MOD` |Next in list|[Cycle](#input-mode-cycling) through selected modes |
126|`UNICODE_MODE_REVERSE`|`UC_RMOD`|Prev in list|[Cycle](#input-mode-cycling) through selected modes in reverse|
127|`UNICODE_MODE_OSX` |`UC_M_OS`|`UC_OSX` |Switch to macOS input |
128|`UNICODE_MODE_LNX` |`UC_M_LN`|`UC_LNX` |Switch to Linux input |
129|`UNICODE_MODE_WIN` |`UC_M_WI`|`UC_WIN` |Switch to Windows input |
130|`UNICODE_MODE_BSD` |`UC_M_BS`|`UC_BSD` |Switch to BSD input (not implemented) |
131|`UNICODE_MODE_WINC` |`UC_M_WC`|`UC_WINC` |Switch to Windows input using WinCompose |
fdeec296 132
bdc8d89e 133You can also switch the input mode by calling `set_unicode_input_mode(x)` in your code, where _x_ is one of the above input mode constants (e.g. `UC_LNX`). Since the function only needs to be called once, it's recommended that you do it in `eeconfig_init_user()` (or a similar function). For example:
995c3141 134
f86c0c26 135```c
fdeec296 136void eeconfig_init_user(void) {
ecf0612c 137 set_unicode_input_mode(UC_LNX);
f86c0c26 138}
fdeec296 139```
f86c0c26 140
fdeec296 141### Audio Feedback
f86c0c26 142
fdeec296
DJ
143If you have the [Audio feature](feature_audio.md) enabled on the board, you can set melodies to be played when you press the above keys. That way you can have some audio feedback when switching input modes.
144
145For instance, you can add these definitions to your `config.h` file:
146
147```c
148#define UNICODE_SONG_OSX COIN_SOUND
149#define UNICODE_SONG_LNX UNICODE_LINUX
150#define UNICODE_SONG_BSD MARIO_GAMEOVER
151#define UNICODE_SONG_WIN UNICODE_WINDOWS
152#define UNICODE_SONG_WINC UNICODE_WINDOWS
f86c0c26
DJ
153```
154
fdeec296
DJ
155### Additional Customization
156
bdc8d89e 157Because Unicode is a large and versatile feature, there are a number of options you can customize to make it work better on your system.
fdeec296 158
bdc8d89e 159#### Start and Finish Input Functions
fdeec296
DJ
160
161The functions for starting and finishing Unicode input on your platform can be overridden locally. Possible uses include customizing input mode behavior if you don't use the default keys, or adding extra visual/audio feedback to Unicode input.
162
bdc8d89e 163* `void unicode_input_start(void)` – This sends the initial sequence that tells your platform to enter Unicode input mode. For example, it presses Ctrl+Shift+U on Linux and holds the Option key on macOS.
fdeec296 164* `void unicode_input_finish(void)` – This is called to exit Unicode input mode, for example by pressing Space or releasing the Option key.
f86c0c26 165
fdeec296 166You can find the default implementations of these functions in [`process_unicode_common.c`](https://github.com/qmk/qmk_firmware/blob/master/quantum/process_keycode/process_unicode_common.c).
995c3141 167
fdeec296 168#### Input Key Configuration
f86c0c26 169
bdc8d89e
170You can customize the keys used to trigger Unicode input for macOS, Linux and WinCompose by adding corresponding defines to your `config.h`. The default values match the platforms' default settings, so you shouldn't need to change this unless Unicode input isn't working, or you want to use a different key (e.g. in order to free up left or right Alt).
171
172|Define |Type |Default |Example |
173|------------------|----------|------------------|-------------------------------------------|
174|`UNICODE_KEY_OSX` |`uint8_t` |`KC_LALT` |`#define UNICODE_KEY_OSX KC_RALT` |
175|`UNICODE_KEY_LNX` |`uint16_t`|`LCTL(LSFT(KC_U))`|`#define UNICODE_KEY_LNX LCTL(LSFT(KC_E))`|
176|`UNICODE_KEY_WINC`|`uint8_t` |`KC_RALT` |`#define UNICODE_KEY_WINC KC_RGUI` |
177
178#### Input Mode Cycling
179
180You can choose which input modes are available for cycling through. By default, this is disabled. If you want to enable it, limiting it to just the modes you use makes sense. Note that the values in the list are comma-delimited.
fdeec296
DJ
181
182```c
bdc8d89e 183#define UNICODE_SELECTED_MODES UC_OSX, UC_LNX, UC_WIN, UC_WINC
fdeec296
DJ
184```
185
bdc8d89e 186You can cycle through the selected modes by using the `UC_MOD`/`UC_RMOD` keycodes, or by calling `cycle_unicode_input_mode(offset)` in your code (`offset` is how many modes to move forward by, so +1 corresponds to `UC_MOD`).
fdeec296 187
bdc8d89e 188By default, when the keyboard boots, it will initialize the input mode to the last one you used. You can disable this and make it start with the first mode in the list every time by adding the following to your `config.h`:
fdeec296
DJ
189
190```c
bdc8d89e 191#define UNICODE_CYCLE_PERSIST false
fdeec296 192```
995c3141 193
bdc8d89e
194!> Using `UNICODE_SELECTED_MODES` means you don't have to initially set the input mode in `matrix_init_user()` (or a similar function); the Unicode system will do that for you on startup. This has the added benefit of avoiding unnecessary writes to EEPROM.
195
f86c0c26
DJ
196## `send_unicode_hex_string`
197
198To type multiple characters for things like (ノಠ痊ಠ)ノ彡┻━┻, you can use `send_unicode_hex_string()` much like `SEND_STRING()` except you would use hex values separate by spaces.
f5ae3760
DJ
199For example, the table flip seen above would be `send_unicode_hex_string("0028 30CE 0CA0 75CA 0CA0 0029 30CE 5F61 253B 2501 253B")`
200
201There are many ways to get a hex code, but an easy one is [this site](https://r12a.github.io/app-conversion/). Just make sure to convert to hexadecimal, and that is your string.
202
f86c0c26 203## Additional Language Support
995c3141
JH
204
205In `quantum/keymap_extras/`, you'll see various language files - these work the same way as the alternative layout ones do. Most are defined by their two letter country/language code followed by an underscore and a 4-letter abbreviation of its name. `FR_UGRV` which will result in a `ù` when using a software-implemented AZERTY layout. It's currently difficult to send such characters in just the firmware.
206
f86c0c26 207## International Characters on Windows
995c3141 208
f86c0c26 209### AutoHotkey allows Windows users to create custom hotkeys among others.
995c3141 210
f86c0c26 211The method does not require Unicode support in the keyboard itself but depends instead of [AutoHotkey](https://autohotkey.com) running in the background.
995c3141
JH
212
213First you need to select a modifier combination that is not in use by any of your programs.
214CtrlAltWin is not used very widely and should therefore be perfect for this.
215There is a macro defined for a mod-tab combo `LCAG_T`.
216Add this mod-tab combo to a key on your keyboard, e.g.: `LCAG_T(KC_TAB)`.
217This makes the key behave like a tab key if pressed and released immediately but changes it to the modifier if used with another key.
218
219In the default script of AutoHotkey you can define custom hotkeys.
220
221 <^<!<#a::Send, ä
222 <^<!<#<+a::Send, Ä
223
224The hotkeys above are for the combination CtrlAltGui and CtrlAltGuiShift plus the letter a.
225AutoHotkey inserts the Text right of `Send, ` when this combination is pressed.
f86c0c26
DJ
226
227### US International
228
9b82a929 229If you enable the US International layout on the system, it will use punctuation to accent the characters.
f86c0c26
DJ
230
231For instance, typing "`a" will result in à.
232
233You can find details on how to enable this [here](https://support.microsoft.com/en-us/help/17424/windows-change-keyboard-layout).