You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

452 lines
16 KiB

  1. # Unicode :id=unicode
  2. With a little help from your OS, practically any Unicode character can be input using your keyboard.
  3. ## Caveats :id=caveats
  4. There are some limitations to this feature. Because there is no "standard" method of Unicode input across all operating systems, each of them require their own setup process on both the host *and* in the firmware, which may involve installation of additional software. This also means Unicode input will not "just work" when the keyboard is plugged into another device.
  5. ## Usage :id=usage
  6. The core Unicode API can be used purely programmatically. However, there are also additional subsystems which build on top of it and come with keycodes to make things easier. See below for more details.
  7. Add the following to your keymap's `rules.mk`:
  8. ```make
  9. UNICODE_COMMON = yes
  10. ```
  11. ## Basic Configuration :id=basic-configuration
  12. Add the following to your `config.h`:
  13. |Define |Default |Description |
  14. |------------------------|------------------|--------------------------------------------------------------------------------|
  15. |`UNICODE_KEY_MAC` |`KC_LEFT_ALT` |The key to hold when beginning a Unicode sequence with the macOS input mode |
  16. |`UNICODE_KEY_LNX` |`LCTL(LSFT(KC_U))`|The key to tap when beginning a Unicode sequence with the Linux input mode |
  17. |`UNICODE_KEY_WINC` |`KC_RIGHT_ALT` |The key to hold when beginning a Unicode sequence with the WinCompose input mode|
  18. |`UNICODE_SELECTED_MODES`|`-1` |A comma separated list of input modes for cycling through |
  19. |`UNICODE_CYCLE_PERSIST` |`true` |Whether to persist the current Unicode input mode to EEPROM |
  20. |`UNICODE_TYPE_DELAY` |`10` |The amount of time to wait, in milliseconds, between Unicode sequence keystrokes|
  21. ### Audio Feedback :id=audio-feedback
  22. If you have the [Audio](feature_audio.md) feature enabled on your board, you can configure it to play sounds when the input mode is changed.
  23. Add the following to your `config.h`:
  24. |Define |Default|Description |
  25. |-------------------|-------|-----------------------------------------------------------|
  26. |`UNICODE_SONG_MAC` |*n/a* |The song to play when the macOS input mode is selected |
  27. |`UNICODE_SONG_LNX` |*n/a* |The song to play when the Linux input mode is selected |
  28. |`UNICODE_SONG_BSD` |*n/a* |The song to play when the BSD input mode is selected |
  29. |`UNICODE_SONG_WIN` |*n/a* |The song to play when the Windows input mode is selected |
  30. |`UNICODE_SONG_WINC`|*n/a* |The song to play when the WinCompose input mode is selected|
  31. ## Input Subsystems :id=input-subsystems
  32. Each of these subsystems have their own pros and cons in terms of flexibility and ease of use. Choose the one that best fits your needs.
  33. <!-- tabs:start -->
  34. ### ** Basic **
  35. This is the easiest to use, albeit somewhat limited. It supports code points up to `U+7FFF`, which covers characters for most modern languages (including East Asian), as well as many symbols, but does not include emoji.
  36. To enable Basic Unicode, add the following to your `rules.mk`:
  37. ```make
  38. UNICODE_ENABLE = yes
  39. ```
  40. You can then add `UC(c)` keycodes to your keymap, where *c* is the code point of the desired character (in hexadecimal - the `U+` prefix will not work). For example, `UC(0x40B)` will output [Ћ](https://unicode-table.com/en/040B/), and `UC(0x30C4)` will output [](https://unicode-table.com/en/30C4).
  41. ### ** Unicode Map **
  42. Unicode Map supports all possible code points (up to `U+10FFFF`). Here, the code points are stored in a separate mapping table (which may contain at most 16,384 entries), instead of directly in the keymap.
  43. To enable Unicode Map, add the following to your `rules.mk`:
  44. ```make
  45. UNICODEMAP_ENABLE = yes
  46. ```
  47. Then, you will need to create a mapping table in your `keymap.c`, and (optionally) an enum for naming the array indices, like so:
  48. ```c
  49. enum unicode_names {
  50. BANG,
  51. IRONY,
  52. SNEK
  53. };
  54. const uint32_t PROGMEM unicode_map[] = {
  55. [BANG] = 0x203D, // ‽
  56. [IRONY] = 0x2E2E, // ⸮
  57. [SNEK] = 0x1F40D, // 🐍
  58. };
  59. ```
  60. Finally, add `UM(i)` keycodes to your keymap, where *i* is an index into the `unicode_map[]` array. If you defined the enum above, you can use those names instead, for example `UM(BANG)` or `UM(SNEK)`.
  61. #### Lower and Upper Case Pairs :id=unicodemap-pairs
  62. Some writing systems have lowercase and uppercase variants of each character, such as å and Å. To make inputting these characters easier, you can use the `UP(i, j)` keycode in your keymap, where *i* and *j* are the mapping table indices of the lowercase and uppercase characters, respectively. If you're holding down Shift or have Caps Lock turned on when you press the key, the uppercase character will be inserted; otherwise, the lowercase character will be inserted.
  63. ```c
  64. const uint32_t PROGMEM unicode_map[] = {
  65. [AE_LOWER] = 0x00E6, // æ
  66. [AE_UPPER] = 0x00C6, // Æ
  67. };
  68. ```
  69. This is most useful when creating a keymap for an international layout with special characters. Instead of having to put the lower and upper case versions of a character on separate keys, you can have them both on the same key. This helps blend Unicode keys in with regular keycodes.
  70. Due to keycode size constraints, *i* and *j* can each only refer to one of the first 128 characters in your `unicode_map`. In other words, 0 ≤ *i* ≤ 127 and 0 ≤ *j* ≤ 127.
  71. ### ** UCIS **
  72. As with Unicode Map, the UCIS method also supports all possible code points, and requires the use of a mapping table. However, it works much differently - Unicode characters are input by replacing a typed mnemonic.
  73. To enable UCIS, add the following to your keymap's `rules.mk`:
  74. ```make
  75. UCIS_ENABLE = yes
  76. ```
  77. Then, create a mapping table in your `keymap.c`:
  78. ```c
  79. const ucis_symbol_t ucis_symbol_table[] = UCIS_TABLE(
  80. UCIS_SYM("poop", 0x1F4A9), // 💩
  81. UCIS_SYM("rofl", 0x1F923), // 🤣
  82. UCIS_SYM("ukr", 0x1F1FA, 0x1F1E6), // 🇺🇦
  83. UCIS_SYM("look", 0x0CA0, 0x005F, 0x0CA0) // ಠ_ಠ
  84. );
  85. ```
  86. By default, each table entry may be up to three code points long. This can be changed by adding `#define UCIS_MAX_CODE_POINTS n` to your keymap's `config.h`.
  87. To invoke UCIS input, the `ucis_start()` function must first be called (for example, in a custom "Unicode" keycode). Then, type the mnemonic for the mapping table entry (such as "rofl"), and hit Space or Enter. The "rofl" text will be backspaced and the emoji inserted.
  88. <!-- tabs:end -->
  89. ## Input Modes :id=input-modes
  90. Unicode input works by typing a sequence of characters, similar to a macro. However, since this sequence depends on your OS, you will need to prepare both your host machine and QMK to recognise and send the correct Unicode input sequences respectively.
  91. To set the list of enabled input modes, add the `UNICODE_SELECTED_MODES` define to your keymap's `config.h`, for example:
  92. ```c
  93. #define UNICODE_SELECTED_MODES UNICODE_MODE_LINUX
  94. // or
  95. #define UNICODE_SELECTED_MODES UNICODE_MODE_MACOS, UNICODE_MODE_WINCOMPOSE
  96. ```
  97. These modes can then be cycled through using the `UC_NEXT` and `UC_PREV` keycodes. You can also switch to any input mode, even if it is not specified in `UNICODE_SELECTED_MODES`, using their respective keycodes.
  98. If your keyboard has working EEPROM, it will remember the last used input mode and continue using it on the next power up. This can be disabled by defining `UNICODE_CYCLE_PERSIST` to `false`.
  99. <!-- tabs:start -->
  100. ### ** macOS **
  101. **Mode Name:** `UNICODE_MODE_MACOS`
  102. macOS has built-in support for Unicode input as its own input source. It supports all possible code points by way of surrogate pairs for code points above `U+FFFF`.
  103. To enable, go to **System Preferences → Keyboard → Input Sources**, then add Unicode Hex Input to the list (under Other), and activate it from the input dropdown in the menu bar. Note that this may disable some Option-based shortcuts such as Option+Left and Option+Right.
  104. ### ** Linux (IBus) **
  105. **Mode Name:** `UNICODE_MODE_LINUX`
  106. For Linux distros with IBus, Unicode input is enabled by default, supports all possible code points, and works almost anywhere. Without IBus, it works under GTK apps, but rarely anywhere else.
  107. Users who would like support in non-GTK apps without IBus may need to resort to a more indirect method, such as creating a custom keyboard layout.
  108. ### ** Windows (WinCompose) **
  109. **Mode Name:** `UNICODE_MODE_WINCOMPOSE`
  110. This mode requires a third-party tool called [WinCompose](https://github.com/samhocevar/wincompose). It supports all possible code points, and is the recommended input mode for Windows.
  111. To enable, install the [latest release from GitHub](https://github.com/samhocevar/wincompose/releases/latest). Once installed, it will automatically run on startup. This works reliably under all versions of Windows supported by WinCompose.
  112. ### ** Windows (HexNumpad) **
  113. **Mode Name:** `UNICODE_MODE_WINDOWS`
  114. !> This input mode is *not* the "Alt code" system. Alt codes are not Unicode; they instead follow [the Windows-1252 character set](https://en.wikipedia.org/wiki/Alt_code).
  115. This is Windows' built-in hex numpad Unicode input mode. It only supports code points up to `U+FFFF`, and is not recommended due to reliability and compatibility issues.
  116. To enable, run the following as an administrator, then reboot:
  117. ```
  118. reg add "HKCU\Control Panel\Input Method" -v EnableHexNumpad -t REG_SZ -d 1
  119. ```
  120. ### ** Emacs **
  121. **Mode Name:** `UNICODE_MODE_EMACS`
  122. Emacs supports code point input with the `insert-char` command.
  123. ### ** BSD **
  124. **Mode Name:** `UNICODE_MODE_BSD`
  125. Not currently implemented. If you're a BSD user and want to contribute support for this input mode, please [feel free](contributing.md)!
  126. <!-- tabs:end -->
  127. ## Keycodes :id=keycodes
  128. |Key |Aliases |Description |
  129. |----------------------------|---------|----------------------------------------------------------------|
  130. |`UC(c)` | |Send Unicode code point `c`, up to `0x7FFF` |
  131. |`UM(i)` | |Send Unicode code point at index `i` in `unicode_map` |
  132. |`UP(i, j)` | |Send Unicode code point at index `i`, or `j` if Shift/Caps is on|
  133. |`QK_UNICODE_MODE_NEXT` |`UC_NEXT`|Cycle through selected input modes |
  134. |`QK_UNICODE_MODE_PREVIOUS` |`UC_PREV`|Cycle through selected input modes in reverse |
  135. |`QK_UNICODE_MODE_MACOS` |`UC_MAC` |Switch to macOS input |
  136. |`QK_UNICODE_MODE_LINUX` |`UC_LINX`|Switch to Linux input |
  137. |`QK_UNICODE_MODE_WINDOWS` |`UC_WIN` |Switch to Windows input |
  138. |`QK_UNICODE_MODE_BSD` |`UC_BSD` |Switch to BSD input (not implemented) |
  139. |`QK_UNICODE_MODE_WINCOMPOSE`|`UC_WINC`|Switch to Windows input using WinCompose |
  140. |`QK_UNICODE_MODE_EMACS` |`UC_EMAC`|Switch to emacs (`C-x-8 RET`) |
  141. ## API :id=api
  142. ### `uint8_t get_unicode_input_mode(void)` :id=api-get-unicode-input-mode
  143. Get the current Unicode input mode.
  144. #### Return Value :id=api-get-unicode-input-mode-return-value
  145. The currently active Unicode input mode.
  146. ---
  147. ### `void set_unicode_input_mode(uint8_t mode)` :id=api-set-unicode-input-mode
  148. Set the Unicode input mode.
  149. #### Arguments :id=api-set-unicode-input-mode-arguments
  150. - `uint8_t mode`
  151. The input mode to set.
  152. ---
  153. ### `void unicode_input_mode_step(void)` : id=api-unicode-input-mode-step
  154. Change to the next Unicode input mode.
  155. ---
  156. ### `void unicode_input_mode_step_reverse(void)` : id=api-unicode-input-mode-step-reverse
  157. Change to the previous Unicode input mode.
  158. ---
  159. ### `void unicode_input_mode_set_user(uint8_t input_mode)` :id=api-unicode-input-mode-set-user
  160. User-level callback, invoked when the input mode is changed.
  161. #### Arguments :id=api-unicode-input-mode-set-user-arguments
  162. - `uint8_t input_mode`
  163. The new input mode.
  164. ---
  165. ### `void unicode_input_mode_set_kb(uint8_t input_mode)` :id=api-unicode-input-mode-set-kb
  166. Keyboard-level callback, invoked when the input mode is changed.
  167. #### Arguments :id=api-unicode-input-mode-set-kb-arguments
  168. - `uint8_t input_mode`
  169. The new input mode.
  170. ---
  171. ### `void unicode_input_start(void)` :id=api-unicode-input-start
  172. Begin the Unicode input sequence. The exact behavior depends on the currently selected input mode:
  173. - **macOS**: Hold `UNICODE_KEY_MAC`
  174. - **Linux**: Tap `UNICODE_KEY_LNX`
  175. - **WinCompose**: Tap `UNICODE_KEY_WINC`, then U
  176. - **HexNumpad**: Hold Left Alt, then tap Numpad +
  177. - **Emacs**: Tap Ctrl+X, then 8, then Enter
  178. This function is weakly defined, and can be overridden in user code.
  179. ---
  180. ### `void unicode_input_finish(void)` :id=api-unicode-input-finish
  181. Complete the Unicode input sequence. The exact behavior depends on the currently selected input mode:
  182. - **macOS**: Release `UNICODE_KEY_MAC`
  183. - **Linux**: Tap Space
  184. - **WinCompose**: Tap Enter
  185. - **HexNumpad**: Release Left Alt
  186. - **Emacs**: Tap Enter
  187. This function is weakly defined, and can be overridden in user code.
  188. ---
  189. ### `void unicode_input_cancel(void)` :id=api-unicode-input-cancel
  190. Cancel the Unicode input sequence. The exact behavior depends on the currently selected input mode:
  191. - **macOS**: Release `UNICODE_KEY_MAC`
  192. - **Linux**: Tap Escape
  193. - **WinCompose**: Tap Escape
  194. - **HexNumpad**: Release Left Alt
  195. - **Emacs**: Tap Ctrl+G
  196. This function is weakly defined, and can be overridden in user code.
  197. ---
  198. ### `void register_unicode(uint32_t code_point)` :id=api-register-unicode
  199. Input a single Unicode character. A surrogate pair will be sent if required by the input mode.
  200. #### Arguments :id=api-register-unicode-arguments
  201. - `uint32_t code_point`
  202. The code point of the character to send.
  203. ---
  204. ### `void send_unicode_string(const char *str)` :id=api-send-unicode-string
  205. Send a string containing Unicode characters.
  206. #### Arguments :id=api-send-unicode-string-arguments
  207. - `const char *str`
  208. The string to send.
  209. ---
  210. ### `uint8_t unicodemap_index(uint16_t keycode)` :id=api-unicodemap-index
  211. Get the index into the `unicode_map` array for the given keycode, respecting shift state for pair keycodes.
  212. #### Arguments :id=api-unicodemap-index-arguments
  213. - `uint16_t keycode`
  214. The Unicode Map keycode to get the index of.
  215. #### Return Value :id=api-unicodemap-index-return-value
  216. An index into the `unicode_map` array.
  217. ---
  218. ### `uint32_t unicodemap_get_code_point(uint8_t index)` :id=api-unicodemap-get-code-point
  219. Get the code point for the given index in the `unicode_map` array.
  220. #### Arguments :id=unicodemap-get-code-point-arguments
  221. - `uint8_t index`
  222. The index into the `unicode_map` array.
  223. #### Return Value :id=unicodemap-get-code-point-return-value
  224. A Unicode code point value.
  225. ---
  226. ### `void register_unicodemap(uint8_t index)` :id=api-register-unicodemap
  227. Send the code point for the given index in the `unicode_map` array.
  228. #### Arguments :id=api-register-unicodemap-arguments
  229. - `uint8_t index`
  230. The index into the `unicode_map` array.
  231. ---
  232. ### `void ucis_start(void)` :id=api-ucis-start
  233. Begin the input sequence.
  234. ---
  235. ### `bool ucis_active(void)` :id=api-ucis-active
  236. Whether UCIS is currently active.
  237. #### Return Value :id=api-ucis-active-return-value
  238. `true` if UCIS is active.
  239. ---
  240. ### `uint8_t ucis_count(void)` :id=api-ucis-count
  241. Get the number of characters in the input sequence buffer.
  242. #### Return Value :id=api-ucis-count-return-value
  243. The current input sequence buffer length.
  244. ---
  245. ### `bool ucis_add(uint16_t keycode)` :id=api-ucis-add
  246. Add the given keycode to the input sequence buffer.
  247. #### Arguments :id=api-ucis-add-arguments
  248. - `uint16_t keycode`
  249. The keycode to add. Must be between `KC_A` and `KC_Z`, or `KC_1` and `KC_0`.
  250. #### Return Value :id=api-ucis-add-return-value
  251. `true` if the keycode was added.
  252. ---
  253. ### `bool ucis_remove_last(void)` :id=api-ucis-remove-last
  254. Remove the last character from the input sequence buffer.
  255. #### Return Value :id=api-ucis-remove-last
  256. `true` if the sequence was not empty.
  257. ---
  258. ### `void ucis_finish(void)` :id=api-ucis-finish
  259. Mark the input sequence as complete, and attempt to match.
  260. ---
  261. ### `void ucis_cancel(void)` :id=api-ucis-cancel
  262. Cancel the input sequence.
  263. ---
  264. ### `void register_ucis(void)` :id=api-register-ucis
  265. Send the code point(s) for the given UCIS index.
  266. #### Arguments :id=api-register-ucis-arguments
  267. - `uint8_t index`
  268. The index into the UCIS symbol table.