User:Isaac914/uim/Setting up Japanese Input in UIM
This page discusses how to set up Japanese input using the UIM input framework.
You need the following packages to input Japanese.
- Japanese fonts
- Japanese input method (Kana to Kanji conversion engine)
- Input method framework: uim
Recommended Japanese fonts are as follows.
- adobe-source-han-sans
- An open source set of OTF fonts developed by Adobe.
Serif and Sans-serif
- IPA fonts
- An open source OTF font set including sans-serif (Gothic) and serif (Mincho) glyphs, provided by the Information-technology Promotion Agency, Japan (IPA).
If you want to show 2channel Shift JIS art properly, use one of the following fonts:
- ipamona font (available in the AUR)
- Monapo font (available in the AUR)
Anthy is one of the most popular Japanese input methods in the open source world. However, the original project has not been maintained for a long time. Debian has maintained it since May 2010.
Install Anthy from the official repositories.
Anthy's default dictionary lacks several characters that are not defined in EUC-JP (JIS X 0208), such as "①" and "♥". alt-cannadic provides extra dictionaries that include those characters.
Get the alt-cannadic dictionary and put its extra dictionaries under your Anthy import directory:
$ tar jxvf alt-cannadic-091230.tar.bz2
$ mkdir -p ~/.anthy/imported_words_default.d
$ cp alt-cannadic-091230/extra/*.t ~/.anthy/imported_words_default.d/
See the official wiki (in Japanese) for more details.
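The copy step above can also be wrapped in a small helper so that it is safe to run repeatedly. This is only a sketch: `install_extra_dicts` is a hypothetical function name, and the default destination is assumed to be the import directory used in the commands above.

```shell
#!/bin/sh
# Sketch: copy alt-cannadic's extra dictionaries (*.t) into Anthy's import directory.
# install_extra_dicts is a hypothetical helper, not part of Anthy or alt-cannadic.
install_extra_dicts() {
    src_dir=$1    # e.g. alt-cannadic-091230/extra
    dest_dir=${2:-$HOME/.anthy/imported_words_default.d}

    # Create the destination if it does not exist yet (no error if it does).
    mkdir -p "$dest_dir" || return 1

    copied=0
    for dict in "$src_dir"/*.t; do
        [ -f "$dict" ] || continue    # skip when the glob matched nothing
        cp "$dict" "$dest_dir"/ || return 1
        copied=$((copied + 1))
    done

    # Print the number of dictionary files copied.
    echo "$copied"
}

# Example call (assumes the tarball has been extracted in the current directory):
# install_extra_dicts alt-cannadic-091230/extra
```

The function prints how many `.t` files it copied, so a result of `0` suggests the source path is wrong.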
Modified Anthy (anthy-ut)
Modified Anthy is a set of patches and large extended dictionaries that aim to improve the Kana to Kanji conversion quality of the original Anthy.
Modified Anthy consists of two different upstreams. Note the following:
- Modified Anthy applies only to Anthy (UTF-8), so you have to choose Anthy (UTF-8) as the default input method in uim.
- Its dictionaries and learning data are not compatible with those of the original Anthy.
Compiling modified Anthy using PKGBUILD
Modified Anthy is available in the AUR as anthy-ut.
Download the anthy-ut tarball and run makepkg to build and install the package:
$ wget https://aur.archlinux.org/packages/anthy-ut/anthy-ut.tar.gz
$ tar xvf anthy-ut.tar.gz
$ cd anthy-ut
$ makepkg -s -i
If you already use the original Anthy, you have to convert the existing learning data to the new format:
$ rm ~/.anthy/last-record1_*.bin
$ anthy-agent --update-base-record
$ rm ~/.anthy/last-record1_*.bin
$ anthy-agent --update-base-record
(The same commands are intentionally run twice; this is not a typo.)
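Because the conversion deletes files under `~/.anthy` and the resulting data is not compatible with the original Anthy, it may be worth copying the directory first. The following is a minimal sketch; `backup_anthy_dir` is a hypothetical helper name, and the timestamped `.bak.` suffix is an arbitrary choice for illustration.

```shell
#!/bin/sh
# Sketch: back up the Anthy user directory before converting the learning data.
# backup_anthy_dir is a hypothetical helper; it only copies files and deletes nothing.
backup_anthy_dir() {
    anthy_dir=${1:-$HOME/.anthy}
    anthy_dir=${anthy_dir%/}    # drop a trailing slash, if any

    if [ ! -d "$anthy_dir" ]; then
        echo "no such directory: $anthy_dir" >&2
        return 1
    fi

    backup_dir="$anthy_dir.bak.$(date +%Y%m%d%H%M%S)"
    if [ -e "$backup_dir" ]; then
        echo "backup already exists: $backup_dir" >&2
        return 1
    fi

    # Copy the whole directory tree, then print where the copy was written.
    cp -R "$anthy_dir" "$backup_dir" || return 1
    echo "$backup_dir"
}

# Example call, to be run before the rm / anthy-agent commands:
# backup_anthy_dir
```

If the conversion goes wrong, the printed backup path can be copied back over `~/.anthy` to return to the previous state.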
Anthy Kaomoji is a modified version of Anthy that converts Hiragana text to mixed Kana and Kanji text and includes emoticon (顔文字) and 2ch dictionaries. It can be found in the AUR.
Although Mozc itself supports only the ibus input method framework, macuim provides a uim-mozc plugin.
- Mozc (Vanilla)
It is available in the AUR.
You can also install it from an unofficial user repository. Add the following to your /etc/pacman.conf:
[pnsft-pur]
SigLevel = Optional TrustAll
Server = http://downloads.sourceforge.net/project/pnsft-aur/pur/$arch
Then refresh the package database:
# pacman -Syy
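If the repository entry is added from a script rather than by hand, it helps to check first whether the section is already present, so that it is not appended twice. The sketch below assumes the section header sits on a line of its own with no surrounding whitespace; `has_repo` and `add_pnsft_repo` are hypothetical helper names.

```shell
#!/bin/sh
# Sketch: append the [pnsft-pur] section to a pacman.conf-style file only once.
# has_repo and add_pnsft_repo are hypothetical helpers, not pacman commands.

# Succeeds if the file ($1) has a line that is exactly "[name]" ($2).
has_repo() {
    grep -q -x -F "[$2]" "$1"
}

add_pnsft_repo() {
    conf=${1:-/etc/pacman.conf}    # writing to the default path requires root

    if has_repo "$conf" pnsft-pur; then
        echo "already present"
        return 0
    fi

    # The quoted 'EOF' keeps $arch literal so that pacman expands it later.
    cat >> "$conf" <<'EOF'

[pnsft-pur]
SigLevel = Optional TrustAll
Server = http://downloads.sourceforge.net/project/pnsft-aur/pur/$arch
EOF
    echo "added"
}

# Example call (as root):
# add_pnsft_repo /etc/pacman.conf
```

Running the function a second time prints `already present` and leaves the file unchanged.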
You can install the packages by specifying the group name as follows:
# pacman -S mozc-im
Or, specify package names directly. For example:
# pacman -S uim-mozc
- mozc-ut and mozc-svn
mozc-ut and mozc-svn, both available in the AUR, can be built with uim-mozc.
To build uim-mozc, edit the PKGBUILD as follows, i.e. uncomment the uim option:
## If you will not be using ibus, comment out below.
_ibus_mozc="yes"

## If you will be using uim, uncomment below.
_uim_mozc="yes"

## If applying patch for uim-mozc fails, try to uncomment below.
#_kill_kill_line="yes"
## This will disable the 'kill-line' function of uim-mozc.
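The PKGBUILD edit can also be scripted. The sketch below assumes the unedited PKGBUILD contains the option as a commented-out line of the form `#_uim_mozc="yes"`; that exact form is an assumption, and `enable_uim_mozc` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch: uncomment the _uim_mozc option in a PKGBUILD.
# Assumes the option appears as '#_uim_mozc="yes"' (optionally with spaces
# after the '#'). enable_uim_mozc is a hypothetical helper name.
enable_uim_mozc() {
    pkgbuild=${1:-PKGBUILD}
    tmp_file="$pkgbuild.tmp.$$"

    # Remove one leading '#' in front of the option. Explanatory comment lines
    # starting with '##' do not match the pattern and are left alone.
    sed 's/^#[[:space:]]*\(_uim_mozc="yes"\)/\1/' "$pkgbuild" > "$tmp_file" || return 1
    mv "$tmp_file" "$pkgbuild" || return 1

    # Succeed only if an active (uncommented) setting is now present.
    grep -q '^_uim_mozc="yes"' "$pkgbuild"
}

# Example call, from the directory that contains the PKGBUILD:
# enable_uim_mozc PKGBUILD && makepkg -s -i
```

The function returns a non-zero status when no active `_uim_mozc="yes"` line exists after the edit, which makes it easy to stop before running makepkg.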
- Registering Mozc
# uim-module-manager --register mozc
Google CGI API for Japanese input
Google CGI API for Japanese Input (Google-CGIAPI-Jp) is a CGI service from Google that provides Japanese conversion over the Internet. It can be used from a web browser. Its conversion engine seems to be equivalent to Google Japanese Input, so its conversion quality is probably better than that of Mozc.
You can use it via uim. Choose "Google-CGIAPI-Jp" in uim-im-switcher-gtk/gtk3/qt4 or uim-pref-gtk/gtk3/qt4.