WARNING: Version 1.3 of Elasticsearch has passed its EOL date.
This documentation is no longer being maintained and may be removed. If you are running this version, we strongly advise you to upgrade. For the latest information, see the current release documentation.
CJK Width Token Filter
The cjk_width token filter normalizes CJK width differences:
- Folds fullwidth ASCII variants into the equivalent basic Latin
- Folds halfwidth Katakana variants into the equivalent Kana
This token filter can be viewed as a subset of NFKC/NFKD Unicode normalization. See the ICU Analysis Plugin for full normalization support.
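As a rough illustration of the two folds listed above and of the "subset of NFKC" relationship, the following Python sketch applies NFKC normalization only to runs of fullwidth ASCII and halfwidth Katakana characters and leaves everything else untouched. It is not the filter's actual implementation: the function name fold_cjk_width and the code point ranges (U+FF01 to U+FF5E and U+FF65 to U+FF9F) are assumptions chosen for this example, and edge cases such as a halfwidth voiced sound mark following a regular-width Kana may behave differently in the real filter.

```python
import re
import unicodedata

# Assumed ranges for this sketch:
#   U+FF01-U+FF5E  fullwidth ASCII variants
#   U+FF65-U+FF9F  halfwidth Katakana variants
_WIDTH_VARIANTS = re.compile("[\uFF01-\uFF5E\uFF65-\uFF9F]+")


def fold_cjk_width(text: str) -> str:
    """Approximate width folding: NFKC-normalize only the width-variant runs."""
    return _WIDTH_VARIANTS.sub(
        lambda m: unicodedata.normalize("NFKC", m.group(0)), text
    )


# Fullwidth ASCII variants fold to basic Latin.
print(fold_cjk_width("Ｅｌａｓｔｉｃ １．３"))  # Elastic 1.3

# Halfwidth Katakana variants fold to regular Kana.
print(fold_cjk_width("ｶﾀｶﾅ"))  # カタカナ

# Full NFKC changes more than width; the restricted fold leaves this alone.
square_meter = "\u334D"  # a compatibility character outside the ranges above
print(fold_cjk_width(square_meter) == square_meter)  # True
print(unicodedata.normalize("NFKC", square_meter) == square_meter)  # False
```

The last two lines show why full normalization is a superset: NFKC also rewrites compatibility characters that have nothing to do with width, which this restricted fold does not touch.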