Analysis plugins
editAnalysis plugins
editAnalysis plugins extend Elasticsearch by adding new analyzers, tokenizers, token filters, or character filters to Elasticsearch.
Core analysis plugins
editThe core analysis plugins are:
- ICU
- Adds extended Unicode support using the ICU libraries, including better analysis of Asian languages, Unicode normalization, Unicode-aware case folding, collation support, and transliteration.
- Kuromoji
- Advanced analysis of Japanese using the Kuromoji analyzer.
- Nori
- Morphological analysis of Korean using the Lucene Nori analyzer.
- Phonetic
- Analyzes tokens into their phonetic equivalent using Soundex, Metaphone, Caverphone, and other codecs.
- SmartCN
- An analyzer for Chinese or mixed Chinese-English text. This analyzer uses probabilistic knowledge to find the optimal word segmentation for Simplified Chinese text. The text is first broken into sentences, then each sentence is segmented into words.
- Stempel
- Provides high quality stemming for Polish.
- Ukrainian
- Provides stemming for Ukrainian.
Community contributed analysis plugins
editA number of analysis plugins have been contributed by our community:
- IK Analysis Plugin (by Medcl)
- Pinyin Analysis Plugin (by Medcl)
- Vietnamese Analysis Plugin (by Duy Do)
- STConvert Analysis Plugin (by Medcl)