IMPORTANT: This documentation is no longer updated. Refer to Elastic's version policy and the latest documentation.

Normalization Token Filter

Several token filters are available that normalize the special characters of a particular language.

Arabic: arabic_normalization

German: german_normalization

Hindi: hindi_normalization

Indic: indic_normalization

Kurdish (Sorani): sorani_normalization

Persian: persian_normalization

Scandinavian: scandinavian_normalization, scandinavian_folding

Serbian: serbian_normalization (not yet released)
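
As an illustration of how one of these filters might be wired into an index, the following minimal sketch builds an index settings body with a custom analyzer that applies a normalization filter after lowercasing. The helper `build_index_settings`, the analyzer name `my_german_analyzer`, and the choice of the `standard` tokenizer with a `lowercase` filter are assumptions made for this example; only the filter names come from the list above.

```python
import json

# Filter names taken from the list above (the unreleased Serbian filter is left out).
NORMALIZATION_FILTERS = {
    "arabic_normalization",
    "german_normalization",
    "hindi_normalization",
    "indic_normalization",
    "sorani_normalization",
    "persian_normalization",
    "scandinavian_normalization",
    "scandinavian_folding",
}


def build_index_settings(analyzer_name, normalization_filter):
    """Return an index settings body with one custom analyzer.

    Hypothetical helper: the analyzer chain (standard tokenizer, lowercase,
    then the normalization filter) is an assumption, not a required layout.
    """
    if normalization_filter not in NORMALIZATION_FILTERS:
        raise ValueError("unknown normalization filter: " + normalization_filter)
    return {
        "settings": {
            "analysis": {
                "analyzer": {
                    analyzer_name: {
                        "type": "custom",
                        "tokenizer": "standard",
                        # Filters run in order: lowercase first, then normalize.
                        "filter": ["lowercase", normalization_filter],
                    }
                }
            }
        }
    }


# Print the JSON body that could be supplied when creating an index.
body = build_index_settings("my_german_analyzer", "german_normalization")
print(json.dumps(body, indent=2))
```

The printed JSON is intended as the request body for index creation. Swapping in another filter name from the list should give the equivalent analyzer for that language.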
