IMPORTANT: No additional bug fixes or documentation updates will be released for this version. For the latest information, see the current release documentation.

« Token filter reference ASCII folding token filter »

› › ›

Apostrophe token filter

edit

IMPORTANT: This documentation is no longer updated. Refer to Elastic's version policy and the latest documentation.

Apostrophe token filter

edit

Strips all characters after an apostrophe, including the apostrophe itself.

This filter is included in Elasticsearch’s built-in Turkish language analyzer. It uses Lucene’s ApostropheFilter, which was built for the Turkish language.

Example

edit

The following analyze API request demonstrates how the apostrophe token filter works.

response = client.indices.analyze(
  body: {
    tokenizer: 'standard',
    filter: [
      'apostrophe'
    ],
    text: "Istanbul'a veya Istanbul'dan"
  }
)
puts response

GET /_analyze
{
  "tokenizer" : "standard",
  "filter" : ["apostrophe"],
  "text" : "Istanbul'a veya Istanbul'dan"
}

The filter produces the following tokens:

[ Istanbul, veya, Istanbul ]

Add to an analyzer

edit

The following create index API request uses the apostrophe token filter to configure a new custom analyzer.

response = client.indices.create(
  index: 'apostrophe_example',
  body: {
    settings: {
      analysis: {
        analyzer: {
          standard_apostrophe: {
            tokenizer: 'standard',
            filter: [
              'apostrophe'
            ]
          }
        }
      }
    }
  }
)
puts response

PUT /apostrophe_example
{
  "settings": {
    "analysis": {
      "analyzer": {
        "standard_apostrophe": {
          "tokenizer": "standard",
          "filter": [ "apostrophe" ]
        }
      }
    }
  }
}

« Token filter reference ASCII folding token filter »