Class CategorizationAnalyzerConfig

  • All Implemented Interfaces:
    ToXContent, ToXContentFragment

    public class CategorizationAnalyzerConfig
    extends java.lang.Object
    implements ToXContentFragment
    Configuration for the categorization analyzer. The syntax is a subset of what can be supplied to the _analyzeendpoint. To summarize, the first option is to specify the name of an out-of-the-box analyzer: "categorization_analyzer" : "standard" The second option is to specify a custom analyzer by combining the char_filters, tokenizer and token_filters fields. In turn, each of these can be specified as the name of an out-of-the-box one or as an object defining a custom one. For example: "char_filters" : [ "html_strip", { "type" : "pattern_replace", "pattern": "SQL: .*" } ], "tokenizer" : "thai", "token_filters" : [ "lowercase", { "type" : "pattern_replace", "pattern": "^[0-9].*" } ]