Class CategorizationAnalyzerConfig

java.lang.Object
org.elasticsearch.client.ml.job.config.CategorizationAnalyzerConfig
All Implemented Interfaces:
org.elasticsearch.common.xcontent.ToXContent, org.elasticsearch.common.xcontent.ToXContentFragment

public class CategorizationAnalyzerConfig
extends java.lang.Object
implements org.elasticsearch.common.xcontent.ToXContentFragment
Configuration for the categorization analyzer. The syntax is a subset of what can be supplied to the _analyzeendpoint. To summarize, the first option is to specify the name of an out-of-the-box analyzer: "categorization_analyzer" : "standard" The second option is to specify a custom analyzer by combining the char_filters, tokenizer and token_filters fields. In turn, each of these can be specified as the name of an out-of-the-box one or as an object defining a custom one. For example: "char_filters" : [ "html_strip", { "type" : "pattern_replace", "pattern": "SQL: .*" } ], "tokenizer" : "thai", "token_filters" : [ "lowercase", { "type" : "pattern_replace", "pattern": "^[0-9].*" } ]