Class CustomWordEmbedding

java.lang.Object
org.elasticsearch.client.ml.inference.preprocessing.CustomWordEmbedding
All Implemented Interfaces:
NamedXContentObject, PreProcessor, org.elasticsearch.common.xcontent.ToXContent, org.elasticsearch.common.xcontent.ToXContentObject

public class CustomWordEmbedding
extends java.lang.Object
implements PreProcessor
A pre-processor that embeds text into a numerical vector. It computes a set of features from the text's script type, n-gram hashes, and most common script values. These features are then combined, using specific quantization scales and weights, into a vector of length 80. This is a fork and a port of: https://github.com/google/cld3/blob/06f695f1c8ee530104416aab5dcf2d6a1414a56a/src/embedding_network.cc
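To give a rough feel for what an "n-gram hash" feature can look like, the following is a minimal, self-contained sketch. It is not the code used by this class or by CLD3: the class name NGramHashSketch, the method hashedNGramWeights, the bucket count, and the use of String.hashCode() as the hash function are all assumptions made for illustration. It slides a window of n characters over the text, hashes each n-gram into a fixed number of buckets, and records each bucket's relative frequency.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not part of the Elasticsearch client.
class NGramHashSketch {

    /**
     * Hashes every character n-gram of the text into one of numBuckets ids
     * and returns a map of bucket id -> relative frequency of that bucket.
     * String.hashCode() is an assumed stand-in for the real hash function,
     * and the window moves over UTF-16 code units, not Unicode code points.
     */
    static Map<Integer, Double> hashedNGramWeights(String text, int n, int numBuckets) {
        Map<Integer, Double> weights = new TreeMap<>();
        int total = text.length() - n + 1;
        if (total <= 0) {
            return weights; // text shorter than n: no features
        }
        for (int i = 0; i < total; i++) {
            String ngram = text.substring(i, i + n);
            // floorMod keeps the bucket id non-negative for negative hash codes
            int bucket = Math.floorMod(ngram.hashCode(), numBuckets);
            weights.merge(bucket, 1.0 / total, Double::sum);
        }
        return weights;
    }

    public static void main(String[] args) {
        // Bigrams of "banana": ba, an, na, an, na
        System.out.println(hashedNGramWeights("banana", 2, 1000));
    }
}
```

In a setup like the one described above, each bucket id would presumably select a row of an embedding weight table, with the relative frequency acting as that row's weight. How this class actually performs that lookup, and how the quantization scales are applied, is not stated on this page.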
  • Nested Class Summary

    Nested classes/interfaces inherited from interface org.elasticsearch.common.xcontent.ToXContent

    org.elasticsearch.common.xcontent.ToXContent.DelegatingMapParams, org.elasticsearch.common.xcontent.ToXContent.MapParams, org.elasticsearch.common.xcontent.ToXContent.Params
  • Field Summary

    Fields
    Modifier and Type Field Description
    static java.lang.String NAME  
    static org.elasticsearch.common.xcontent.ConstructingObjectParser<CustomWordEmbedding,java.lang.Void> PARSER

    Fields inherited from interface org.elasticsearch.common.xcontent.ToXContent

    EMPTY_PARAMS
  • Method Summary

    Modifier and Type Method Description
    boolean equals(java.lang.Object o)
    static CustomWordEmbedding fromXContent(org.elasticsearch.common.xcontent.XContentParser parser)
    java.lang.String getName()
    int hashCode()
    org.elasticsearch.common.xcontent.XContentBuilder toXContent(org.elasticsearch.common.xcontent.XContentBuilder builder, org.elasticsearch.common.xcontent.ToXContent.Params params)

    Methods inherited from class java.lang.Object

    clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait

    Methods inherited from interface org.elasticsearch.common.xcontent.ToXContentObject

    isFragment
  • Field Details

    • NAME

      public static final java.lang.String NAME
      See Also:
      Constant Field Values
    • PARSER

      public static final org.elasticsearch.common.xcontent.ConstructingObjectParser<CustomWordEmbedding,java.lang.Void> PARSER
  • Method Details

    • fromXContent

      public static CustomWordEmbedding fromXContent(org.elasticsearch.common.xcontent.XContentParser parser)
    • getName

      public java.lang.String getName()
      Specified by:
      getName in interface NamedXContentObject
      Specified by:
      getName in interface PreProcessor
      Returns:
      The name of the pre-processor
    • toXContent

      public org.elasticsearch.common.xcontent.XContentBuilder toXContent(org.elasticsearch.common.xcontent.XContentBuilder builder, org.elasticsearch.common.xcontent.ToXContent.Params params) throws java.io.IOException
      Specified by:
      toXContent in interface org.elasticsearch.common.xcontent.ToXContent
      Throws:
      java.io.IOException
    • equals

      public boolean equals(java.lang.Object o)
      Overrides:
      equals in class java.lang.Object
    • hashCode

      public int hashCode()
      Overrides:
      hashCode in class java.lang.Object