Package com.yahoo.language.process
@ExportPackage
@PublicApi
package com.yahoo.language.process
-
ClassDescriptionDetermines the class of a given character.A chunker converts splits a text string into multiple smaller strings (chunks).An embedder converts a text string to a tensorRuntime that is injectable through
Embedderconstructor.Generates field values given an input text.A class which splits consecutive word character sequences into overlapping character n-grams.An immutable start index and length pairInvocationContext<SUBCLASS extends InvocationContext<SUBCLASS>>Context of an invocation of a component carrying out a processing task.Parameters to a linguistics operation.This interface provides NFKC normalization of Strings through the underlying linguistics library.Exception class indicating that a fatal error occured during linguistic processing.A segmenter splits a string into separate segments (such as words) without applying any further processing (such as stemming) on each segment.Immutable named lists of "special tokens" - strings which should override the normal tokenizer semantics and be tokenized into a single token.An immutable list of special tokens - strings which should override the normal tokenizer semantics and be tokenized into a single token.An immutable special tokenA list of strings which does not allow for duplicate elements.Interface providing stemming of single words.An enum of the stemming modes which can be requested.A single token produced by the tokenizer.Language-sensitive tokenization of a text string.List of token scripts (e.g. latin, japanese, chinese, etc.) which may warrant different linguistics treatment.An enumeration of token types.Interface for providers of text transformations such as accent removal.