Interface Segmenter

All Known Implementing Classes:
SegmenterImpl

public interface Segmenter
A segmenter splits a string into separate segments (such as words) without applying any further processing (such as stemming) on each segment. This is useful when token processing should be done separately from segmentation, such as in linguistic processing of queries, where token processing depends on field settings in a specific schema, while segmentation only depends on language and happens before schema-specific processing.
Author:
Mathias Mølster Lidal
  • Method Summary

    Modifier and Type
    Method
    Description
    segment(String input, Language language)
    Returns a list of segments produced from a string.
  • Method Details

    • segment

      List<String> segment(String input, Language language)
      Returns a list of segments produced from a string.
      Parameters:
      input - the text to segment
      language - the language of the input text
      Returns:
      the resulting list of segments
      Throws:
      ProcessingException - if an exception is encountered during processing