Class PathHierarchyTokenizer

  • All Implemented Interfaces:
    Closeable, AutoCloseable

    public class PathHierarchyTokenizer
    extends org.apache.lucene.analysis.Tokenizer
    Tokenizer for path-like hierarchies.

    Take something like:

      /something/something/else
     
    and make:
      /something
      /something/something
      /something/something/else
     
    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource

        org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static char DEFAULT_DELIMITER  
      static int DEFAULT_SKIP  
      • Fields inherited from class org.apache.lucene.analysis.Tokenizer

        input
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      void end()  
      boolean incrementToken()  
      void reset()  
      • Methods inherited from class org.apache.lucene.analysis.Tokenizer

        close, correctOffset, reset
      • Methods inherited from class org.apache.lucene.util.AttributeSource

        addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
    • Constructor Detail

      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input)
      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input,
                                      int skip)
      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input,
                                      int bufferSize,
                                      char delimiter)
      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input,
                                      char delimiter,
                                      char replacement)
      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input,
                                      char delimiter,
                                      char replacement,
                                      int skip)
      • PathHierarchyTokenizer

        public PathHierarchyTokenizer​(Reader input,
                                      int bufferSize,
                                      char delimiter,
                                      char replacement,
                                      int skip)
    • Method Detail

      • incrementToken

        public final boolean incrementToken()
                                     throws IOException
        Specified by:
        incrementToken in class org.apache.lucene.analysis.TokenStream
        Throws:
        IOException
      • end

        public final void end()
        Overrides:
        end in class org.apache.lucene.analysis.TokenStream
      • reset

        public void reset()
                   throws IOException
        Overrides:
        reset in class org.apache.lucene.analysis.TokenStream
        Throws:
        IOException