Uses of Class
org.apache.lucene.util.AttributeSource
-
Packages that use AttributeSource
Package | Description
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens.
org.apache.lucene.analysis.standard | Standards-based analyzers implemented with JFlex.
org.apache.lucene.collation | CollationKeyFilter converts each token into its binary CollationKey using the provided Collator, and then encodes the CollationKey as a String using IndexableBinaryStringTools, to allow it to be stored as an index term.
org.apache.lucene.index | Code to maintain and access indices.
org.apache.lucene.util | Some utility classes.
-
Uses of AttributeSource in org.apache.lucene.analysis
Subclasses of AttributeSource in org.apache.lucene.analysis
Modifier and Type | Class | Description
class | ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.
class | CachingTokenFilter | This class can be used if the token attributes of a TokenStream are intended to be consumed more than once.
class | CharTokenizer | An abstract base class for simple, character-oriented tokenizers.
class | FilteringTokenFilter | Abstract base class for TokenFilters that may remove tokens.
class | ISOLatin1AccentFilter | Deprecated. If you build a new index, use ASCIIFoldingFilter which covers a superset of Latin 1.
class | KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute.
class | KeywordTokenizer | Emits the entire input as a single token.
class | LengthFilter | Removes words that are too long or too short from the stream.
class | LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters.
class | LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing.
class | LowerCaseFilter | Normalizes token text to lower case.
class | LowerCaseTokenizer | LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together.
class | NumericTokenStream | Expert: This class provides a TokenStream for indexing numeric values that can be used by NumericRangeQuery or NumericRangeFilter.
class | PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm.
class | StopFilter | Removes stop words from a token stream.
class | TeeSinkTokenFilter | This TokenFilter provides the ability to set aside attribute states that have already been analyzed.
static class | TeeSinkTokenFilter.SinkTokenStream | TokenStream output from a tee with optional filtering.
class | TokenFilter | A TokenFilter is a TokenStream whose input is another TokenStream.
class | Tokenizer | A Tokenizer is a TokenStream whose input is a Reader.
class | TokenStream |
class | TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream.
class | WhitespaceTokenizer | A WhitespaceTokenizer is a tokenizer that divides text at whitespace.

Methods in org.apache.lucene.analysis with parameters of type AttributeSource
Modifier and Type | Method | Description
abstract boolean | TeeSinkTokenFilter.SinkFilter.accept(AttributeSource source) | Returns true iff the current state of the passed-in AttributeSource shall be stored in the sink.

Constructors in org.apache.lucene.analysis with parameters of type AttributeSource
Constructor | Description
CharTokenizer(AttributeSource source, Reader input) | Deprecated. Use CharTokenizer(Version, AttributeSource, Reader) instead.
CharTokenizer(Version matchVersion, AttributeSource source, Reader input) | Creates a new CharTokenizer instance.
KeywordTokenizer(AttributeSource source, Reader input, int bufferSize) |
LetterTokenizer(AttributeSource source, Reader in) | Deprecated. Use LetterTokenizer(Version, AttributeSource, Reader) instead.
LetterTokenizer(Version matchVersion, AttributeSource source, Reader in) | Construct a new LetterTokenizer using a given AttributeSource.
LowerCaseTokenizer(AttributeSource source, Reader in) | Deprecated. Use LowerCaseTokenizer(Version, AttributeSource, Reader) instead.
LowerCaseTokenizer(Version matchVersion, AttributeSource source, Reader in) | Construct a new LowerCaseTokenizer using a given AttributeSource.
NumericTokenStream(AttributeSource source, int precisionStep) | Expert: Creates a token stream for numeric values with the specified precisionStep using the given AttributeSource.
Tokenizer(AttributeSource source) | Deprecated. Use Tokenizer(AttributeSource, Reader) instead.
Tokenizer(AttributeSource source, Reader input) | Construct a token stream processing the given input using the given AttributeSource.
TokenStream(AttributeSource input) | A TokenStream that uses the same attributes as the supplied one.
WhitespaceTokenizer(AttributeSource source, Reader in) | Deprecated.
WhitespaceTokenizer(Version matchVersion, AttributeSource source, Reader in) | Construct a new WhitespaceTokenizer using a given AttributeSource.
-
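The contract of TeeSinkTokenFilter.SinkFilter.accept(AttributeSource source) listed for org.apache.lucene.analysis (return true iff the current state shall be stored in the sink) can be illustrated with a small plain-Java model. This is only a sketch under stated assumptions: ToyState, ToySinkFilter, ToySink and ToyTee are hypothetical names invented here, they are not Lucene classes, and the model ignores how Lucene actually captures and restores attribute states.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the current attribute state of a stream.
final class ToyState {
    final String term;
    final String type;

    ToyState(String term, String type) {
        this.term = term;
        this.type = type;
    }
}

// Hypothetical counterpart of a sink filter: decides what a sink stores.
interface ToySinkFilter {
    boolean accept(ToyState state);
}

// A sink keeps only the states its filter accepts.
final class ToySink {
    private final ToySinkFilter filter;
    final List<ToyState> stored = new ArrayList<>();

    ToySink(ToySinkFilter filter) {
        this.filter = filter;
    }

    void offer(ToyState state) {
        if (filter.accept(state)) {
            stored.add(state);
        }
    }
}

// The tee offers every state to each sink and passes it on unchanged.
final class ToyTee {
    private final List<ToySink> sinks = new ArrayList<>();

    ToySink newSink(ToySinkFilter filter) {
        ToySink sink = new ToySink(filter);
        sinks.add(sink);
        return sink;
    }

    List<ToyState> consume(List<ToyState> input) {
        List<ToyState> passedThrough = new ArrayList<>();
        for (ToyState state : input) {
            for (ToySink sink : sinks) {
                sink.offer(state);
            }
            passedThrough.add(state);
        }
        return passedThrough;
    }
}
```

In this model, a sink created with `tee.newSink(s -> s.type.equals("<NUM>"))` would store only the states whose type is "<NUM>", while the tee's own output still contains every state.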
Uses of AttributeSource in org.apache.lucene.analysis.standard
Subclasses of AttributeSource in org.apache.lucene.analysis.standard
Modifier and Type | Class | Description
class | ClassicFilter | Normalizes tokens extracted with ClassicTokenizer.
class | ClassicTokenizer | A grammar-based tokenizer constructed with JFlex.
class | StandardFilter | Normalizes tokens extracted with StandardTokenizer.
class | StandardTokenizer | A grammar-based tokenizer constructed with JFlex.
class | UAX29URLEmailTokenizer | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. URLs and email addresses are also tokenized according to the relevant RFCs.

Constructors in org.apache.lucene.analysis.standard with parameters of type AttributeSource
Constructor | Description
ClassicTokenizer(Version matchVersion, AttributeSource source, Reader input) | Creates a new ClassicTokenizer with a given AttributeSource.
StandardTokenizer(Version matchVersion, AttributeSource source, Reader input) | Creates a new StandardTokenizer with a given AttributeSource.
UAX29URLEmailTokenizer(AttributeSource source, Reader input) | Deprecated.
UAX29URLEmailTokenizer(Version matchVersion, AttributeSource source, Reader input) | Creates a new UAX29URLEmailTokenizer with a given AttributeSource.
-
Uses of AttributeSource in org.apache.lucene.collation
Subclasses of AttributeSource in org.apache.lucene.collation
Modifier and Type | Class | Description
class | CollationKeyFilter | Converts each token into its CollationKey, and then encodes the CollationKey with IndexableBinaryStringTools, to allow it to be stored as an index term.
-
Uses of AttributeSource in org.apache.lucene.index
Methods in org.apache.lucene.index that return AttributeSource
Modifier and Type | Method | Description
AttributeSource | FieldInvertState.getAttributeSource() |
-
Uses of AttributeSource in org.apache.lucene.util
Methods in org.apache.lucene.util that return AttributeSource
Modifier and Type | Method | Description
AttributeSource | AttributeSource.cloneAttributes() | Performs a clone of all AttributeImpl instances returned in a new AttributeSource instance.

Methods in org.apache.lucene.util with parameters of type AttributeSource
Modifier and Type | Method | Description
void | AttributeSource.copyTo(AttributeSource target) | Copies the contents of this AttributeSource to the given target AttributeSource.

Constructors in org.apache.lucene.util with parameters of type AttributeSource
Constructor | Description
AttributeSource(AttributeSource input) | An AttributeSource that uses the same attributes as the supplied one.
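
The three org.apache.lucene.util entries describe different relationships between two sources: AttributeSource(AttributeSource input) shares the same attributes, cloneAttributes() returns a new source holding clones, and copyTo(target) copies contents into an existing target. The plain-Java model below sketches that distinction. It is not Lucene code: ToyAttribute and ToyAttributeSource are hypothetical names, attributes are reduced to a single String value, and the rule that the target must already hold each attribute is an assumption of this sketch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for an attribute implementation: one mutable value.
final class ToyAttribute {
    String value = "";

    ToyAttribute copy() {
        ToyAttribute c = new ToyAttribute();
        c.value = value;
        return c;
    }
}

// Hypothetical stand-in for AttributeSource; NOT the Lucene class.
final class ToyAttributeSource {
    // Attribute name -> instance. The map itself may be shared between sources.
    private final Map<String, ToyAttribute> attributes;

    ToyAttributeSource() {
        this.attributes = new LinkedHashMap<>();
    }

    // Models AttributeSource(AttributeSource input): same attribute instances.
    ToyAttributeSource(ToyAttributeSource input) {
        this.attributes = input.attributes;
    }

    // Returns the attribute registered under the name, creating it on first use.
    ToyAttribute addAttribute(String name) {
        return attributes.computeIfAbsent(name, k -> new ToyAttribute());
    }

    // Models cloneAttributes(): a new source holding independent copies.
    ToyAttributeSource cloneAttributes() {
        ToyAttributeSource clone = new ToyAttributeSource();
        for (Map.Entry<String, ToyAttribute> e : attributes.entrySet()) {
            clone.attributes.put(e.getKey(), e.getValue().copy());
        }
        return clone;
    }

    // Models copyTo(target): overwrites the target's values, keeps its instances.
    // Assumption of this sketch: the target must already hold each attribute.
    void copyTo(ToyAttributeSource target) {
        for (Map.Entry<String, ToyAttribute> e : attributes.entrySet()) {
            ToyAttribute dest = target.attributes.get(e.getKey());
            if (dest == null) {
                throw new IllegalArgumentException("target lacks attribute " + e.getKey());
            }
            dest.value = e.getValue().value;
        }
    }
}

final class AttributeSharingSketch {
    public static void main(String[] args) {
        ToyAttributeSource original = new ToyAttributeSource();
        ToyAttribute term = original.addAttribute("term");
        term.value = "lucene";

        ToyAttributeSource shared = new ToyAttributeSource(original);
        ToyAttributeSource cloned = original.cloneAttributes();

        term.value = "search";
        System.out.println(shared.addAttribute("term").value); // search (shared instance)
        System.out.println(cloned.addAttribute("term").value); // lucene (independent copy)

        original.copyTo(cloned);
        System.out.println(cloned.addAttribute("term").value); // search (contents copied)
    }
}
```

The sharing constructor is the one the TokenStream, Tokenizer and tokenizer subclass constructors listed on this page accept, which is why a change made through one source in the model is immediately visible through the other.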
-