Package org.apache.lucene.analysis.ru
Class RussianLetterTokenizer
- java.lang.Object
  - org.apache.lucene.util.AttributeSource
    - org.apache.lucene.analysis.TokenStream
      - org.apache.lucene.analysis.Tokenizer
        - org.apache.lucene.analysis.CharTokenizer
          - org.apache.lucene.analysis.ru.RussianLetterTokenizer

All Implemented Interfaces:
Closeable, AutoCloseable
@Deprecated public class RussianLetterTokenizer extends org.apache.lucene.analysis.CharTokenizer
Deprecated. Use StandardTokenizer instead, which has the same functionality. This tokenizer will be removed in Lucene 5.0.

A RussianLetterTokenizer is a Tokenizer that extends LetterTokenizer by also allowing the basic Latin digits 0-9.

You must specify the required Version compatibility when creating RussianLetterTokenizer:
- As of 3.1, CharTokenizer uses an int based API to normalize and detect token characters. See CharTokenizer.isTokenChar(int) and CharTokenizer.normalize(int) for details.
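The token rule described above (letters plus the basic Latin digits 0-9) can be illustrated without Lucene on the classpath. The following is a minimal JDK-only sketch, not the library's implementation: the class name LetterDigitTokenSketch and the tokenize helper are hypothetical, the predicate is an assumption based on the class description, and normalization and offset tracking are left out.

```java
import java.util.ArrayList;
import java.util.List;

/** Standalone illustration; this is NOT the Lucene class. */
final class LetterDigitTokenSketch {

    /** Assumed predicate: any Unicode letter, or a basic Latin digit 0-9. */
    static boolean isTokenChar(int c) {
        return Character.isLetter(c) || (c >= '0' && c <= '9');
    }

    /** Splits text into maximal runs of token characters, walking by code point. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (isTokenChar(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                // A non-token character ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [Москва, 2010, версия, 3, 1]
        System.out.println(tokenize("Москва 2010, версия 3.1"));
    }
}
```

With this predicate, "3.1" splits into two tokens because '.' is neither a letter nor a digit.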
Constructor Summary
Constructors:
- RussianLetterTokenizer(Reader in)
  Deprecated. Use RussianLetterTokenizer(Version, Reader) instead.
- RussianLetterTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader in)
  Deprecated.
- RussianLetterTokenizer(org.apache.lucene.util.AttributeSource source, Reader in)
  Deprecated.
- RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, Reader in)
  Deprecated. Construct a new RussianLetterTokenizer.
- RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader in)
  Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.AttributeFactory.
- RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, org.apache.lucene.util.AttributeSource source, Reader in)
  Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.
Method Summary
- protected boolean isTokenChar(int c)
  Deprecated. Collects only characters which satisfy Character.isLetter(int).

Methods inherited from class org.apache.lucene.analysis.CharTokenizer
end, incrementToken, isTokenChar, normalize, normalize, reset

Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
Constructor Detail

RussianLetterTokenizer
public RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, Reader in)
Deprecated. Construct a new RussianLetterTokenizer.
Parameters:
matchVersion - Lucene version to match. See the Version compatibility note in the class description above.
in - the input to split up into tokens

RussianLetterTokenizer
public RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, org.apache.lucene.util.AttributeSource source, Reader in)
Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.
Parameters:
matchVersion - Lucene version to match. See the Version compatibility note in the class description above.
source - the attribute source to use for this Tokenizer
in - the input to split up into tokens

RussianLetterTokenizer
public RussianLetterTokenizer(org.apache.lucene.util.Version matchVersion, org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader in)
Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.AttributeFactory.
Parameters:
matchVersion - Lucene version to match. See the Version compatibility note in the class description above.
factory - the attribute factory to use for this Tokenizer
in - the input to split up into tokens

RussianLetterTokenizer
@Deprecated public RussianLetterTokenizer(Reader in)
Deprecated. Use RussianLetterTokenizer(Version, Reader) instead. This will be removed in Lucene 4.0.
Construct a new RussianLetterTokenizer.

RussianLetterTokenizer
@Deprecated public RussianLetterTokenizer(org.apache.lucene.util.AttributeSource source, Reader in)
Deprecated. Use RussianLetterTokenizer(Version, AttributeSource, Reader) instead. This will be removed in Lucene 4.0.
Construct a new RussianLetterTokenizer using a given AttributeSource.

RussianLetterTokenizer
@Deprecated public RussianLetterTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader in)
Deprecated. Use RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) instead. This will be removed in Lucene 4.0.
Construct a new RussianLetterTokenizer using a given AttributeSource.AttributeFactory.

Method Detail
isTokenChar
protected boolean isTokenChar(int c)
Deprecated. Collects only characters which satisfy Character.isLetter(int).
Overrides:
isTokenChar in class org.apache.lucene.analysis.CharTokenizer
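Why the int based signature matters can be seen with a character outside the Basic Multilingual Plane. The sketch below uses only the JDK; the class and method names are hypothetical and nothing here is taken from Lucene. It checks U+10400 (DESERET CAPITAL LETTER LONG I) once as a full code point and once as a single UTF-16 char.

```java
/** Standalone illustration of code point versus char based letter checks. */
final class CodePointCheckSketch {

    /** Tests the full code point starting at index, as an int based API would. */
    static boolean isLetterByCodePoint(String s, int index) {
        return Character.isLetter(s.codePointAt(index));
    }

    /** Tests only the UTF-16 code unit at index, as a char based API would. */
    static boolean isLetterByChar(String s, int index) {
        return Character.isLetter(s.charAt(index));
    }

    public static void main(String[] args) {
        // U+10400 is encoded as a surrogate pair in a Java String.
        String deseret = new String(Character.toChars(0x10400));
        System.out.println(isLetterByCodePoint(deseret, 0)); // true
        System.out.println(isLetterByChar(deseret, 0));      // false: a lone surrogate is not a letter
    }
}
```

A predicate that receives a char would see only one half of the surrogate pair and reject the letter, which is the kind of case an int parameter avoids.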