Closeable, AutoCloseable@Deprecated public class RussianLetterTokenizer extends CharTokenizer
StandardTokenizer instead, which has the same functionality.
This filter will be removed in Lucene 5.0Tokenizer that extends LetterTokenizer
by also allowing the basic Latin digits 0-9.
You must specify the required Version compatibility when creating
RussianLetterTokenizer:
CharTokenizer uses an int based API to normalize and
detect token characters. See CharTokenizer.isTokenChar(int) and
CharTokenizer.normalize(int) for details.AttributeSource.AttributeFactory, AttributeSource.State| Constructor | Description |
|---|---|
RussianLetterTokenizer(Reader in) |
Deprecated.
use
RussianLetterTokenizer(Version, Reader) instead. |
RussianLetterTokenizer(AttributeSource.AttributeFactory factory,
Reader in) |
Deprecated.
|
RussianLetterTokenizer(AttributeSource source,
Reader in) |
Deprecated.
|
RussianLetterTokenizer(Version matchVersion,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer.
|
RussianLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory. |
RussianLetterTokenizer(Version matchVersion,
AttributeSource source,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource. |
| Modifier and Type | Method | Description |
|---|---|---|
protected boolean |
isTokenChar(int c) |
Deprecated.
Collects only characters which satisfy
Character.isLetter(int). |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toStringend, incrementToken, isTokenChar, normalize, normalize, resetclone, finalize, getClass, notify, notifyAll, wait, wait, waitclose, correctOffsetresetpublic RussianLetterTokenizer(Version matchVersion, Reader in)
in - the input to split up into tokenspublic RussianLetterTokenizer(Version matchVersion, AttributeSource source, Reader in)
AttributeSource.public RussianLetterTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader in)
AttributeSource.AttributeFactory. * @param
matchVersion Lucene version to match See
{@link above}factory - the attribute factory to use for this Tokenizerin - the input to split up into tokens@Deprecated public RussianLetterTokenizer(Reader in)
RussianLetterTokenizer(Version, Reader) instead. This will
be removed in Lucene 4.0.@Deprecated public RussianLetterTokenizer(AttributeSource source, Reader in)
RussianLetterTokenizer(Version, AttributeSource, Reader)
instead. This will be removed in Lucene 4.0.AttributeSource.@Deprecated public RussianLetterTokenizer(AttributeSource.AttributeFactory factory, Reader in)
RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader)
instead. This will be removed in Lucene 4.0.AttributeSource.AttributeFactory.protected boolean isTokenChar(int c)
Character.isLetter(int).isTokenChar in class CharTokenizerCopyright © 2000-2018 Apache Software Foundation. All Rights Reserved.