Class SentenceTokenizer
- java.lang.Object
-
- org.apache.lucene.util.AttributeSource
-
- org.apache.lucene.analysis.TokenStream
-
- org.apache.lucene.analysis.Tokenizer
-
- org.apache.lucene.analysis.cn.smart.SentenceTokenizer
-
- All Implemented Interfaces:
Closeable,AutoCloseable
public final class SentenceTokenizer extends org.apache.lucene.analysis.TokenizerTokenizes input text into sentences.The output tokens can then be broken into words with
WordTokenFilter- WARNING: This API is experimental and might change in incompatible ways in the next release.
-
-
Constructor Summary
Constructors Constructor Description SentenceTokenizer(Reader reader)SentenceTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader reader)SentenceTokenizer(org.apache.lucene.util.AttributeSource source, Reader reader)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidend()booleanincrementToken()voidreset()voidreset(Reader input)-
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
-
-
-
-
Constructor Detail
-
SentenceTokenizer
public SentenceTokenizer(Reader reader)
-
SentenceTokenizer
public SentenceTokenizer(org.apache.lucene.util.AttributeSource source, Reader reader)
-
SentenceTokenizer
public SentenceTokenizer(org.apache.lucene.util.AttributeSource.AttributeFactory factory, Reader reader)
-
-
Method Detail
-
incrementToken
public boolean incrementToken() throws IOException- Specified by:
incrementTokenin classorg.apache.lucene.analysis.TokenStream- Throws:
IOException
-
reset
public void reset() throws IOException- Overrides:
resetin classorg.apache.lucene.analysis.TokenStream- Throws:
IOException
-
reset
public void reset(Reader input) throws IOException
- Overrides:
resetin classorg.apache.lucene.analysis.Tokenizer- Throws:
IOException
-
end
public void end() throws IOException- Overrides:
endin classorg.apache.lucene.analysis.TokenStream- Throws:
IOException
-
-