Class FilterIndexReader
- java.lang.Object
-
- org.apache.lucene.index.IndexReader
-
- org.apache.lucene.index.FilterIndexReader
-
- All Implemented Interfaces:
Closeable,AutoCloseable,Cloneable
- Direct Known Subclasses:
PruningReader
public class FilterIndexReader extends IndexReader
AFilterIndexReadercontains another IndexReader, which it uses as its basic source of data, possibly transforming the data along the way or providing additional functionality. The classFilterIndexReaderitself simply implements all abstract methods ofIndexReaderwith versions that pass all requests to the contained index reader. Subclasses ofFilterIndexReadermay further override some of these methods and may also provide additional methods and fields.Note: The default implementation of
IndexReader.doOpenIfChanged()throwsUnsupportedOperationException(like the base class), so it's not possible to reopen aFilterIndexReader. To reopen, you have to first reopen the underlying reader and wrap it again with the custom filter.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classFilterIndexReader.FilterTermDocsBase class for filteringTermDocsimplementations.static classFilterIndexReader.FilterTermEnumBase class for filteringTermEnumimplementations.static classFilterIndexReader.FilterTermPositionsBase class for filteringTermPositionsimplementations.-
Nested classes/interfaces inherited from class org.apache.lucene.index.IndexReader
IndexReader.ReaderClosedListener
-
-
Field Summary
Fields Modifier and Type Field Description protected IndexReaderin-
Fields inherited from class org.apache.lucene.index.IndexReader
hasChanges
-
-
Constructor Summary
Constructors Constructor Description FilterIndexReader(IndexReader in)Construct a FilterIndexReader based on the specified base reader.
-
Method Summary
All Methods Instance Methods Concrete Methods Deprecated Methods Modifier and Type Method Description Directorydirectory()Returns the directory associated with this index.intdocFreq(Term t)Returns the number of documents containing the termt.protected voiddoClose()Implements close.protected voiddoCommit(Map<String,String> commitUserData)Deprecated.Documentdocument(int n, FieldSelector fieldSelector)Get theDocumentat thenth position.protected voiddoDelete(int n)Deprecated.protected voiddoSetNorm(int d, String f, byte b)Deprecated.protected voiddoUndeleteAll()Deprecated.Map<String,String>getCommitUserData()Retrieve the String userData optionally passed to IndexWriter#commit.ObjectgetCoreCacheKey()If the subclass of FilteredIndexReader modifies the contents of the FieldCache, you must override this method to provide a different keyObjectgetDeletesCacheKey()If the subclass of FilteredIndexReader modifies the deleted docs, you must override this method to provide a different keyFieldInfosgetFieldInfos()Get theFieldInfosdescribing all fields in this reader.IndexCommitgetIndexCommit()Expert: return the IndexCommit that this reader has opened.IndexReader[]getSequentialSubReaders()Expert: returns the sequential sub readers that this reader is logically composed of.TermFreqVectorgetTermFreqVector(int docNumber, String field)Return a term frequency vector for the specified document and field.voidgetTermFreqVector(int docNumber, String field, TermVectorMapper mapper)Load the Term Vector into a user-defined data structure instead of relying on the parallel arrays of theTermFreqVector.voidgetTermFreqVector(int docNumber, TermVectorMapper mapper)Map all the term vectors for all fields in a DocumentTermFreqVector[]getTermFreqVectors(int docNumber)Return an array of term frequency vectors for the specified document.intgetTermInfosIndexDivisor()For IndexReader implementations that use TermInfosReader to read terms, this returns the current indexDivisor as specified when the reader was opened.longgetUniqueTermCount()Returns the number of unique terms (across all fields) in this reader.longgetVersion()Version number when this IndexReader was opened.booleanhasDeletions()Returns true if any documents have been deletedbooleanhasNorms(String field)Returns true if there are norms stored for this field.booleanisCurrent()Check whether any new changes have occurred to the index since this reader was opened.booleanisDeleted(int n)Returns true if document n has been deletedbooleanisOptimized()Deprecated.intmaxDoc()Returns one greater than the largest possible document number.byte[]norms(String f)Returns the byte-encoded normalization factor for the named field of every document.voidnorms(String f, byte[] bytes, int offset)Reads the byte-encoded normalization factor for the named field of every document.intnumDocs()Returns the number of documents in this index.TermDocstermDocs()Returns an unpositionedTermDocsenumerator.TermDocstermDocs(Term term)Returns an enumeration of all the documents which containterm.TermPositionstermPositions()Returns an unpositionedTermPositionsenumerator.TermEnumterms()Returns an enumeration of all the terms in the index.TermEnumterms(Term t)Returns an enumeration of all terms starting at a given term.StringtoString()-
Methods inherited from class org.apache.lucene.index.IndexReader
acquireWriteLock, addReaderClosedListener, clone, clone, close, commit, commit, decRef, deleteDocument, deleteDocuments, document, doOpenIfChanged, doOpenIfChanged, doOpenIfChanged, doOpenIfChanged, ensureOpen, flush, flush, getCommitUserData, getCurrentVersion, getRefCount, incRef, indexExists, lastModified, listCommits, numDeletedDocs, open, open, open, open, open, open, open, open, open, open, open, openIfChanged, openIfChanged, openIfChanged, openIfChanged, removeReaderClosedListener, reopen, reopen, reopen, reopen, setNorm, setNorm, termPositions, tryIncRef, undeleteAll
-
-
-
-
Field Detail
-
in
protected IndexReader in
-
-
Constructor Detail
-
FilterIndexReader
public FilterIndexReader(IndexReader in)
Construct a FilterIndexReader based on the specified base reader. Directory locking for delete, undeleteAll, and setNorm operations is left to the base reader.
Note that base reader is closed if this FilterIndexReader is closed.
- Parameters:
in- specified base reader.
-
-
Method Detail
-
directory
public Directory directory()
Description copied from class:IndexReaderReturns the directory associated with this index. The Default implementation returns the directory specified by subclasses when delegating to the IndexReader(Directory) constructor, or throws an UnsupportedOperationException if one was not specified.- Overrides:
directoryin classIndexReader
-
getIndexCommit
public IndexCommit getIndexCommit() throws IOException
Description copied from class:IndexReaderExpert: return the IndexCommit that this reader has opened. This method is only implemented by those readers that correspond to a Directory with its own segments_N file.- Overrides:
getIndexCommitin classIndexReader- Throws:
IOException
-
getFieldInfos
public FieldInfos getFieldInfos()
Description copied from class:IndexReaderGet theFieldInfosdescribing all fields in this reader. NOTE: do not make any changes to the returned FieldInfos!- Specified by:
getFieldInfosin classIndexReader
-
getTermInfosIndexDivisor
public int getTermInfosIndexDivisor()
Description copied from class:IndexReaderFor IndexReader implementations that use TermInfosReader to read terms, this returns the current indexDivisor as specified when the reader was opened.- Overrides:
getTermInfosIndexDivisorin classIndexReader
-
getTermFreqVectors
public TermFreqVector[] getTermFreqVectors(int docNumber) throws IOException
Description copied from class:IndexReaderReturn an array of term frequency vectors for the specified document. The array contains a vector for each vectorized field in the document. Each vector contains terms and frequencies for all terms in a given vectorized field. If no such fields existed, the method returns null. The term vectors that are returned may either be of typeTermFreqVectoror of typeTermPositionVectorif positions or offsets have been stored.- Specified by:
getTermFreqVectorsin classIndexReader- Parameters:
docNumber- document for which term frequency vectors are returned- Returns:
- array of term frequency vectors. May be null if no term vectors have been stored for the specified document.
- Throws:
IOException- if index cannot be accessed- See Also:
Field.TermVector
-
getTermFreqVector
public TermFreqVector getTermFreqVector(int docNumber, String field) throws IOException
Description copied from class:IndexReaderReturn a term frequency vector for the specified document and field. The returned vector contains terms and frequencies for the terms in the specified field of this document, if the field had the storeTermVector flag set. If termvectors had been stored with positions or offsets, aTermPositionVectoris returned.- Specified by:
getTermFreqVectorin classIndexReader- Parameters:
docNumber- document for which the term frequency vector is returnedfield- field for which the term frequency vector is returned.- Returns:
- term frequency vector May be null if field does not exist in the specified document or term vector was not stored.
- Throws:
IOException- if index cannot be accessed- See Also:
Field.TermVector
-
getTermFreqVector
public void getTermFreqVector(int docNumber, String field, TermVectorMapper mapper) throws IOExceptionDescription copied from class:IndexReaderLoad the Term Vector into a user-defined data structure instead of relying on the parallel arrays of theTermFreqVector.- Specified by:
getTermFreqVectorin classIndexReader- Parameters:
docNumber- The number of the document to load the vector forfield- The name of the field to loadmapper- TheTermVectorMapperto process the vector. Must not be null- Throws:
IOException- if term vectors cannot be accessed or if they do not exist on the field and doc. specified.
-
getTermFreqVector
public void getTermFreqVector(int docNumber, TermVectorMapper mapper) throws IOExceptionDescription copied from class:IndexReaderMap all the term vectors for all fields in a Document- Specified by:
getTermFreqVectorin classIndexReader- Parameters:
docNumber- The number of the document to load the vector formapper- TheTermVectorMapperto process the vector. Must not be null- Throws:
IOException- if term vectors cannot be accessed or if they do not exist on the field and doc. specified.
-
getUniqueTermCount
public long getUniqueTermCount() throws IOExceptionDescription copied from class:IndexReaderReturns the number of unique terms (across all fields) in this reader. This method returns long, even though internally Lucene cannot handle more than 2^31 unique terms, for a possible future when this limitation is removed.- Overrides:
getUniqueTermCountin classIndexReader- Throws:
IOException
-
numDocs
public int numDocs()
Description copied from class:IndexReaderReturns the number of documents in this index.- Specified by:
numDocsin classIndexReader
-
maxDoc
public int maxDoc()
Description copied from class:IndexReaderReturns one greater than the largest possible document number. This may be used to, e.g., determine how big to allocate an array which will have an element for every document number in an index.- Specified by:
maxDocin classIndexReader
-
document
public Document document(int n, FieldSelector fieldSelector) throws CorruptIndexException, IOException
Description copied from class:IndexReaderGet theDocumentat thenth position. TheFieldSelectormay be used to determine whatFields to load and how they should be loaded. NOTE: If this Reader (more specifically, the underlyingFieldsReader) is closed before the lazyFieldis loaded an exception may be thrown. If you want the value of a lazyFieldto be available after closing you must explicitly load it or fetch the Document again with a new loader.NOTE: for performance reasons, this method does not check if the requested document is deleted, and therefore asking for a deleted document may yield unspecified results. Usually this is not required, however you can call
IndexReader.isDeleted(int)with the requested document ID to verify the document is not deleted.- Specified by:
documentin classIndexReader- Parameters:
n- Get the document at thenth positionfieldSelector- TheFieldSelectorto use to determine what Fields should be loaded on the Document. May be null, in which case all Fields will be loaded.- Returns:
- The stored fields of the
Documentat the nth position - Throws:
CorruptIndexException- if the index is corruptIOException- if there is a low-level IO error- See Also:
Fieldable,FieldSelector,SetBasedFieldSelector,LoadFirstFieldSelector
-
isDeleted
public boolean isDeleted(int n)
Description copied from class:IndexReaderReturns true if document n has been deleted- Specified by:
isDeletedin classIndexReader
-
hasDeletions
public boolean hasDeletions()
Description copied from class:IndexReaderReturns true if any documents have been deleted- Specified by:
hasDeletionsin classIndexReader
-
doUndeleteAll
@Deprecated protected void doUndeleteAll() throws CorruptIndexException, IOException
Deprecated.Implements actual undeleteAll() in subclass.- Specified by:
doUndeleteAllin classIndexReader- Throws:
CorruptIndexExceptionIOException
-
hasNorms
public boolean hasNorms(String field) throws IOException
Description copied from class:IndexReaderReturns true if there are norms stored for this field.- Overrides:
hasNormsin classIndexReader- Throws:
IOException
-
norms
public byte[] norms(String f) throws IOException
Description copied from class:IndexReaderReturns the byte-encoded normalization factor for the named field of every document. This is used by the search code to score documents. Returns null if norms were not indexed for this field.- Specified by:
normsin classIndexReader- Throws:
IOException- See Also:
AbstractField.setBoost(float)
-
norms
public void norms(String f, byte[] bytes, int offset) throws IOException
Description copied from class:IndexReaderReads the byte-encoded normalization factor for the named field of every document. This is used by the search code to score documents.- Specified by:
normsin classIndexReader- Throws:
IOException- See Also:
AbstractField.setBoost(float)
-
doSetNorm
@Deprecated protected void doSetNorm(int d, String f, byte b) throws CorruptIndexException, IOException
Deprecated.Implements setNorm in subclass.- Specified by:
doSetNormin classIndexReader- Throws:
CorruptIndexExceptionIOException
-
terms
public TermEnum terms() throws IOException
Description copied from class:IndexReaderReturns an enumeration of all the terms in the index. The enumeration is ordered by Term.compareTo(). Each term is greater than all that precede it in the enumeration. Note that after calling terms(),TermEnum.next()must be called on the resulting enumeration before calling other methods such asTermEnum.term().- Specified by:
termsin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
terms
public TermEnum terms(Term t) throws IOException
Description copied from class:IndexReaderReturns an enumeration of all terms starting at a given term. If the given term does not exist, the enumeration is positioned at the first term greater than the supplied term. The enumeration is ordered by Term.compareTo(). Each term is greater than all that precede it in the enumeration.- Specified by:
termsin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
docFreq
public int docFreq(Term t) throws IOException
Description copied from class:IndexReaderReturns the number of documents containing the termt.- Specified by:
docFreqin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
termDocs
public TermDocs termDocs() throws IOException
Description copied from class:IndexReaderReturns an unpositionedTermDocsenumerator.Note: the TermDocs returned is unpositioned. Before using it, ensure that you first position it with
TermDocs.seek(Term)orTermDocs.seek(TermEnum).- Specified by:
termDocsin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
termDocs
public TermDocs termDocs(Term term) throws IOException
Description copied from class:IndexReaderReturns an enumeration of all the documents which containterm. For each document, the document number, the frequency of the term in that document is also provided, for use in search scoring. If term is null, then all non-deleted docs are returned with freq=1. Thus, this method implements the mapping:-
Term => <docNum, freq>*
The enumeration is ordered by document number. Each document number is greater than all that precede it in the enumeration.
- Overrides:
termDocsin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
termPositions
public TermPositions termPositions() throws IOException
Description copied from class:IndexReaderReturns an unpositionedTermPositionsenumerator.- Specified by:
termPositionsin classIndexReader- Throws:
IOException- if there is a low-level IO error
-
doDelete
@Deprecated protected void doDelete(int n) throws CorruptIndexException, IOException
Deprecated.Implements deletion of the document numbereddocNum. Applications should callIndexReader.deleteDocument(int)orIndexReader.deleteDocuments(Term).- Specified by:
doDeletein classIndexReader- Throws:
CorruptIndexExceptionIOException
-
doCommit
@Deprecated protected void doCommit(Map<String,String> commitUserData) throws IOException
Deprecated.Implements commit.- Specified by:
doCommitin classIndexReader- Throws:
IOException
-
doClose
protected void doClose() throws IOExceptionDescription copied from class:IndexReaderImplements close.- Specified by:
doClosein classIndexReader- Throws:
IOException
-
getVersion
public long getVersion()
Description copied from class:IndexReaderVersion number when this IndexReader was opened. Not implemented in the IndexReader base class.If this reader is based on a Directory (ie, was created by calling
IndexReader.open(org.apache.lucene.store.Directory), orIndexReader.openIfChanged(org.apache.lucene.index.IndexReader)on a reader based on a Directory), then this method returns the version recorded in the commit that the reader opened. This version is advanced every timeIndexWriter.commit()is called.- Overrides:
getVersionin classIndexReader
-
isCurrent
public boolean isCurrent() throws CorruptIndexException, IOExceptionDescription copied from class:IndexReaderCheck whether any new changes have occurred to the index since this reader was opened.If this reader is based on a Directory (ie, was created by calling
IndexReader.open(org.apache.lucene.store.Directory), orIndexReader.openIfChanged(org.apache.lucene.index.IndexReader)on a reader based on a Directory), then this method checks if any further commits (seeIndexWriter.commit()have occurred in that directory).If instead this reader is a near real-time reader (ie, obtained by a call to
IndexWriter.getReader(), or by callingIndexReader.openIfChanged(org.apache.lucene.index.IndexReader)on a near real-time reader), then this method checks if either a new commmit has occurred, or any new uncommitted changes have taken place via the writer. Note that even if the writer has only performed merging, this method will still return false.In any event, if this returns false, you should call
IndexReader.openIfChanged(org.apache.lucene.index.IndexReader)to get a new reader that sees the changes.- Overrides:
isCurrentin classIndexReader- Throws:
CorruptIndexException- if the index is corruptIOException- if there is a low-level IO error
-
isOptimized
@Deprecated public boolean isOptimized()
Deprecated.- Overrides:
isOptimizedin classIndexReader
-
getSequentialSubReaders
public IndexReader[] getSequentialSubReaders()
Description copied from class:IndexReaderExpert: returns the sequential sub readers that this reader is logically composed of. For example, IndexSearcher uses this API to drive searching by one sub reader at a time. If this reader is not composed of sequential child readers, it should return null. If this method returns an empty array, that means this reader is a null reader (for example a MultiReader that has no sub readers).NOTE: You should not try using sub-readers returned by this method to make any changes (setNorm, deleteDocument, etc.). While this might succeed for one composite reader (like MultiReader), it will most likely lead to index corruption for other readers (like DirectoryReader obtained through
IndexReader.open(org.apache.lucene.store.Directory). Use the parent reader directly.- Overrides:
getSequentialSubReadersin classIndexReader
-
getCommitUserData
public Map<String,String> getCommitUserData()
Description copied from class:IndexReaderRetrieve the String userData optionally passed to IndexWriter#commit. This will return null ifIndexWriter.commit(Map)has never been called for this index.- Overrides:
getCommitUserDatain classIndexReader
-
getCoreCacheKey
public Object getCoreCacheKey()
If the subclass of FilteredIndexReader modifies the contents of the FieldCache, you must override this method to provide a different key- Overrides:
getCoreCacheKeyin classIndexReader
-
getDeletesCacheKey
public Object getDeletesCacheKey()
If the subclass of FilteredIndexReader modifies the deleted docs, you must override this method to provide a different key- Overrides:
getDeletesCacheKeyin classIndexReader
-
toString
public String toString()
- Overrides:
toStringin classIndexReader
-
-