org.apache.lucene.analysis
public abstract class Analyzer extends Object
Typical implementations first build a Tokenizer, which breaks the stream of characters from the Reader into raw Tokens. One or more TokenFilters may then be applied to the output of the Tokenizer.
WARNING: You must override one of the methods defined by this class in your subclass or the Analyzer will enter an infinite loop.
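The Tokenizer-then-TokenFilter pipeline described above can be modelled in plain Java. This is only a sketch of the shape of such a chain, not Lucene code: `PipelineSketch`, `whitespaceTokens`, `lowerCase`, `removeStopWords`, and `analyze` are hypothetical stand-ins for a Tokenizer, two TokenFilters, and a `tokenStream` override, and tokens are modelled as plain strings rather than Token objects.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Plain-Java model of a Tokenizer -> TokenFilter chain (assumption: not Lucene classes).
class PipelineSketch {

    // Stand-in for a Tokenizer: reads characters from the Reader and
    // breaks them into raw tokens at whitespace.
    static List<String> whitespaceTokens(Reader reader) throws IOException {
        List<String> tokens = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        int c;
        while ((c = reader.read()) != -1) {
            if (Character.isWhitespace(c)) {
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.append((char) c);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Stand-in for a first TokenFilter: lowercases every token.
    static List<String> lowerCase(List<String> input) {
        List<String> out = new ArrayList<String>();
        for (String token : input) {
            out.add(token.toLowerCase(Locale.ROOT));
        }
        return out;
    }

    // Stand-in for a second TokenFilter: drops tokens found in a stop set.
    static List<String> removeStopWords(List<String> input, Set<String> stopWords) {
        List<String> out = new ArrayList<String>();
        for (String token : input) {
            if (!stopWords.contains(token)) {
                out.add(token);
            }
        }
        return out;
    }

    // Stand-in for tokenStream(fieldName, reader): tokenizer first, then the filters in order.
    static List<String> analyze(String fieldName, Reader reader) throws IOException {
        Set<String> stopWords = new HashSet<String>(Arrays.asList("the", "a"));
        return removeStopWords(lowerCase(whitespaceTokens(reader)), stopWords);
    }

    public static void main(String[] args) throws IOException {
        // prints [quick, fox]
        System.out.println(analyze("body", new StringReader("The Quick  fox")));
    }
}
```

The filter order matters in this model: lowercasing runs before stop-word removal, so "The" is matched against the lowercase stop set.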
Constructor and Description |
---|
Analyzer() |
Modifier and Type | Method and Description |
---|---|
int | getPositionIncrementGap(String fieldName): Invoked before indexing a Fieldable instance if terms have already been added to that field. |
protected Object | getPreviousTokenStream(): Used by Analyzers that implement reusableTokenStream to retrieve previously saved TokenStreams for re-use by the same thread. |
TokenStream | reusableTokenStream(String fieldName, Reader reader): Creates a TokenStream that is allowed to be re-used from the previous time that the same thread called this method. |
protected void | setPreviousTokenStream(Object obj): Used by Analyzers that implement reusableTokenStream to save a TokenStream for later re-use by the same thread. |
abstract TokenStream | tokenStream(String fieldName, Reader reader): Creates a TokenStream which tokenizes all the text in the provided Reader. |
public abstract TokenStream tokenStream(String fieldName, Reader reader)
public TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException
Throws: IOException
protected Object getPreviousTokenStream()
protected void setPreviousTokenStream(Object obj)
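The per-thread reuse contract of reusableTokenStream, getPreviousTokenStream, and setPreviousTokenStream can be sketched in plain Java. This is a model under stated assumptions, not the library's implementation: `ReuseSketch` and `ResettableStream` are hypothetical, and the use of `java.lang.ThreadLocal` as the per-thread store is an assumption of the sketch.

```java
import java.io.Reader;
import java.io.StringReader;

// Plain-Java model of per-thread stream reuse (assumption: not Lucene classes).
class ReuseSketch {

    // Hypothetical reusable stream: reset(reader) points it at new input.
    static class ResettableStream {
        Reader input;
        int resets = 0;

        ResettableStream(Reader input) {
            this.input = input;
        }

        void reset(Reader newInput) {
            this.input = newInput;
            this.resets++;
        }
    }

    // One saved object per thread; ThreadLocal is this sketch's choice of store.
    private final ThreadLocal<Object> previous = new ThreadLocal<Object>();

    protected Object getPreviousTokenStream() {
        return previous.get();
    }

    protected void setPreviousTokenStream(Object obj) {
        previous.set(obj);
    }

    // First call on a thread builds and saves a stream;
    // later calls on the same thread reset and return the saved one.
    public ResettableStream reusableTokenStream(String fieldName, Reader reader) {
        ResettableStream saved = (ResettableStream) getPreviousTokenStream();
        if (saved == null) {
            saved = new ResettableStream(reader);
            setPreviousTokenStream(saved);
        } else {
            saved.reset(reader);
        }
        return saved;
    }

    public static void main(String[] args) {
        ReuseSketch analyzer = new ReuseSketch();
        ResettableStream first = analyzer.reusableTokenStream("body", new StringReader("one"));
        ResettableStream second = analyzer.reusableTokenStream("body", new StringReader("two"));
        System.out.println(first == second); // prints true
    }
}
```

Because the saved object is shared across calls on a thread, a caller in this model must finish consuming one stream before requesting the next on the same thread.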
public int getPositionIncrementGap(String fieldName)
Parameters: fieldName - Fieldable name being indexed.
Returns: position increment gap, added to the next token emitted from tokenStream(String,Reader)
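The effect of a position increment gap can be illustrated with simple arithmetic. The sketch below is a simplified model, not indexing code: `GapSketch` and `positions` are hypothetical names, every token is assumed to have a position increment of 1, and the gap is applied only when earlier values of the same field have already contributed terms.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java model of term positions across several values of one field.
class GapSketch {

    // values[i] holds the tokens of the i-th value indexed under the same field name.
    static List<Integer> positions(String[][] values, int gap) {
        List<Integer> out = new ArrayList<Integer>();
        int next = 0;               // position the next token will receive
        boolean termsAdded = false; // true once the field already has terms
        for (String[] tokens : values) {
            if (termsAdded) {
                next += gap;        // gap applies only after earlier terms exist
            }
            for (int i = 0; i < tokens.length; i++) {
                out.add(Integer.valueOf(next));
                next++;
                termsAdded = true;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String[][] values = { { "a", "b" }, { "c", "d" } };
        System.out.println(positions(values, 0));   // prints [0, 1, 2, 3]
        System.out.println(positions(values, 100)); // prints [0, 1, 102, 103]
    }
}
```

With a gap of 0 the terms of the two values sit at consecutive positions, so a phrase match could span the boundary between them; a large gap keeps the values apart.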
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.