Package | Description |
---|---|
org.apache.lucene.analysis |
API and code to convert text into indexable/searchable tokens.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese and Korean.
|
org.apache.lucene.analysis.cn |
Analyzer for Chinese.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.ngram | |
org.apache.lucene.analysis.nl |
Analyzer for Dutch.
|
org.apache.lucene.analysis.payloads |
Provides various convenience classes for creating payloads on Tokens.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.sinks |
Implementations of the SinkTokenizer that might be useful.
|
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.standard |
A fast grammar-based tokenizer constructed with JFlex.
|
org.apache.lucene.analysis.th | |
org.apache.lucene.index.memory |
High-performance single-document main memory Apache Lucene fulltext search index.
|
org.apache.lucene.search.highlight |
The highlight package contains classes to provide "keyword in context" features
typically used to highlight search terms in the text of results pages.
|
org.apache.lucene.wikipedia.analysis |
Modifier and Type | Method and Description |
---|---|
Token |
SinkTokenizer.next()
Returns the next token out of the list of cached tokens
|
Token |
TokenStream.next()
Returns the next token in the stream, or null at EOS.
|
Token |
CachingTokenFilter.next() |
Token |
LengthFilter.next(Token result)
Returns the next input Token whose termText() is the right len
|
Token |
PorterStemFilter.next(Token result) |
Token |
ISOLatin1AccentFilter.next(Token result) |
Token |
KeywordTokenizer.next(Token result) |
Token |
CharTokenizer.next(Token token) |
Token |
TokenStream.next(Token result)
Returns the next token in the stream, or null at EOS.
|
Token |
LowerCaseFilter.next(Token result) |
Token |
StopFilter.next(Token result)
Returns the next input Token whose termText() is not a stop word.
|
Token |
TeeTokenFilter.next(Token result) |
Modifier and Type | Method and Description |
---|---|
void |
SinkTokenizer.add(Token t)
Override this method to cache only certain tokens, or new tokens based
on the old tokens.
|
Token |
LengthFilter.next(Token result)
Returns the next input Token whose termText() is the right len
|
Token |
PorterStemFilter.next(Token result) |
Token |
ISOLatin1AccentFilter.next(Token result) |
Token |
KeywordTokenizer.next(Token result) |
Token |
CharTokenizer.next(Token token) |
Token |
TokenStream.next(Token result)
Returns the next token in the stream, or null at EOS.
|
Token |
LowerCaseFilter.next(Token result) |
Token |
StopFilter.next(Token result)
Returns the next input Token whose termText() is not a stop word.
|
Token |
TeeTokenFilter.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
BrazilianStemFilter.next() |
Modifier and Type | Method and Description |
---|---|
Token |
CJKTokenizer.next()
Returns the next token in the stream, or null at EOS.
|
Modifier and Type | Method and Description |
---|---|
Token |
ChineseFilter.next() |
Token |
ChineseTokenizer.next() |
Modifier and Type | Method and Description |
---|---|
Token |
GermanStemFilter.next() |
Modifier and Type | Method and Description |
---|---|
Token |
GreekLowerCaseFilter.next() |
Modifier and Type | Method and Description |
---|---|
Token |
FrenchStemFilter.next() |
Token |
ElisionFilter.next()
Returns the next input Token whith termText() without elisioned start
|
Modifier and Type | Method and Description |
---|---|
Token |
NGramTokenFilter.next()
Returns the next token in the stream, or null at EOS.
|
Token |
NGramTokenizer.next()
Returns the next token in the stream, or null at EOS.
|
Token |
EdgeNGramTokenFilter.next()
Returns the next token in the stream, or null at EOS.
|
Token |
EdgeNGramTokenizer.next()
Returns the next token in the stream, or null at EOS.
|
Modifier and Type | Method and Description |
---|---|
Token |
DutchStemFilter.next() |
Modifier and Type | Method and Description |
---|---|
Token |
NumericPayloadTokenFilter.next(Token result) |
Token |
TypeAsPayloadTokenFilter.next(Token result) |
Token |
TokenOffsetPayloadTokenFilter.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
NumericPayloadTokenFilter.next(Token result) |
Token |
TypeAsPayloadTokenFilter.next(Token result) |
Token |
TokenOffsetPayloadTokenFilter.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
RussianLowerCaseFilter.next() |
Token |
RussianStemFilter.next() |
Modifier and Type | Method and Description |
---|---|
void |
TokenTypeSinkTokenizer.add(Token t) |
void |
TokenRangeSinkTokenizer.add(Token t) |
void |
DateRecognizerSinkTokenizer.add(Token t) |
Modifier and Type | Method and Description |
---|---|
Token |
SnowballFilter.next()
Returns the next input Token, after being stemmed
|
Modifier and Type | Method and Description |
---|---|
Token |
StandardFilter.next(Token result)
Returns the next token in the stream, or null at EOS.
|
Token |
StandardTokenizer.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
StandardFilter.next(Token result)
Returns the next token in the stream, or null at EOS.
|
Token |
StandardTokenizer.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
ThaiWordFilter.next() |
Modifier and Type | Method and Description |
---|---|
protected Token |
SynonymTokenFilter.createToken(String synonym,
Token current)
Creates and returns a token for the given synonym of the current input
token; Override for custom (stateless or stateful) behaviour, if desired.
|
Token |
SynonymTokenFilter.next()
Returns the next token in the stream, or null at EOS.
|
Modifier and Type | Method and Description |
---|---|
protected Token |
SynonymTokenFilter.createToken(String synonym,
Token current)
Creates and returns a token for the given synonym of the current input
token; Override for custom (stateless or stateful) behaviour, if desired.
|
Modifier and Type | Method and Description |
---|---|
Token |
TokenGroup.getToken(int index) |
Modifier and Type | Method and Description |
---|---|
float |
QueryScorer.getTokenScore(Token token) |
float |
Scorer.getTokenScore(Token token)
Called for each token in the current fragment
|
boolean |
SimpleFragmenter.isNewFragment(Token token) |
boolean |
Fragmenter.isNewFragment(Token nextToken)
Test to see if this token from the stream should be held in a new TextFragment
|
boolean |
NullFragmenter.isNewFragment(Token token) |
Modifier and Type | Method and Description |
---|---|
Token |
WikipediaTokenizer.next(Token result) |
Modifier and Type | Method and Description |
---|---|
Token |
WikipediaTokenizer.next(Token result) |
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.