Field
that is
tokenized
,
not stored
,
termVectorStored with positions
(or
termVectorStored with positions and offsets
),addField(fieldName, stream, 1.0f)
.Field
.maxTokens
tokens from the underlying child analyzer,
ignoring all remaining tokens.SynonymTokenFilter
.tokenStream(String fieldName, Reader reader)
if the fieldName has been seen before, altogether ignoring the Reader parameter on cache lookup."\\W+"
; Divides text at non-letters (NOT Character.isLetter(c))Reader
, that can flexibly separate text into terms via a regular expression Pattern
(with behaviour identical to String.split(String)
),
and that combines the functionality of
LetterTokenizer
,
LowerCaseTokenizer
,
WhitespaceTokenizer
,
StopFilter
into a single efficient
multi-purpose class.tokenStream(String, String)
and is
less efficient than tokenStream(String, String)
."\\s+"
; Divides text at whitespaces (Character.isWhitespace(c))Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.