org.apache.lucene.analysis.cz
public final class CzechAnalyzer extends Analyzer
Modifier and Type | Field and Description |
---|---|
static String[] |
CZECH_STOP_WORDS
List of typical stopwords.
|
Constructor and Description |
---|
CzechAnalyzer()
Builds an analyzer with the default stop words (
CZECH_STOP_WORDS ). |
CzechAnalyzer(File stopwords)
Builds an analyzer with the given stop words.
|
CzechAnalyzer(HashSet stopwords) |
CzechAnalyzer(String[] stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
void |
loadStopWords(InputStream wordfile,
String encoding)
Loads stopwords hash from resource stream (file, database...).
|
TokenStream |
tokenStream(String fieldName,
Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
|
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
public static final String[] CZECH_STOP_WORDS
public CzechAnalyzer()
CZECH_STOP_WORDS
).public CzechAnalyzer(String[] stopwords)
public CzechAnalyzer(HashSet stopwords)
public CzechAnalyzer(File stopwords) throws IOException
IOException
public void loadStopWords(InputStream wordfile, String encoding)
wordfile
- File containing the wordlistencoding
- Encoding used (win-1250, iso-8859-2, ...), null for default system encodingpublic final TokenStream tokenStream(String fieldName, Reader reader)
tokenStream
in class Analyzer
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.