org.apache.lucene.analysis.cz
public final class CzechAnalyzer extends Analyzer
| Field Summary | |
|---|---|
| static String[] | CZECH_STOP_WORDS: List of typical stopwords. |
| Constructor Summary | |
|---|---|
| CzechAnalyzer() | Builds an analyzer with the default stop words (CZECH_STOP_WORDS). |
| CzechAnalyzer(String[] stopwords) | Builds an analyzer with the given stop words. |
| CzechAnalyzer(HashSet stopwords) | |
| CzechAnalyzer(File stopwords) | Builds an analyzer with the given stop words. |
| Method Summary | |
|---|---|
| void | loadStopWords(InputStream wordfile, String encoding): Loads the stopwords hash from a resource stream (file, database, ...). |
| TokenStream | tokenStream(String fieldName, Reader reader): Creates a TokenStream which tokenizes all the text in the provided Reader. |

Parameters (loadStopWords): wordfile - File containing the wordlist; encoding - Encoding used (win-1250, iso-8859-2, ...), null for default system encoding
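
To make the roles of wordfile and encoding concrete, here is a minimal plain-Java sketch of reading a stop word list from a stream. It is not the Lucene implementation: the class StopWordLoaderSketch, the method load, and the one-word-per-line file layout are assumptions made for illustration. Note also that Java charset names differ from the shorthand above (for example "windows-1250" rather than "win-1250").

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene.
final class StopWordLoaderSketch {

    // Reads stop words from the stream, assuming one word per line.
    static Set<String> load(InputStream wordfile, String encoding) {
        // A null encoding falls back to the platform default charset,
        // mirroring the parameter description above.
        Charset charset = (encoding == null)
                ? Charset.defaultCharset()
                : Charset.forName(encoding);
        Set<String> words = new HashSet<>();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(wordfile, charset))) {
            String line;
            while ((line = reader.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {   // skip blank lines
                    words.add(word);
                }
            }
        } catch (IOException e) {
            // Wrapped so callers of this sketch need no checked-exception handling.
            throw new UncheckedIOException(e);
        }
        return words;
    }
}
```

Decoding with the wrong charset would silently corrupt accented Czech letters, so the stop words would no longer match the analyzed text; that is why the encoding is passed explicitly.
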
Returns (tokenStream): A TokenStream built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, and StopFilter
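
The effect of the chain described above (tokenize, lowercase, drop stop words) can be approximated in plain Java. This is only a rough stand-in under stated assumptions: it does not call Lucene, the class AnalyzerChainSketch is hypothetical, the splitting rule is far simpler than StandardTokenizer, the StandardFilter step is omitted, and the four stop words are a sample rather than the contents of CZECH_STOP_WORDS.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for the analysis chain; not Lucene code.
final class AnalyzerChainSketch {

    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> tokens = new ArrayList<>();
        // Simplified tokenizer: split on runs of non-letter, non-digit characters.
        for (String raw : text.split("[^\\p{L}\\p{N}]+")) {
            if (raw.isEmpty()) {
                continue;                                  // leading delimiter or empty input
            }
            String lower = raw.toLowerCase(Locale.ROOT);   // lowercase step
            if (!stopWords.contains(lower)) {              // stop word step
                tokens.add(lower);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Sample stop words chosen for the demo only.
        Set<String> stopWords = new HashSet<>(Arrays.asList("a", "je", "v", "na"));
        System.out.println(analyze("Pes je v parku a kos je na stromu.", stopWords));
        // prints [pes, parku, kos, stromu]
    }
}
```

Lowercasing happens before the stop word lookup, so a capitalized stop word at the start of a sentence is still removed.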