org.apache.lucene.analysis
public final class LowerCaseTokenizer extends LetterTokenizer
Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.
| Constructor Summary | |
|---|---|
| LowerCaseTokenizer(Reader in) Construct a new LowerCaseTokenizer. | |
| Method Summary | |
|---|---|
| protected char | normalize(char c) Collects only characters which satisfy
Character#isLetter(char). |