Package org.apache.lucene.analysis.ja
Analyzer for Japanese.
Class Summary

Class                            Description
GraphvizFormatter                Outputs the dot (graphviz) string for the viterbi lattice.
JapaneseAnalyzer                 Analyzer for Japanese that uses morphological analysis.
JapaneseBaseFormFilter           Replaces term text with the BaseFormAttribute.
JapaneseKatakanaStemFilter       A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
JapanesePartOfSpeechStopFilter   Removes tokens that match a set of part-of-speech tags.
JapaneseReadingFormFilter        A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
JapaneseTokenizer                Tokenizer for Japanese that uses morphological analysis.
Token                            Analyzed token with morphological data from its dictionary.
Enum Summary

Enum                             Description
JapaneseTokenizer.Mode           Tokenization mode: this determines how the tokenizer handles compound and unknown words.
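
Example

The normalization described for JapaneseKatakanaStemFilter (removing a trailing long sound character, U+30FC) can be illustrated with a small standalone sketch. This is not the filter's source and does not use the Lucene API: the class name KatakanaStemSketch, the helper methods, and the minimum term length of 4 are assumptions made for illustration only.

```java
public class KatakanaStemSketch {
    // U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK, the "long sound character".
    private static final char PROLONGED_SOUND_MARK = '\u30FC';

    // Assumed threshold: shorter terms are left untouched so that short
    // words ending in the long sound character are not truncated.
    private static final int MIN_LENGTH = 4;

    // Returns true if every character of the term lies in the Unicode Katakana block.
    static boolean isKatakana(String term) {
        for (int i = 0; i < term.length(); i++) {
            if (Character.UnicodeBlock.of(term.charAt(i)) != Character.UnicodeBlock.KATAKANA) {
                return false;
            }
        }
        return true;
    }

    // Drops one trailing long sound character from a sufficiently long
    // all-katakana term; returns any other term unchanged.
    static String stem(String term) {
        if (term.length() >= MIN_LENGTH
                && term.charAt(term.length() - 1) == PROLONGED_SOUND_MARK
                && isKatakana(term)) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    public static void main(String[] args) {
        System.out.println(stem("コンピューター")); // prints コンピュータ
        System.out.println(stem("コピー"));         // prints コピー (shorter than MIN_LENGTH)
    }
}
```

With this sketch, two spellings of the same loanword that differ only in the final long sound character reduce to the same string, which is the effect the filter description refers to.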