public class CJKAnalyzer
extends org.apache.lucene.analysis.Analyzer
Modifier and Type | Field and Description |
---|---|
static java.lang.String[] | STOP_WORDS: An array containing some common English words that are not usually useful for searching and some double-byte punctuation marks. |
Constructor and Description |
---|
CJKAnalyzer(): Deprecated. Use CJKAnalyzer(Version) instead. |
CJKAnalyzer(java.lang.String[] stopWords): Deprecated. Use CJKAnalyzer(Version, String[]) instead. |
CJKAnalyzer(org.apache.lucene.util.Version matchVersion): Builds an analyzer which removes words in STOP_WORDS. |
CJKAnalyzer(org.apache.lucene.util.Version matchVersion, java.lang.String[] stopWords): Builds an analyzer which removes words in the provided array. |
Modifier and Type | Method and Description |
---|---|
org.apache.lucene.analysis.TokenStream | reusableTokenStream(java.lang.String fieldName, java.io.Reader reader): Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader. |
org.apache.lucene.analysis.TokenStream | tokenStream(java.lang.String fieldName, java.io.Reader reader): Creates a TokenStream which tokenizes all the text in the provided Reader. |
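public static final java.lang.String[] STOP_WORDS
An array containing some common English words that are not usually useful for searching and some double-byte punctuation marks.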

public CJKAnalyzer()
Deprecated. Use CJKAnalyzer(Version) instead.
Builds an analyzer which removes words in STOP_WORDS.

public CJKAnalyzer(org.apache.lucene.util.Version matchVersion)
Builds an analyzer which removes words in STOP_WORDS.

public CJKAnalyzer(java.lang.String[] stopWords)
Deprecated. Use CJKAnalyzer(Version, String[]) instead.
Builds an analyzer which removes words in the provided array.
Parameters:
stopWords - stop word array

public CJKAnalyzer(org.apache.lucene.util.Version matchVersion, java.lang.String[] stopWords)
Builds an analyzer which removes words in the provided array.
Parameters:
stopWords - stop word array

public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
Specified by:
tokenStream in class org.apache.lucene.analysis.Analyzer
Parameters:
fieldName - Lucene field name
reader - input Reader
Returns:
A TokenStream built from CJKTokenizer, filtered with StopFilter
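
The "CJKTokenizer, filtered with StopFilter" chain described above can be pictured with a small standalone model. The sketch below is not Lucene code: the class CjkBigramSketch and its methods isCjk, tokenize and analyze are hypothetical names invented for illustration. It assumes the commonly documented CJKTokenizer behaviour of emitting overlapping two-character tokens for runs of CJK text and lowercased words for Latin text, then drops any token found in the stop word array.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Illustrative model only; this is NOT Lucene's CJKTokenizer or StopFilter.
class CjkBigramSketch {

    // Hypothetical helper: which characters this sketch treats as CJK.
    static boolean isCjk(char c) {
        Character.UnicodeBlock block = Character.UnicodeBlock.of(c);
        return block == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
            || block == Character.UnicodeBlock.HIRAGANA
            || block == Character.UnicodeBlock.KATAKANA
            || block == Character.UnicodeBlock.HANGUL_SYLLABLES;
    }

    // Emits overlapping bigrams for CJK runs and lowercased words otherwise.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            char c = text.charAt(i);
            if (isCjk(c)) {
                int start = i;
                while (i < text.length() && isCjk(text.charAt(i))) {
                    i++;
                }
                if (i - start == 1) {
                    tokens.add(text.substring(start, i)); // lone CJK character
                } else {
                    for (int j = start; j + 1 < i; j++) {
                        tokens.add(text.substring(j, j + 2)); // overlapping pair
                    }
                }
            } else if (Character.isLetterOrDigit(c)) {
                int start = i;
                while (i < text.length()
                        && Character.isLetterOrDigit(text.charAt(i))
                        && !isCjk(text.charAt(i))) {
                    i++;
                }
                tokens.add(text.substring(start, i).toLowerCase(Locale.ROOT));
            } else {
                i++; // whitespace and punctuation separate tokens
            }
        }
        return tokens;
    }

    // Tokenizes, then removes every token contained in stopWords.
    static List<String> analyze(String text, String[] stopWords) {
        Set<String> stop = new HashSet<>(Arrays.asList(stopWords));
        List<String> kept = new ArrayList<>();
        for (String token : tokenize(text)) {
            if (!stop.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // \u65e5\u672c\u8a9e is a three-character CJK run ("Japanese language").
        System.out.println(analyze("the \u65e5\u672c\u8a9e text", new String[] {"the"}));
    }
}
```

In this model the three-character CJK run yields two overlapping bigrams, the stop word "the" is removed, and "text" passes through unchanged. Passing a different array to analyze plays the role of the stopWords constructor argument.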

public final org.apache.lucene.analysis.TokenStream reusableTokenStream(java.lang.String fieldName, java.io.Reader reader) throws java.io.IOException
Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.
Overrides:
reusableTokenStream in class org.apache.lucene.analysis.Analyzer
Parameters:
fieldName - Lucene field name
reader - input Reader
Returns:
A TokenStream built from CJKTokenizer, filtered with StopFilter
Throws:
java.io.IOException
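
The difference between "creates" and "returns a (possibly reused)" can be sketched as a per-thread cache. The model below is an assumption about how such reuse is typically arranged, not a copy of Lucene's implementation: ReuseSketch, its nested Stream class, and the reset and readAll methods are hypothetical names. The first call on a thread allocates a stream; later calls on the same thread point that same instance at the new Reader.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Illustrative model of per-thread stream reuse; not Lucene's implementation.
class ReuseSketch {

    // Hypothetical stand-in for a tokenizer chain that can be re-targeted.
    static final class Stream {
        private Reader input;

        Stream(Reader input) {
            this.input = input;
        }

        // Points the existing stream at fresh input instead of allocating.
        void reset(Reader newInput) {
            this.input = newInput;
        }

        // Drains the current input; stands in for iterating over tokens.
        String readAll() {
            try {
                StringBuilder sb = new StringBuilder();
                int c;
                while ((c = input.read()) != -1) {
                    sb.append((char) c);
                }
                return sb.toString();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }

    // One cached stream per thread, so threads never share an instance.
    private final ThreadLocal<Stream> previous = new ThreadLocal<>();

    // Models tokenStream: always builds a new stream.
    Stream tokenStream(Reader reader) {
        return new Stream(reader);
    }

    // Models reusableTokenStream: builds once per thread, then re-targets.
    Stream reusableTokenStream(Reader reader) {
        Stream stream = previous.get();
        if (stream == null) {
            stream = new Stream(reader);
            previous.set(stream);
        } else {
            stream.reset(reader);
        }
        return stream;
    }

    public static void main(String[] args) {
        ReuseSketch analyzer = new ReuseSketch();
        Stream first = analyzer.reusableTokenStream(new StringReader("one"));
        System.out.println(first.readAll());
        Stream second = analyzer.reusableTokenStream(new StringReader("two"));
        System.out.println(second.readAll());
        System.out.println(first == second); // same instance on this thread
    }
}
```

One consequence of this design is that a stream obtained from the reusable variant should be fully consumed before the same thread asks for another, because the second call re-targets the instance the caller is still holding.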
Copyright © 2000-2019 Apache Software Foundation. All Rights Reserved.