Package org.carrot2.language.extras
Class LuceneAnalyzerTokenizerAdapter
java.lang.Object
org.carrot2.language.extras.LuceneAnalyzerTokenizerAdapter
- All Implemented Interfaces:
org.carrot2.language.Tokenizer
public class LuceneAnalyzerTokenizerAdapter
extends Object
implements org.carrot2.language.Tokenizer
-
Field Summary
Fields inherited from interface org.carrot2.language.Tokenizer
TF_COMMON_WORD, TF_QUERY_WORD, TF_SEPARATOR_DOCUMENT, TF_SEPARATOR_FIELD, TF_SEPARATOR_SENTENCE, TF_TERMINATOR, TT_ACRONYM, TT_BARE_URL, TT_EMAIL, TT_EOF, TT_FILE, TT_FULL_URL, TT_HYPHTERM, TT_NUMERIC, TT_PUNCTUATION, TT_TERM, TYPE_MASK
-
Constructor Summary
ConstructorsConstructorDescriptionLuceneAnalyzerTokenizerAdapter
(org.apache.lucene.analysis.Analyzer analyzer) -
Method Summary
Modifier and TypeMethodDescriptionshort
void
void
setTermBuffer
(org.carrot2.util.MutableCharArray array)
-
Constructor Details
-
LuceneAnalyzerTokenizerAdapter
public LuceneAnalyzerTokenizerAdapter(org.apache.lucene.analysis.Analyzer analyzer)
-
-
Method Details
-
reset
- Specified by:
reset
in interfaceorg.carrot2.language.Tokenizer
- Throws:
IOException
-
nextToken
public short nextToken()- Specified by:
nextToken
in interfaceorg.carrot2.language.Tokenizer
-
setTermBuffer
public void setTermBuffer(org.carrot2.util.MutableCharArray array) - Specified by:
setTermBuffer
in interfaceorg.carrot2.language.Tokenizer
-