Class LuceneAnalyzerTokenizerAdapter

java.lang.Object
org.carrot2.language.extras.LuceneAnalyzerTokenizerAdapter
All Implemented Interfaces:
org.carrot2.language.Tokenizer

public class LuceneAnalyzerTokenizerAdapter extends Object implements org.carrot2.language.Tokenizer
  • Field Summary

    Fields inherited from interface org.carrot2.language.Tokenizer

    TF_COMMON_WORD, TF_QUERY_WORD, TF_SEPARATOR_DOCUMENT, TF_SEPARATOR_FIELD, TF_SEPARATOR_SENTENCE, TF_TERMINATOR, TT_ACRONYM, TT_BARE_URL, TT_EMAIL, TT_EOF, TT_FILE, TT_FULL_URL, TT_HYPHTERM, TT_NUMERIC, TT_PUNCTUATION, TT_TERM, TYPE_MASK
  • Constructor Summary

    Constructors
    Constructor
    Description
    LuceneAnalyzerTokenizerAdapter(org.apache.lucene.analysis.Analyzer analyzer)
     
  • Method Summary

    Modifier and Type
    Method
    Description
    short
     
    void
    reset(Reader reader)
     
    void
    setTermBuffer(org.carrot2.util.MutableCharArray array)
     

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • LuceneAnalyzerTokenizerAdapter

      public LuceneAnalyzerTokenizerAdapter(org.apache.lucene.analysis.Analyzer analyzer)
  • Method Details

    • reset

      public void reset(Reader reader) throws IOException
      Specified by:
      reset in interface org.carrot2.language.Tokenizer
      Throws:
      IOException
    • nextToken

      public short nextToken()
      Specified by:
      nextToken in interface org.carrot2.language.Tokenizer
    • setTermBuffer

      public void setTermBuffer(org.carrot2.util.MutableCharArray array)
      Specified by:
      setTermBuffer in interface org.carrot2.language.Tokenizer