Class ApostropheFilter

  extended by org.apache.lucene.util.AttributeSource
      extended by org.apache.lucene.analysis.TokenStream
          extended by org.apache.lucene.analysis.TokenFilter
              extended by com.jalios.util.lucene.ApostropheFilter

public final class ApostropheFilter
extends org.apache.lucene.analysis.TokenFilter

Remove starting apostrophe from tokens extracted with StandardFilter. Used in french to remove first part of token such as:

  • "l'amour" to "amour"
  • "d'ailleurs" to "ailleurs"
  • "c'est" to "est"
  • "j'avais" to "avais"
  • "m'avait" to "avait"
  • "n'avez" to "avez"
  • "s'était" to "était"
  • "t'étais" to "étais"
  • Inspired from StandardFilter.

    Olivier Jaquemet

    Nested Class Summary
    Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
    org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
    Field Summary
    Fields inherited from class org.apache.lucene.analysis.TokenFilter
    Constructor Summary
    ApostropheFilter(org.apache.lucene.analysis.TokenStream in)
              Construct new ApostropheFilter filtering for specified stream.
    Method Summary
     org.apache.lucene.analysis.Token next()
              Returns the next token in the stream, or null at EOS.
    Methods inherited from class org.apache.lucene.analysis.TokenFilter
    close, end, reset
    Methods inherited from class org.apache.lucene.analysis.TokenStream
    getOnlyUseNewAPI, incrementToken, next, setOnlyUseNewAPI
    Methods inherited from class org.apache.lucene.util.AttributeSource
    addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString
    Methods inherited from class java.lang.Object
    clone, finalize, getClass, notify, notifyAll, wait, wait, wait

    Constructor Detail


    public ApostropheFilter(org.apache.lucene.analysis.TokenStream in)
    Construct new ApostropheFilter filtering for specified stream.

    in - the TokenStream to be filtered
    Method Detail


    public org.apache.lucene.analysis.Token next()
                                          throws IOException
    Returns the next token in the stream, or null at EOS.

    Removes [cdjlmnstCDJLMNST]' from the start of words.

    next in class org.apache.lucene.analysis.TokenStream

    Copyright © 2001-2010 Jalios SA. All Rights Reserved.