[pylucene-dev] PyLucene custom charTokenizer?

keekles keekles kneaks at gmail.com
Thu Jul 19 22:55:20 PDT 2007


I'm not that familiar with lucene, but basically what im looking to
accomplish is the equivalent of a whitespace tokenizer with my own list of
delimiters, in Lucene docs it just looks like simple inheritance but I dont
really see any examples in PyLucene on how to subclass a charTokenizer other
then the class from the lia SimpleKeywordAnalyzer which does not appear to
be used or work as far as i can see. I realize this is probably a bit out of
place to be asking here, but could someone explain or show me a valid
example of a custom analyzer using a custom charTokenizer in PyLucene?

Thanks in advance
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.osafoundation.org/pipermail/pylucene-dev/attachments/20070720/dd96dd68/attachment.html


More information about the pylucene-dev mailing list