[pylucene-dev] other tokenizers ?
Joh N.
joh12005 at yahoo.fr
Thu Jun 30 00:32:16 PDT 2005
Hello,
(sorry, i forgot to add this question in my previous
mail)
is PyLucene able to handle a custom tokenization
without any stemming process ?
actually i would like to feed the index myself with
words from different languages (thus inconsistant
tokenization), but also sgml tags, and maybe even some
numbers,
will it be possible ? where can i found hints on where
to look after that ?
best regards,
J.
___________________________________________________________________________
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger
Téléchargez cette version sur http://fr.messenger.yahoo.com
More information about the pylucene-dev
mailing list