[pylucene-dev] General strategy for may fields
Andi Vajda
vajda at osafoundation.org
Tue Feb 21 10:05:55 PST 2006
On Tue, 21 Feb 2006, David Pratt wrote:
> Hi there. I have been working with semantic web type application for a while
> and it appears to me pylucene could help me to get a much needed speed boost.
> I have a couple of question to start.
>
> My schema is fairly involved and has more than 100 elements. It would be
> great to perform a search on any of these but how practical is that?
> Should I be creating a summary type search using a smaller index of say 10 -
> 12 fields and then have detailed search based on a larger broader number of
> fields in a different index. Any recommendations would be helpful. There is
> no doubt I will have to create a query parser for my app.
>
> Second questions is how can I get pydocs so I am aware of what functionality
> exists and also syntax. I see examples in the samples folder but some
> description of classes and methods is important. Is this available somewhere?
>
> My last initial question has to do with sorting. I see that there are
> advanced possibilities with the indexes to sort and filter. How advisable is
> using sort for large record sets. For example, say you have got 20000 records
> returned from your search. Because this will have a web interface I will only
> be showing first 20 likely so it will be batching results. Is the sorting
> filtering highly memory intensive?
These are all good questions best asked of the java-user at lucene.apache.org
mailing list. PyLucene is a compilation of Java Lucene integrated with Python
via SWIG. The pylucene-dev mailing list can help you with python-specific
Lucene questions and bugs. For help about Lucene in general, please refer to
these sources:
- java-user at lucene.apache.org
don't let the 'java' in the name of that list put you off, none of your
questions are java or python specific.
- java-dev at lucene.apache.org
for help in extending or adapting Java Lucene.
- the book 'Lucene in Action' written by Erik Hatcher and Otis Gospodnetic,
two of the Java Lucene developers. Most samples in this book were ported
to PyLucene and are in its 'samples/LuceneInAction' directory.
Andi..
More information about the pylucene-dev
mailing list