[pylucene-dev] General strategy for may fields

Andi Vajda vajda at osafoundation.org
Tue Feb 21 10:05:55 PST 2006


On Tue, 21 Feb 2006, David Pratt wrote:

> Hi there. I have been working with semantic web type application for a while 
> and it appears to me pylucene could help me to get a much needed speed boost. 
> I have a couple of question to start.
>
> My schema is fairly involved and has more than 100 elements. It would be 
> great to perform a search on any of these but how practical is that?
> Should I be creating a summary type search using a smaller index of say 10 - 
> 12 fields and then have detailed search based on a larger broader number of 
> fields in a different index. Any recommendations would be helpful. There is 
> no doubt I will have to create a query parser for my app.
>
> Second questions is how can I get pydocs so I am aware of what functionality 
> exists and also syntax. I see examples in the samples folder but some 
> description of classes and methods is important. Is this available somewhere?
>
> My last initial question has to do with sorting. I see that there are 
> advanced possibilities with the indexes to sort and filter. How advisable is 
> using sort for large record sets. For example, say you have got 20000 records 
> returned from your search. Because this will have a web interface I will only 
> be showing first 20 likely so it will be batching results. Is the sorting 
> filtering highly memory intensive?

These are all good questions best asked of the java-user at lucene.apache.org 
mailing list. PyLucene is a compilation of Java Lucene integrated with Python 
via SWIG. The pylucene-dev mailing list can help you with python-specific 
Lucene questions and bugs. For help about Lucene in general, please refer to 
these sources:

   - java-user at lucene.apache.org
     don't let the 'java' in the name of that list put you off, none of your
     questions are java or python specific.

   - java-dev at lucene.apache.org
     for help in extending or adapting Java Lucene.

   - the book 'Lucene in Action' written by Erik Hatcher and Otis Gospodnetic,
     two of the Java Lucene developers. Most samples in this book were ported
     to PyLucene and are in its 'samples/LuceneInAction' directory.

Andi..


More information about the pylucene-dev mailing list