[pylucene-dev] General strategy for many fields

Andi Vajda vajda at osafoundation.org
Tue Feb 21 11:29:51 PST 2006


On Tue, 21 Feb 2006, David Pratt wrote:

> Hi Andi. Many thanks for your reply. The pydocs question I believe is 
> relevant. Has anyone assembled this type of documentation anywhere for 
> pylucene?

Sorry, I missed that one. I saw the ones about schema and sorting and thought 
'java-user at lucene.apache.org'.

To answer your question about pydocs: no, there are no pydocs for PyLucene. The 
README file (http://svn.osafoundation.org/pylucene/trunk/README) contains the 
API documentation for PyLucene-specific pythonic API additions and changes. 
For APIs that did not change, please refer to the Java Lucene javadocs
(http://lucene.apache.org/java/docs/api/index.html).

> I'll post the other questions to the lists as you suggest. I am just 
> beginning with pylucene so I may bring forward some relevant python questions 
> once I have experimented a bit more. I am currently working through examples 
> and reading what I can find about lucene online.

Definitely feel free to ask questions. I was referring you to the 
java-user at lucene.apache.org mailing list because there are a lot more Lucene 
experts hanging out there. If there is anything I or others reading 
pylucene-dev can answer, we sure will.

Andi..

>
> Regards,
> David
>
>
> Andi Vajda wrote:
>> 
>> On Tue, 21 Feb 2006, David Pratt wrote:
>> 
>>> Hi there. I have been working with a semantic web type application for a 
>>> while and it appears to me pylucene could help me get a much needed 
>>> speed boost. I have a couple of questions to start.
>>> 
>>> My schema is fairly involved and has more than 100 elements. It would be 
>>> great to perform a search on any of these but how practical is that?
>>> Should I be creating a summary type search using a smaller index of say 10 
>>> - 12 fields and then have a detailed search based on a larger, broader 
>>> number of fields in a different index? Any recommendations would be 
>>> helpful. There is no doubt I will have to create a query parser for my app.
>>> 
>>> My second question is how can I get pydocs so I am aware of what 
>>> functionality exists and also the syntax. I see examples in the samples 
>>> folder but some description of classes and methods is important. Is this 
>>> available somewhere?
>>> 
>>> My last initial question has to do with sorting. I see that there are 
>>> advanced possibilities with the indexes to sort and filter. How advisable 
>>> is using sort for large record sets? For example, say you have got 20000 
>>> records returned from your search. Because this will have a web interface 
>>> I will likely only be showing the first 20, so it will be batching results. 
>>> Is the sorting and filtering highly memory intensive?
>> 
>> 
>> These are all good questions best asked of the java-user at lucene.apache.org 
>> mailing list. PyLucene is a compilation of Java Lucene integrated with 
>> Python via SWIG. The pylucene-dev mailing list can help you with 
>> python-specific Lucene questions and bugs. For help about Lucene in 
>> general, please refer to these sources:
>>
>>   - java-user at lucene.apache.org
>>     don't let the 'java' in the name of that list put you off, none of your
>>     questions are java or python specific.
>>
>>   - java-dev at lucene.apache.org
>>     for help in extending or adapting Java Lucene.
>>
>>   - the book 'Lucene in Action' written by Erik Hatcher and Otis Gospodnetic,
>>     two of the Java Lucene developers. Most samples in this book were ported
>>     to PyLucene and are in its 'samples/LuceneInAction' directory.
>> 
>> Andi..
>> _______________________________________________
>> pylucene-dev mailing list
>> pylucene-dev at osafoundation.org
>> http://lists.osafoundation.org/mailman/listinfo/pylucene-dev
>> 
>

