[pylucene-dev] General strategy for may fields
vajda at osafoundation.org
Tue Feb 21 11:29:51 PST 2006
On Tue, 21 Feb 2006, David Pratt wrote:
> Hi Andi. Many thanks for your reply. The pydocs question I believe is
> relevant. Has anyone assembled this type of documentation anywhere for
Sorry, I missed that one. I saw the ones about schema and sorting and thought
'java-user at lucene.apache.org'.
To answer your queetion about pydocs, no, there is no pydocs for PyLucene. The
README file (http://svn.osafoundation.org/pylucene/trunk/README) contains the
API documentation for PyLucene-specific pythonic API additions and changes.
For APIs that did not change, please refer to the Java Lucene javadocs.
> I'll post the other questions to the lists as you suggest. I am just
> beginning with pylucene so I may bring forward some relevant python questions
> once I experimented a bit more. I am currently working through examples and
> reading what I can find about lucene online.
Definitely feel free to ask questions. I was referring you to the
java-user at lucene.apache.org mailing list because there are a lot more Lucene
experts hanging out there. If there is anything I or others reading
pylucene-dev can answer, we sure will.
> Andi Vajda wrote:
>> On Tue, 21 Feb 2006, David Pratt wrote:
>>> Hi there. I have been working with semantic web type application for a
>>> while and it appears to me pylucene could help me to get a much needed
>>> speed boost. I have a couple of question to start.
>>> My schema is fairly involved and has more than 100 elements. It would be
>>> great to perform a search on any of these but how practical is that?
>>> Should I be creating a summary type search using a smaller index of say 10
>>> - 12 fields and then have detailed search based on a larger broader number
>>> of fields in a different index. Any recommendations would be helpful.
>>> There is no doubt I will have to create a query parser for my app.
>>> Second questions is how can I get pydocs so I am aware of what
>>> functionality exists and also syntax. I see examples in the samples folder
>>> but some description of classes and methods is important. Is this
>>> available somewhere?
>>> My last initial question has to do with sorting. I see that there are
>>> advanced possibilities with the indexes to sort and filter. How advisable
>>> is using sort for large record sets. For example, say you have got 20000
>>> records returned from your search. Because this will have a web interface
>>> I will only be showing first 20 likely so it will be batching results. Is
>>> the sorting filtering highly memory intensive?
>> These are all good questions best asked of the java-user at lucene.apache.org
>> mailing list. PyLucene is a compilation of Java Lucene integrated with
>> Python via SWIG. The pylucene-dev mailing list can help you with
>> python-specific Lucene questions and bugs. For help about Lucene in
>> general, please refer to these sources:
>> - java-user at lucene.apache.org
>> don't let the 'java' in the name of that list put you off, none of your
>> questions are java or python specific.
>> - java-dev at lucene.apache.org
>> for help in extending or adapting Java Lucene.
>> - the book 'Lucene in Action' written by Erik Hatcher and Otis
>> two of the Java Lucene developers. Most samples in this book were
>> to PyLucene and are in its 'samples/LuceneInAction' directory.
>> pylucene-dev mailing list
>> pylucene-dev at osafoundation.org
More information about the pylucene-dev