[Cosmo] iCalendar data queries
Cyrus Daboo
cyrus+lists.cosmo at daboo.name
Mon Nov 14 12:16:47 PST 2005
Hi Bobby,
--On November 14, 2005 2:58:50 PM -0500 Bobby Rullo <br at osafoundation.org>
wrote:
>> After some more examination of Jackrabbit code I discovered that it
>> was possible to write indexers for arbitrary data formats. By default
>> Jackrabbit just ships with a text/plain indexer, but there are
>> indexers for PDF, Word, Excel etc in the contrib directory. So what I
>> did was the following:
>>
>
> You mean Lucene, not Jackrabbit right?
>
No I mean Jackrabbit. What I did get wrong was use the term 'indexer' in
the scope of Jackrabbit. Jackrabbit does not do indexing - lucene does.
What Jackrabbit does is filter 'binary' data into a format suitable for
feeding into lucene's text indexer. In the case of text/plain resources, it
simply hands off the text to lucene. For PDF etc, it extracts the human
readable text portion from the binary data, and then feeds that to lucene.
--
Cyrus Daboo
More information about the Cosmo
mailing list