[pylucene-dev] ArrayIndexOutOfBoundsException with Field.TermVector.WITH_OFFSETS

Andi Vajda vajda at osafoundation.org
Fri Aug 18 02:12:18 PDT 2006


On Thu, 17 Aug 2006, Steven Bethard wrote:

> I'm getting ArrayIndexOutOfBoundsExceptions thrown whenever I use
> Field.TermVector.WITH_OFFSETS, Field.TermVector.WITH_POSITIONS or
> Field.TermVector.WITH_POSITIONS_OFFSETS, though the errors show up
> somewhat sporadically.  Here's an interactive session where I was able
> to provoke one:

Given that the error seems to come from inside Java Lucene, it could be 
either:
   - a gcj compilation error: please send us your gcj version and OS details
     (I've had to patch around various such bugs)
   - a Java Lucene bug: check whether the equivalent Java program triggers
     the same error in Java Lucene and, if so, contact java-user at
     lucene.apache.org or file a Java Lucene bug
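
For the gcj item, the requested details could be gathered with something
like the sketch below. It assumes Python 3.7 or later and that gcj, if
installed, is on the PATH; the helper name collect_environment_details is
hypothetical and not part of PyLucene.

```python
import platform
import subprocess


def collect_environment_details():
    """Return OS, Python, and gcj version details as a dict of strings."""
    details = {
        "os": platform.platform(),
        "machine": platform.machine(),
        "python": platform.python_version(),
    }
    try:
        # GCC front ends, gcj included, report their version with --version.
        result = subprocess.run(
            ["gcj", "--version"],
            capture_output=True,
            text=True,
            timeout=10,
        )
        output = (result.stdout or result.stderr).strip()
        # Keep only the first line, which carries the version string.
        details["gcj"] = output.splitlines()[0] if output else "unknown"
    except (OSError, subprocess.TimeoutExpired):
        # gcj is missing from PATH, could not be run, or did not answer.
        details["gcj"] = "unavailable"
    return details


if __name__ == "__main__":
    for key, value in collect_environment_details().items():
        print("%s: %s" % (key, value))
```

Pasting the printed lines into a reply should be enough to tell the gcj
builds apart.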

Andi..

>
> ----------------------------------------------------------------------
> >>> import PyLucene as lucene
> >>> def add_doc(writer, text):
> 	doc = lucene.Document()
> 	doc.add(lucene.Field('text', text, lucene.Field.Store.YES,
> 	    lucene.Field.Index.TOKENIZED, lucene.Field.TermVector.WITH_OFFSETS))
> 	writer.addDocument(doc)
>
> 
> >>> analyzer = lucene.StandardAnalyzer()
> >>> writer = lucene.IndexWriter('temp', analyzer, True)
> >>> for _ in xrange(1000):
> 	add_doc(writer, '''
> Mr Straw also called for changes in the application of the Dublin
> convention. Why?
>
> For the past year, Mr Straw has put forward several initiatives to
> combat "asylum-shopping". This is the practice whereby people whose
> asylum claim is rejected in one EU country or who fail to lodge a
> claim in the first port of entry travel further to countries like
> Britain, where social benefits might be better and the judges more
> sympathetic.
>
> Mr Straw has now called for a reform of the 1997 Dublin Convention
> that obliges the first EU country entered to process asylum
> applications, saying that it had not been working as intended.
>
> Many countries, Mr Straw suggests, turned a blind eye as
> asylum-seekers travelled to third countries. He wants to see an end to
> the current situation, where scores of asylum-seekers are waiting at a
> Red Cross reception centre near Calais, in France, trying to board
> trains and trucks to get into Britain.
>
> The European Commission is currently looking at the effectiveness of
> the Dublin Convention and measures are set to be adopted this year in
> order to limit the movement of asylum-seekers between EU member
> states.
> ''')
>
> 
> Traceback (most recent call last):
> File "<pyshell#103>", line 12, in -toplevel-
>   ''')
> File "<pyshell#99>", line 4, in add_doc
>   writer.addDocument(doc)
> JavaError: java.lang.ArrayIndexOutOfBoundsException
> ----------------------------------------------------------------------
>
> Can anyone reproduce these errors?  I don't get them when using
> Field.TermVector.YES or Field.TermVector.NO.
>
> Steve
> -- 
> I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
> tiny blip on the distant coast of sanity.
>       --- Bucky Katt, Get Fuzzy

