02-28-2008, 08:36 AM
petersprc@gmail.com
 
Re: tweaking full-text retrieval

Hi,

You could adjust the document length normalization factor by modifying
PIVOT_VAL in myisam/ftdefs.h and recompiling the server.

The ft_stopword_file system variable controls the list of stopwords.
ft_min_word_len controls the minimum word length. After changing
either one you need to restart the server and rebuild your FULLTEXT
indexes. There's some more info at:

<http://dev.mysql.com/doc/refman/5.0/en/fulltext-fine-tuning.html>
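
Here is a rough sketch of checking the current settings and rebuilding an index once the variables have been changed and the server restarted. The table name `articles` is only a placeholder, and this assumes a MyISAM table:

```sql
-- Show the current full-text settings (ft_min_word_len,
-- ft_stopword_file, and so on). This only reads the values.
SHOW VARIABLES LIKE 'ft_%';

-- Rebuild the FULLTEXT index so it reflects the new settings.
-- `articles` is a hypothetical table name; substitute your own.
REPAIR TABLE articles QUICK;
```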

Boolean mode full-text search doesn't apply the 50% threshold. Examples here:

<http://dev.mysql.com/doc/refman/5.0/en/fulltext-boolean.html>
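
As a quick illustration, a boolean mode query might look something like this. The table `articles` and its columns `id`, `title` and `body` are made up for the example, and it assumes a FULLTEXT index on (title, body):

```sql
-- Hypothetical table: articles(id, title, body) with FULLTEXT(title, body).
-- +word means the word must be present, -word means it must be absent.
SELECT id, title
FROM articles
WHERE MATCH (title, body) AGAINST ('+mysql -yoursql' IN BOOLEAN MODE);
```

Note that boolean mode does not sort the rows by relevance on its own, so add an ORDER BY if you need ranking.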

If you need more control, you might try using the Lucene search engine.

Jens Grivolla wrote:
> Hi,
>
> I want to use MySQL's full-text retrieval, but would need to optimize
> it for my application.
>
> It seems possible to switch between two term weighting schemes (IDF and
> IDFP), is there any way to have greater control over this?
>
> More importantly, I will need to adjust the document length
> normalization, which is completely inadequate for my purpose. Is this
> possible?
>
> I will also need to adjust the stop-word list, reduce the minimum
> length of indexed words, and remove the 50% frequency cutoff, all of
> which seem to be nicely documented. No problems there, at least.
>
> Thanks,
> Jens

