Re: tweaking full-text retrieval Hi,
You could adjust the document length normalization factor by modifying
PIVOT_VAL in myisam/ftdefs.h and re-compiling.
The ft_stopword_file system variable controls the list of stopwords.
ft_min_word_len controls the minimum word length. There's some more
info at:
<http://dev.mysql.com/doc/refman/5.0/en/fulltext-fine-tuning.html>
Boolean mode FTS doesn't use the 50% threshold. Examples here:
<http://dev.mysql.com/doc/refman/5.0/en/fulltext-boolean.html>
If you need more control, you might try using the lucene search engine.
Jens Grivolla wrote:
> Hi,
>
> I want to use MySQL's full-text retrieval, but would need to optimize
> it for my application.
>
> It seems possible to switch between two term weighting schemes (IDF and
> IDFP), is there any way have greater control over this?
>
> More importantly, I will need to adjust the document length
> normalization, which is completely inadequate for my purpose. Is this
> possible?
>
> I will also need to adjust the stop-word list, reduce the minimum
> length of indexed words, and remove the 50% frequency cutoff, all of
> which seem to be nicely documented. No problems there, at least.
>
> Thanks,
> Jens |