What analyzer are you using?
On Dec 28, 2005, at 7:13 PM, jennyw wrote:
that term would show up in the result set. Does this indicate that short
words like “for” are not indexed?
Jen - what analyzer are you using?
If you’re using the default, it is the StandardAnalyzer, which
removes these stop words during tokenization:
ENGLISH_STOP_WORDS = [
  "a", "an", "and", "are", "as", "at", "be", "but", "by", "for", "if",
  "in", "into", "is", "it", "no", "not", "of", "on", "or", "s", "such",
  "t", "that", "the", "their", "then", "there", "these",
  "they", "this", "to", "was", "will", "with"
]
Off the cuff, you should be able to adjust this to not remove any
stop words by using:
:analyzer => StandardAnalyzer.new([])
if you’re using the Index class Ferret provides.
Erik