Strip out non-alphanumeric characters before saving to index

Does anyone know a simple way, with ferret or a_a_f, to strip out
everything that’s not a letter, number or space before saving to the
index? I know that i could do a custom method for every indexed field
that regexes them out but i thought that there might be a universal
option for it…

thanks
max

Hi!

That’s a typical job for an analyzer, I think Ferret’s StandardAnalyzer
which is used by default does exactly that. If not, try RegexpAnalyzer.

Cheers,
Jens

On Fri, Jun 13, 2008 at 02:43:47PM +0200, Max W. wrote:


Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk


Jens Krämer
webit! Gesellschaft für neue Medien mbH
Schnorrstraße 76 | 01069 Dresden
Telefon +49 351 46766-0 | Telefax +49 351 46766-66
[email protected] | www.webit.de

Amtsgericht Dresden | HRB 15422
GF Sven Haubold

great, i’ll check those out.

thanks!
max

2008/6/16 Jens K. [email protected]: