Which Search plugin?

laurin1 · February 14, 2007, 10:39am

Hi,

I am getting ready to add searching to the property model of a real estate site I am working on and was looking for advice on which search plugin to use.

I came to Rails to escape Java and I really don’t want to run a Java server, which as far as I know rules out Acts as Solr.

So it looks like a choice between Soda Search and acts as ferret. How well will these two handle normalized vs. non-normalized data? For example, if my model has relations with other objects (has_many etc.), will these search plugins index the items owned by my model as well?
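
One common way to get owned records into an index, whichever plugin is chosen, is to flatten them into a single text field on the parent and index that field. A minimal sketch in plain Ruby, assuming hypothetical Property and Room classes and a hypothetical searchable_text method (none of these names come from either plugin):

```ruby
# Hypothetical stand-ins for ActiveRecord models; all names are illustrative only.
Room = Struct.new(:name)

Property = Struct.new(:title, :rooms) do
  # Flatten the parent's own text plus the text of its owned records
  # into one string that a full-text indexer could be pointed at.
  def searchable_text
    ([title] + rooms.map(&:name)).join(' ')
  end
end

property = Property.new('Lake house', [Room.new('kitchen'), Room.new('sauna')])
puts property.searchable_text
```

The trade-off with this kind of flattening is that the parent has to be re-indexed whenever one of its owned records changes.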

Thanks,

Keith

laurin1 · February 17, 2007, 5:12pm

Acts as Ferret works wonderfully for me. YMMV.

But I doubt it.

RSL

laurin1 · February 17, 2007, 8:42pm

Russell N. wrote:

Acts as Ferret works wonderfully for me. YMMV.

But I doubt it.

RSL

Is acts as ferret a good search plugin to use for a big site with a lot of users and a lot of searches and inserts?

laurin1 · February 17, 2007, 10:35pm

Ferret’s based on Lucene [Apache Lucene - Wikipedia], so I’m pretty sure it’s solid.

RSL

laurin1 · February 17, 2007, 10:52pm

Ferret has segfaulted a number of times for me, taking down my mongrel
processes in a flaming ball of fire. I have monit checking my
processes, so they get restarted, but for some reason the restart can
take up to 10 minutes.

I’m currently in the process of switching all my acts_as_ferret to
tsearch2.

-carl

On 2/17/07, Russell N. wrote:

Acts as Ferret works wonderfully for me. YMMV.

–
EPA Rating: 3000 Lines of Code / Gallon (of coffee)

laurin1 · February 17, 2007, 11:03pm

I’ve seen problems in people’s apps from Ferret as the index gets
bigger: it will sometimes segfault and crash mongrel, or you will
start to get Ferret locking errors on the index when multiple Rails
processes try to read from and write to the same index.

HyperEstraier and Sphinx have held up better with bigger indexes
than Ferret for the apps I’ve seen.

I did hear that they are working on a DRb daemon that will be the
only thing to write to the Ferret index, with your app talking to it
over DRb. This may fix the locking and index corruption issues, but I
haven’t seen it in the wild yet.
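
The single-writer idea can be sketched with Ruby's standard DRb library. This only illustrates the pattern and is not the daemon mentioned above: IndexServer and its in-memory hash are hypothetical stand-ins for a real index, and the server and client run in one process here just to keep the example short.

```ruby
require 'drb/drb'

# Hypothetical in-memory stand-in for a search index; this is NOT Ferret's API.
class IndexServer
  def initialize
    @docs = {}
    @lock = Mutex.new
  end

  # All writes go through this one object, so there is a single writer.
  def add(id, text)
    @lock.synchronize { @docs[id] = text.downcase.split(/\W+/) }
    true
  end

  # Returns the ids of documents containing the term, in sorted order.
  def search(term)
    wanted = term.downcase
    @lock.synchronize do
      @docs.select { |_id, words| words.include?(wanted) }.keys.sort
    end
  end
end

# "Daemon" side: one process owns the index and exposes it over DRb.
# Port 0 asks the OS for any free port.
DRb.start_service('druby://localhost:0', IndexServer.new)
uri = DRb.uri

# "App" side: every Rails process would hold a proxy like this one
# instead of opening the index files itself.
index = DRbObject.new_with_uri(uri)
index.add(1, 'Three bedroom house')
index.add(2, 'Studio apartment')
puts index.search('house').inspect
```

Because only the server object touches the index, the app processes never compete for an index lock; the cost is one extra process to run and monitor.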

-Ezra


– Ezra Z.
– Lead Rails Evangelist
– Engine Y., Serious Rails Hosting
– (866) 518-YARD (9273)

laurin1 · February 17, 2007, 11:18pm

Ferret also has stemming. Ferret actually has more full-text searching
capabilities than tsearch2; however, I’ve never needed more than
tsearch2 can offer. tsearch2 is also faster when you need to combine
full-text conditions with other conditions. Since tsearch2 is built on
a GiST index, you can build multi-column GiST indexes that include the
tsvector column.

-carl


laurin1 · February 18, 2007, 12:54am

Just S. wrote:

tsearch2’s “problem” is that it is Postgres-specific. Not a problem for
me as I use ONLY Postgres. But if the app is MySQL-based you are out
of luck.

But tsearch2 has a huge advantage over Ferret: it has lexical
capabilities. This means that searching for plurals or otherwise
modified words will find them and the related words. For example,
searching for rabbit or searching for rabbits will both bring up all
docs with rabbit or rabbits. This is a VERY cool feature, and it’s
language-specific. There are dictionaries for stop-words for many, many
languages.

I use Postgres too, so is it better to use tsearch2 than Ferret (with a
really big database)?

laurin1 · February 17, 2007, 11:04pm

tsearch2’s “problem” is that it is Postgres-specific. Not a problem for
me as I use ONLY Postgres. But if the app is MySQL-based you are out
of luck.

But tsearch2 has a huge advantage over Ferret: it has lexical
capabilities. This means that searching for plurals or otherwise
modified words will find them and the related words. For example,
searching for rabbit or searching for rabbits will both bring up all
docs with rabbit or rabbits. This is a VERY cool feature, and it’s
language-specific. There are dictionaries for stop-words for many, many
languages.
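
As a toy illustration of why stemming plus stop-words makes rabbit and rabbits match the same documents, here is a deliberately naive sketch in plain Ruby. It is not how tsearch2 is implemented; STOP_WORDS, stem, lexemes and matches? are all made-up names, and the "strip a trailing s" rule is far cruder than a real dictionary.

```ruby
# Assumed, tiny stop-word list; real dictionaries are language-specific and much larger.
STOP_WORDS = %w[a an and the of for].freeze

# Naive stemmer: drop one trailing "s" from words longer than three letters.
def stem(word)
  word.length > 3 ? word.sub(/s\z/, '') : word
end

# Turn free text into a list of normalized terms (lowercased, no stop-words, stemmed).
def lexemes(text)
  text.downcase.scan(/[a-z]+/)
      .reject { |w| STOP_WORDS.include?(w) }
      .map { |w| stem(w) }
      .uniq
end

# A document matches when every normalized query term appears in it.
def matches?(document, query)
  (lexemes(query) - lexemes(document)).empty?
end

puts matches?('The rabbits ran away', 'rabbit')
puts matches?('A rabbit ran away', 'rabbits')
puts matches?('A rabbit ran away', 'ferret')
```

Both the document and the query go through the same normalization, which is what lets singular and plural forms meet in the middle.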


–
Family management on rails: http://www.famundo.com
My development related blog: http://devblog.famundo.com

laurin1 · February 18, 2007, 1:31am

Carl L. wrote:

It depends on your requirements.

Just to find something in a lot of data as fast as possible, and without
problems if there are inserts at the same moment.

laurin1 · February 18, 2007, 1:18am

I use Postgres too, so is it better to use tsearch2 than Ferret (with a
really big database)?

It depends on your requirements.


laurin1 · February 18, 2007, 2:25pm

On Feb 14, 1:37 am, “Keith D.” wrote:

will these two handle normalized VS non-normalized data. Like if I am using relations with other objects like has_many etc will these search plugins index the items owned by my model as well.

Here are a couple of links I thought were helpful; they touch on the
major issues: sorting vs. searching, stemming/tokenization, stopwords,
UTF-8 (I think), and tf/idf calculations.

http://agnessa.railsplayground.com/ferret.pdf

http://mojodna.net/searchable/ruby/railsconf.pdf