| | |
How do you handle site searches?
Please support our Social Media and Online Communities advertiser: Get a Free Web Site Analysis!
![]() |
I was just wondering how those of you with large communities handle site searches? With very large forums, even MySQL fulltext isn't good enough, nevermind the search feature built into software like vBulletin or phpBB.
Anyone play with www.sphinxsearch.com or something similar? Or are we all switching to Google's CBE nowadays?
Anyone play with www.sphinxsearch.com or something similar? Or are we all switching to Google's CBE nowadays?
Dani the Computer Science Gal 
Follow my Twitter feed! twitter.com/DaniWeb
And if you're interested in Internet marketing there is twitter.com/DaniWebAds

Follow my Twitter feed! twitter.com/DaniWeb
And if you're interested in Internet marketing there is twitter.com/DaniWebAds
•
•
Join Date: Oct 2008
Posts: 34
Reputation:
Solved Threads: 5
I've been developing a custom search for my site using lucene. I've found lucene fast and very flexible. I have a small site so the flexibility is more important to me than the speed.
I messed around with sphinx a bit. There were a couple of restrictions on how the DB tables were organized that would have made it a pain for me to use (like all primary keys had to be ints). But it was very fast for me. But again, I have a very small site, so YMMV.
When you say "isn't good enough" do you mean with respect to speed or something else?
I messed around with sphinx a bit. There were a couple of restrictions on how the DB tables were organized that would have made it a pain for me to use (like all primary keys had to be ints). But it was very fast for me. But again, I have a very small site, so YMMV.
When you say "isn't good enough" do you mean with respect to speed or something else?
I mean with respect to speed and also searches tend to be super literal ... I've had the literal problem with Sphinx as well (i.e. it doesn't automatically do searches including/excluding prefixes/suffixes). I've heard really good things about Lucene but never tried it out myself.
Dani the Computer Science Gal 
Follow my Twitter feed! twitter.com/DaniWeb
And if you're interested in Internet marketing there is twitter.com/DaniWebAds

Follow my Twitter feed! twitter.com/DaniWeb
And if you're interested in Internet marketing there is twitter.com/DaniWebAds
•
•
Join Date: Oct 2008
Posts: 34
Reputation:
Solved Threads: 5
Yeah, I agree with you that a lot of search implementations tend to be too literal.
Lucene is very nice in that respect. If you have a domain specific site, you can plug in your own aliases for various terms, as well as different stemming engines.
I'm using Roller for my blog engine, and since they're both written in java, it's a nice fit for me.
Lucene is very nice in that respect. If you have a domain specific site, you can plug in your own aliases for various terms, as well as different stemming engines.
I'm using Roller for my blog engine, and since they're both written in java, it's a nice fit for me.
![]() |
Similar Threads
- Exploding Niche + Residual Income + 1yr Old Domain Name (Websites for Sale)
- memory management in wndows 2000 (Windows NT / 2000 / XP)
Other Threads in the Social Media and Online Communities Forum
- Previous Thread: Content Development
- Next Thread: Governments Using Twitter to Update Constituents
Views: 948 | Replies: 3
| Thread Tools | Search this Thread |
Tag cloud for Social Media and Online Communities
ads analytics aol bebo bing blockbuster bloggers blogging blogs building business card celebrity censorship cloud communities community content craigslist crime davidmeermanscott digg e-learning education election email employment engagement enterprise enterprise2.0 facebook facebookfriends forrester ftc gambling gender gifts gmail google government handle holiday hp influencers internet iphone legal linkedin marketing mashable media membership metrics microblogging mobile myspace netflix networking news obama online onlinemovies page phishing policy politics privacy psychographics reader research retweet search small social socialmedia socialmediameasurement socialnetworking socialnetworks study success survey technology trademark transparency tweetdeck tweeting twitter user users video viral virtual wave web web2.0 webanalytics word wordpress yahoo youtube






