<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Gudasoft &#187; se</title>
	<atom:link href="http://www.gudasoft.com/tag/se/feed" rel="self" type="application/rss+xml" />
	<link>http://www.gudasoft.com</link>
	<description>Impossible is nothing</description>
	<lastBuildDate>Thu, 06 Oct 2011 07:17:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
		<item>
		<title>Search engines</title>
		<link>http://www.gudasoft.com/uncategorized/06/03/61/search-engines/2008</link>
		<comments>http://www.gudasoft.com/uncategorized/06/03/61/search-engines/2008#comments</comments>
		<pubDate>Tue, 03 Jun 2008 19:32:30 +0000</pubDate>
		<dc:creator>guda</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Fun]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[se]]></category>

		<guid isPermaLink="false">http://www.gudasoft.com/?p=61</guid>
		<description><![CDATA[It seems that I know too little about the world of search engines. I will try to summarize the search engines that can be used for real applications. The first one I got to know was Lucene. There is a lot of code built on top of Lucene. SOLR Compass sphinx ruby interface for it [...]]]></description>
			<content:encoded><![CDATA[<p>It seems that I know too little about the world of search engines. I will try to summarize the search engines that can be used for real applications.<span id="more-61"></span> The first one I got to know was <a title="Lucene" href="http://lucene.apache.org/java/docs/index.html">Lucene</a>. There is a lot of code built on top of <a href="http://lucene.apache.org/java/docs/index.html">Lucene</a>.</p>
<ul>
<li><a title="SOLR" href="http://lucene.apache.org/solr/">SOLR</a></li>
<li><a title="Compass" href="http://www.compass-project.org/">Compass</a></li>
<li>sphinx (a standalone engine, not built on Lucene)
<ul>
<li>ruby interface for it http://blog.evanweaver.com/files/doc/fauna/ultrasphinx/files/README.html</li>
</ul>
</li>
</ul>
<p>It also seems that these search engines connect nicely to data grids such as <a href="http://terracotta.org/">Terracotta</a>. The Terracotta site also has a nice tutorial on how to <a href="http://www.terracotta.org/confluence/display/~jboner/Build+a+POJO-based+Data+Grid+Using+Open+Terracotta">start</a> with grids.</p>
<p>Search engine resources on stopwords:</p>
<ul>
<li>http://en.wikipedia.org/wiki/Stop_words</li>
<li>http://www.semantikoz.com/2008/04/02/free-stop-word-lists-in-23-languages/</li>
<li>http://snowball.tartarus.org/algorithms/swedish/stop.txt</li>
<li>check my list of <a href="http://www.gudasoft.com/development/07/09/84/bulgarian-stopwords/2008">stop words</a></li>
</ul>
<p>CouchDB</p>
<ul>
<li>http://couchdb.apache.org/docs/overview.html</li>
<li>http://thailehuy.wordpress.com/2009/02/13/couchfoo-like-activerecord-but-for-couchdb/</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.gudasoft.com/uncategorized/06/03/61/search-engines/2008/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

