<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Wikipedia Page Similarities</title>
	<atom:link href="http://abeautifulwww.com/2007/02/12/wikipedia-page-similarities/feed/" rel="self" type="application/rss+xml" />
	<link>http://abeautifulwww.com/2007/02/12/wikipedia-page-similarities/</link>
	<description></description>
	<lastBuildDate>Tue, 24 Apr 2012 15:20:04 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: JB</title>
		<link>http://abeautifulwww.com/2007/02/12/wikipedia-page-similarities/comment-page-1/#comment-48</link>
		<dc:creator>JB</dc:creator>
		<pubDate>Tue, 29 May 2007 19:53:52 +0000</pubDate>
		<guid isPermaLink="false">http://abeautifulwww.com/?p=6#comment-48</guid>
		<description>This is definitely useful. And your pseudo code looks suspiciously like actual perl code. ;-)

Besides the co-citation similarity measure, have you tried other similarity measures for the wiki pages? Perhaps cosine similarity of feature (word) vectors... but then again, having feature vectors for all those wiki pages might be rather expensive and cumbersome.

Btw, you might find this lib useful. It&#039;s still in the works, but a lot of the algorithms behind IR and search engine technology have been implemented in perl:

http://tangra.si.umich.edu/clair/clairlib/</description>
		<content:encoded><![CDATA[<p>This is definitely useful. And your pseudo code looks suspiciously like actual perl code. <img src='http://abeautifulwww.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p>Besides the co-citation similarity measure, have you tried other similarity measures for the wiki pages? Perhaps cosine similarity of feature (word) vectors&#8230; but then again, having feature vectors for all those wiki pages might be rather expensive and cumbersome.</p>
<p>Btw, you might find this lib useful. It&#8217;s still in the works, but a lot of the algorithms behind IR and search engine technology have been implemented in perl:</p>
<p><a href="http://tangra.si.umich.edu/clair/clairlib/" rel="nofollow">http://tangra.si.umich.edu/clair/clairlib/</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

