intertwingly

It’s just data

Meme Tracker in IronPython


Dare Obasanjo: My weekend project was to read Dive Into Python and learn enough Python to be able to port Sam Ruby’s meme tracker (source code) from CPython to Iron Python. Sam’s meme tracker, shows the most popular links from the past week from the blogs in his RSS subscriptions.

More recent code can be found here.  Fetches titles from HTML, handles etags, matches both www. and non-www. versions of a URI.  Handles people who point to things multiple times.  Allows you to group people who tend to all “vote” in bulk.  Note: I consider the alternate link to be a vote too, which gives a small bump to people who post original content vs links.

I’d also recommend that you invest some time into converting from a simple regular expression to a real HTML parser.  You’ll need it anyway for titles.