Sniffing Titles
In RSS 2.0, HTML has crept not only into titles but also into other nooks and crannies. Can you put HTML in copyright elements? The RSS Advisory Board says no. But CNet News does it anyway.
I’ve committed a change (with tests!) to the Universal Feed Parser that attempts to guess the content type of various RSS feed elements based on a few primitive heuristics. I’m testing it against Share Your OPML’s list of the top 100 feeds.
So far, so good.
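To give a rough idea of what "primitive heuristics" can mean here, the following is a minimal sketch and not the Universal Feed Parser's actual code. The function name `guess_content_type` and both regular expressions are assumptions made up for illustration: the value is treated as HTML if it contains something shaped like a tag or a character entity, and as plain text otherwise.

```python
import re

# Hypothetical patterns, chosen for illustration only.
# Something shaped like an opening, closing, or self-closing tag: <b>, </a>, <br />
_TAG_RE = re.compile(r"</?[a-zA-Z][a-zA-Z0-9]*(\s[^<>]*)?/?>")
# Something shaped like a character entity: &amp; &#169; &#xA9;
_ENTITY_RE = re.compile(r"&(#\d+|#x[0-9a-fA-F]+|[a-zA-Z][a-zA-Z0-9]*);")


def guess_content_type(text):
    """Guess whether a feed element's value is HTML or plain text.

    This is a sniffing heuristic, so it can be wrong in both directions:
    plain text such as "a <b and c> d" will be mistaken for markup.
    """
    if _TAG_RE.search(text) or _ENTITY_RE.search(text):
        return "text/html"
    return "text/plain"
```

Called on a copyright string like `"Copyright <b>2006</b> Example Inc."`, this sketch reports `text/html`, while `"AT&T profits rise"` and `"1 < 2 and 3 > 2"` stay `text/plain` because neither contains a complete tag or entity.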