Posted on December 6, 2006, 3:21 pm
(http://kinja.com/user/thedigestibleaggie) for a while and have come up
with a fairly nice list of feeds, but I have run into an annoying
(though not critical) problem: duplicate stories. Apparently some of the
sites I subscribe to overlap, so the same story shows up more than once.
Does anyone know of some sort of filter (software or online
service) that can remove duplicate stories? Any help or suggestions
would really be appreciated!
Re: Removing duplicate entries/stories from a RSS feed?
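Write a script in a language that supports associative arrays (as do Java
and others). For each item in the various RSS feeds, create a unique key out
of elements of the item, and fill the associative array using the generated
key. An item whose key is already in the array is a duplicate and can be
dropped.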
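
Here is a minimal sketch of that idea. It is in Python (my choice for brevity; any language with associative arrays would do) and assumes the feed items have already been parsed into dictionaries with "title", "description" and "link" fields. The field names and the helper names item_key and dedupe are made up for illustration. The key is built from the title and description only, since the link usually differs between sites.

```python
import hashlib


def item_key(item):
    """Build a key from the item's title and description (assumed fields).

    Case and runs of whitespace are normalized so trivially different
    copies of the same text map to the same key.
    """
    parts = [item.get("title", ""), item.get("description", "")]
    normalized = " ".join(" ".join(parts).lower().split())
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


def dedupe(items):
    """Return items with duplicates removed, keeping the first copy seen."""
    seen = {}  # the associative array: key -> first item with that key
    for item in items:
        key = item_key(item)
        if key not in seen:
            seen[key] = item
    return list(seen.values())


if __name__ == "__main__":
    merged = [
        {"title": "Big  Story", "description": "Details", "link": "http://a.example/1"},
        {"title": "big story", "description": "details", "link": "http://b.example/9"},
        {"title": "Other", "description": "x", "link": "http://a.example/2"},
    ]
    for item in dedupe(merged):
        print(item["link"])
```

Which elements go into the key is the real design decision: the stricter the key, the fewer duplicates it will catch.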
Unfortunately, it is rare for two RSS feed items to be truly identical.
Often, they tell the same story with small differences in wording (to avoid
accusations of plagiarism) and of course the URL is normally different.
Without some complex coding to detect items that are almost the same, the
above method will remove only genuinely identical items from different RSS