Searching file tree for redundancies

Do you have a question? Post it now! No Registration Necessary.  Now with pictures!

Threaded View
I'm wondering if there are any programs out there (preferably linux,
though others are OK) which can search through a given directory tree
and parse the contents of all HTML files it finds - thus mapping out the
  linked files and identifying redundant (or not linked-to) files
(including images, plugins etc...) .

This would be useful for removing crap from a website which has now gone
through many revisions.

Thanks for any suggestions.


Re: Searching file tree for redundancies

Quoted text here. Click to load it

b  r  u  c  i  e

Site Timeline