Last week, as part of my work with Threepress building HTML5-based ebook reading software, I had to debug a problem with inaccurate cache manifests. Like any good engineer, rather than spend three hours doing it by hand, I spent six hours building a tool to do it.

The result is Caveman, a Python tool to validate HTML5 cache manifests. It scrapes the HTML page you specify, finding resources, then compares them to the cache manifest and reports problems.

When run as a command-line tool, it writes problems to its standard output. When run as a module, it's encapsulated enough that you can define how it gets HTTP data, and where it writes its results.

It worked well for me, and it might also for you. Try it and let me know.

tagged: , , » 2 reactions

Comments

[gravatar]
Pam 8:29 AM on 7 Nov 2011

http://xkcd.com/974/

[gravatar]
Ned Batchelder 9:00 AM on 7 Nov 2011

@Pam: I saw that xkcd comic this morning, and thought, "Nailed it." He knows of what he speaks.

Add a comment:

name
email
Ignore this:
not displayed and no spam.
Leave this empty:
www
not searched.
 
Name and either email or www are required.
Don't put anything here:
Leave this empty:
URLs auto-link and some tags are allowed: <a><b><i><p><br><pre>.