The stuff over at BlogCensus.net is pretty neat. They’ve made their raw data files available for anyone to mine. They also expose their tool’s data through a Web services API.
I don’t know that I’ll do anything with it, but I’m glad to know that I could. I can’t imagine the bandwidth they must have to offer up a 2.9 GB file to all comers.