DrueY 0 Light Poster

Does anybody have an idea of how this data is collected:

http://www.boxoffice.com/statistics/analysis/most_discussed

I'd like to do something similar, but I'm not sure where to start...web crawling maybe?

Drue