Interesting manipulations of data, but a shoddy attempt at analysis. This guy needs to work on constructing meaningful variables for his interpretation. "Important" means nothing. Is he looking at popularity, and if so, what kind? How representative are Wikipedia's film pages of all the films ever produced? What factors went into the "influence" metrics?
My post wasn't really aimed at a technical audience. It was more a case of "look what cool stuff we can do without looping in a human". I've given a slightly more technical introduction in my post on ranking universities: http://blog.argteam.com/coding/university-ranking-wikipedia/