
Interesting manipulations of data, but a shoddy attempt at analysis. This guy needs to work on constructing meaningful variables for his interpretation. "Important" means nothing. Is he looking at popularity, and if so, what kind? How representative is the sample of Wikipedia's film pages against the total films produced? What factors led to the "influence" metrics?


My post wasn't really aimed at a technical audience. It was more of a "look what cool stuff we can do without looping in a human" demonstration. I've given a slightly more technical introduction in my post on ranking universities: http://blog.argteam.com/coding/university-ranking-wikipedia/

It really is just PageRank. The source code is available on GitHub: https://github.com/cosbynator/WikiRank
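
For anyone unfamiliar with it, here is a minimal sketch of PageRank by power iteration over a link graph. It is not taken from the WikiRank repository; the function name `pagerank`, the damping factor of 0.85, the iteration count, and the toy graph are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Illustrative PageRank sketch (not the WikiRank code).

    links maps each page to the list of pages it links to.
    Returns a dict mapping every page to its rank; ranks sum to 1.
    """
    # Collect every page, including ones that are only ever linked to.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start from a uniform distribution.
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Rank held by pages with no outgoing links is spread evenly.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {p: base for p in pages}

        # Each page passes an equal share of its rank to every page it links to.
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank

    return rank


# Hypothetical four-page graph: A, B and D all link to C, and C links to A.
toy_graph = {"A": ["C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_graph)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

In this setup a page ranks highly when many pages, or a few highly ranked pages, link to it. That is the only sense of "important" the method measures.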



