
Their geo-data is utter crap. The vast majority of it is based on 'profile location', which means that there are almost a million people tweeting from the exact center of Atlanta. It's a crowded spot, must be a Starbucks there or something.


Just find those mass locations and remove them from the data set.
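For what it's worth, here's a rough sketch of that filtering step. It assumes each tweet is a dict with hypothetical "lat" and "lon" keys, and the cutoff `max_per_point` is an arbitrary number picked for illustration, not something from the actual data set. It only catches exact coordinate matches, which is what you'd expect from a geocoded profile location.

```python
from collections import Counter

def drop_mass_locations(tweets, max_per_point=1000):
    """Drop tweets whose exact (lat, lon) pair is shared by more than
    max_per_point tweets. Field names and threshold are assumptions."""
    # Count how many tweets sit on each exact coordinate pair.
    counts = Counter((t["lat"], t["lon"]) for t in tweets)
    # Keep only tweets from points that are not suspiciously crowded.
    return [t for t in tweets if counts[(t["lat"], t["lon"])] <= max_per_point]

# Tiny made-up example: five tweets pinned to one point, one elsewhere.
sample = [{"id": i, "lat": 33.749, "lon": -84.388} for i in range(5)]
sample.append({"id": 5, "lat": 33.7712, "lon": -84.3921})
kept = drop_mass_locations(sample, max_per_point=3)
```

With the toy data above, only the single tweet away from the crowded point survives.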


You end up with about 0.01% of tweets having locations after that. It's basically just iPhone users.



