They should run the analysis on the U.S.

languagehacker: I worry that if they're using sentiment analysis that they're not correctly isolating out the multitude of hidden variables involved with analyzing journalistic text. Examples include journalistic bias, extreme quotes from one viewpoint juxtaposed to moderate quotes from an opposing viewpoint, etc. Then again, the more data you throw at a model, the better its predictions become, even using terribly naive algorithms. Since they're in the terabytes range, it's certainly credible.

posted 4606 days ago