linked from here: https://www.techdirt.com/articles/20110725/05335715239/dailydirt-big-data-isnt-necessarily-better.shtml
For having painfully experimented data over-fitting. It's so subtle and powerful, and deadly, it will probably the demise of any future serious machine-learning system.
The more data you have, the more prone the system to over-fitting.
And it seems machine learning function like a black-box, : the programmers weren't able to understand the processes leading to certain moves by AlphaGo (the go playing machine).
If you cant check how the decision came to be, you're even more subject to have an obvious over-fitting problem on your hand . Eg: basketball linked to flu.