Big Dimensions, and What You Can Do About It

1 · Jeremy Kun · Feb. 8, 2016, 10 a.m.
Summary
Data is abundant, data is big, and big is a problem. Let me start with an example. Let’s say you have a list of movie titles and you want to learn their genre: romance, action, drama, etc. And maybe in this scenario IMDB doesn’t exist so you can’t scrape the answer. Well, the title alone is almost never enough information. One nice way to get more data is to do the following:...