Data & Big Data
Correlating data to me means making sense of two data sets at the same time, and understanding the dynamics between the two sets. For this assignment I am exploring the trends in the emerging fields of “Data & Big Data”. I will be using Article Search from the New York Times API and looking for the words ‘Data’ and a subset of it ‘Big Data’. In the Seattle Public Library, I will be searching for books with the titles containing the words ‘Data’ & ‘Big Data’.
I am going for a Swiss poster design style: featuring bold fonts and a minimalistic design. I am using negative spaces that complement each other, similar to yin and yang. As the ‘big data’ is a subset of ‘data’ I am using the same space to represent the two data sets. . Data is arranged vertically (like a time line) instead of the usual horizontal flow. I will be using a combination of simple bars & lines for the project. The mouse position will highlight the number of books/ articles that the bar represents for both SPL & NYT.
The results were pretty striking. One can see an exponential increase in the NYT article search for ‘Data’ in August of 2012. The results on the ‘Big Data’ double during the same month. A corresponding increase is noted for the Seattle Public Library as well. There is a noticeable increase for ‘Data’ related books in August 2013: an 8-month time difference. What is interesting is that the first checkout for the first Big Data related book happens in September 2013: One full year after the field of data (and by extension big data) exploded in the computer science industry. It brings in interesting questions for the publishing industry highlighting the time taken to write a technical book on a new emerging field and the trends and readership of books and articles on emerging technology fields.