Chapter 1: What is Big Data?

Photo by Alexander Sinn on Unsplash

This chapter first describes how we measure data and how its creation has skyrocketed in recent years. We then define Big Data and psychology for the purposes of the book, and explain why their intersection is important to study. The chapter ends with a guide to using the book and brief summaries of the upcoming chapters.


  • Class survey
    Here is the code for running the class survey at the beginning of class. The ZIP folder contains:

    1. An HTML file (bigdataregistration.html) with the client-side code (HTML and JavaScript) for the survey.
    2. A PHP file (bigdataregistration.php) with the server-side code (PHP, MySQL) that handles saving the survey data. You will need your own web server and database to save the students' data. As an example, I host my website and data with IONOS, but there are many web hosts available.
    3. If you do not want to deal with creating a database to save the survey data, there is also a different HTML file (bigdatareg-noserver.html) that instead has each student send you a code-generated e-mail with their responses.
    4. I include MATLAB code (viewmousemovements.m, which also uses registrationpage.png) to generate videos of mouse movements from the students' data. MATLAB is not the optimal language for this, but it works! (Implementing this in Python is on my to-do list.)

  • How much data do we make each day?
    Here's a 2025 blog post on Exploding Topics with statistics on how much data is generated each day.
    Here's an older article (but with a great visualization) on Forbes about how much data is collected every minute.

  • How is data measured?
    Here's an easy-to-understand primer on how the binary number system works.
    Here's a page that can convert between decimal and binary numbers.
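
If you would rather see the conversion spelled out than use a converter page, here is a minimal Python sketch of one common way to do it by hand. The function names to_binary and to_decimal are my own, chosen for illustration, and are not taken from the linked primer. The first function repeatedly divides by 2 and collects the remainders; the second reads the bits from left to right, doubling the running total and adding each bit.

```python
def to_binary(n: int) -> str:
    """Return the binary digits of a non-negative integer as a string."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return "0"
    digits = []
    while n > 0:
        # The remainder after dividing by 2 is the next bit,
        # starting from the least significant (rightmost) one.
        digits.append(str(n % 2))
        n //= 2
    # Bits were collected right-to-left, so reverse them.
    return "".join(reversed(digits))


def to_decimal(bits: str) -> int:
    """Return the integer value of a string of binary digits."""
    if not bits:
        raise ValueError("bits must not be empty")
    value = 0
    for bit in bits:
        if bit not in ("0", "1"):
            raise ValueError(f"not a binary digit: {bit!r}")
        # Shifting left by one place doubles the running total;
        # then add the new bit.
        value = value * 2 + int(bit)
    return value


print(to_binary(13))       # 1101
print(to_decimal("1101"))  # 13
```

Python's built-ins do the same job: bin(13) returns '0b1101' and int("1101", 2) returns 13. The hand-written versions are only there to show the arithmetic.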

  • What was it like moving the world's first hard drive?
    Here is a video about the world's first hard drive, from IBM in 1956: