Module 2: Big Data vs. Statistics

1

Welcome to Module 2!

Introduction

  • Now that you have a basic understanding of what Big Data is, it is time to dive deeper into the world of Big Data.

  • But, wait! There is so much information. Where do we even start?

  • Data science is a broad field with many sub-fields. In this module, we will compare and contrast Big Data and statistics, two of the most popular sub-fields of data science.

Objectives

  1. Understand the difference between Big Data and statistics

  2. Identify the advantages of using Big Data

2

Definitions

What is Statistics?

Statistics is the science of collecting, analyzing and understanding data, and accounting for the relevant uncertainties.
As such, it permeates the physical, natural and social sciences; public health; medicine; business; and policy.

What is Big Data?

Big Data is the collection and analysis of data sets that are complex in terms of the volume and variety, and in some cases the velocity at which they are collected.
Big Data are especially challenging because some of them were not collected to address a specific scientific question.

Source: American Statistical Association

Video Conference

3

What's the Difference?

Assignment

Create two infographics based on Modules 1 and 2.

Infographic 1: Compare and contrast two of the sub-fields of Data Science (see section 3).

Infographic 2: Summarize the important content from the previous modules.

Rubric: https://tinyurl.com/jryrsak7

Infographics Guidelines for Module 1 & 2_SunnyZhang_page-0001.jpg