
Unveiling the Data Revolution: Insights from Information Explosion
Delve into the era of data explosion with this engaging slide deck, exploring the significance of data, its forms, and its ubiquitous presence in our daily lives. Gain perspectives on the vast amount of information created daily and the transformative power hidden within data, as highlighted by notable figures like Eric Schmidt and Atul Butte. Discover the practical applications of data collection, from personal details to global insights, and the universal value of learning from data as emphasized by John Elder. Join the discussion on defining and understanding data, touching upon examples of data collection and its potential implications for decision-making. Explore how major platforms like Facebook and Google utilize data to personalize experiences, hinting at the valuable insights waiting to be uncovered in the sea of information.
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
Milestone #1 Slide Deck Let s get started!
There were 5 exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days. Eric Schmidt Note: 1 exabyte = 10 ^ 18 bytes
Hiding within those mounds of data is the knowledge that could change the life of a patient, or change the world. Atul Butte
An Explosion of Data A key aspect of the electronic age is the ability to capture and store large amounts of data. Discussion: What is data?
An Explosion of Data A key aspect of the electronic age is the ability to capture and store large amounts of data. Discussion: What is data? Data is information. Data can take many forms: Numbers Text Images ...
Discussion: Lets Define Data Examples What data can we collect in this room? Temperature Number of people in the room Arrangement of chairs/tables Average age of the people in the room .. What data can we use to describe ourselves? Name Birth Date Height Time we woke up this morning Total number of pets we own Average number of steps we walk a day .
Discussion: Lets Define Data Examples What data can we collect in this room? Temperature Number of people in the room Arrangement of chairs/tables Average age of the people in the room .. What data can we use to describe ourselves? Name Birth Date Time we woke up this morning The total number of text messages we sent yesterday Total number of pets we own Average number of steps we walk a day .
Learning from data is virtually universally useful. Master it and you will be welcomed anywhere. John Elder
Exploring and Understanding Data As mentioned before, lots of data is being collected about the people, places, and things that comprise our world! Facebook uses your clicks to collect data on your interests as a way of routing ads. Google collects data on your searches to understand your information preferences. Are there any others you can think of? Hidden in that data: answers about critical patterns, outliers, and average phenomena that can help us gain a better understanding of the world around us and the people in it in the past, present, and future. In other words, with the right techniques, we can use data to: Understand insights about the past Learn about our current status Predict the future by observing trends and patterns
Consider This Data Set... Calculus Test Scores for Test #3 Student Birth Date Grade Progress Comments Anna SEP 15, 2010 83 Anna has improved! Barry JUN 08, 2010 68 Invite Barry to tutoring. Carmella AUG 22, 2010 100 Carmella is an excellent student. Donterio MAY 05, 2010 83 Donterio is getting better. Erica DEC 09, 2009 87 This is Erica s lowest grade to date. What insights can we glean from it?
Data Set Quick Analysis By observing the prior data set, we can answer questions about: The number of students in the class The average age of students in the class The average grade students received on Test #3 The progress students have been making to date This is just one example but, with the right data set and the right questions, we can learn interesting things! The data science process covers the spectrum of: Formulating questions Finding appropriate data sets Cleaning/formatting data Analyzing the data Presenting findings
Data science is an exciting discipline that allows you to turn raw data into understanding, insight, and knowledge. ----[1] [1]https://www.oreilly.com/library/view/r-for-data/9781491910382/preface01.html
Consider this... Data Science is about drawing useful conclusions from large and diverse data sets through exploration, prediction, and inference[b]. How does one leverage data science to analyze data and understand critical, yet possibly hidden, phenomenon?[a] We will explore this idea in this project!
In This Project... You (and the partner you select) will: Select a data set, Define a set of questions to ask, Analyze the data via a set of ubiquitous tools/technology to answer the questions, and Present findings.
Project Overview (AKA The Big Picture) Driving Question: How can we explore data sets to answer interesting questions? Public Product A discussion about the data set you chose, the questions you posed, and the answers you discovered with a teammate. You will present to the other teams and, finally, to a group of experts.
Keep in mind... Your project is to explore a data set of your choice and leverage it to answer questions that are interesting to you. Over the course of the project, you will become familiar with the data analytics available to you in Microsoft Excel. Your final product will a presentation about your analysis process and the answers you discovered to a group of visitors.
Milestones #4a: Student teams analyze their data and share their preliminary findings with other teams. #1: Student explores the project, is introduced to the driving question, and develops a set of need to know questions (NTKs). #2: Student explores the data science process and begins getting acquainted with a powerful, yet highly available tool. #3: Student teams select the data set and prepare it for analysis based on the questions they want to ask. #4b: Student teams develop their presentations by drafting them, receiving feedback, and making modifications. #5: Student teams share their data models, analyses, and results with visitors, either virtually or in person.
Looking at the Milestones, what are your questions? Problem solving is the process of being at a start state, having an idea about the goal state, and developing a process to navigate from one to the other. A key factor that can drive the problem solving process is being aware of the questions you have and actively working, researching, and exploring to answer those questions -one at a time. Given what you know now about about the project, what are your thoughts? What are your questions? Jot those down!
Lets Discuss Expectations Participate in discussion NTKs During instruction Reflection during and at the end of the process Work well with your partner to select your data set, identify your question set, and discover your answers via data exploration Deliver and receive critique gracefully Participate in the collaborative development of the presentation Deliver the presentation along with your partner
We will revisit... Milestones Questions Rubric ...for now, just start jotting down your thoughts while they re fresh in your mind! The questions/thoughts you have now can be the door to your solution.
NTKs What do you already know about data science and data analysis? What do you need to learn in order to do data analysis? What do you need to know to know or learn to be a real data scientists? What questions do you have about what you need to have or know in order to complete the data exploration process?
Important NTK: what kinds of data will we work with? Equity in Athletics https://ope.ed.gov/athletics/#/datafile/list U.S. College Information https://nces.ed.gov/ipeds/use-the-data COVID-19 Data https://data.cdc.gov/browse?tags=covid-19
Lets Take a Survey! A popular way to collect data about human experiences, preferences, expectations, and statuses is to develop and administer a survey. Let s look at that process by going through it first hand. Use the link below to access a survey about you: [INSERT LINK HERE] We will ultimately aggregate, or combine, all of your responses and do analyses on it to drive home ideas about the data analysis process.
Next Steps Thank you for taking the survey! We will explore the results a little later. In the meantime, let s be reminded of the data sets we will explore for this project: Athletics, U.S. College Data, and Coronavirus Data Of the four presented, which data set do you want to explore? Find a partner who also wants to explore the same data you do and discuss the following: Why is this data set interesting to you? What might you want to learn from the data? Have you ever used Microsoft Excel for data analysis? Finally, begin jotting your preliminary ideas down so that you can revisit them later