Correlation in Data Analysis

Correlation in Data Analysis
Slide Note
Embed
Share

Correlation is a statistical measure that explains the relationship between two variables being measured. This involves assessing the direction and strength of the relationship, from negative to positive correlations. Explore how correlation is determined, its significance, and practical applications through various examples and scenarios.

  • Correlation
  • Data Analysis
  • Relationship
  • Strength
  • Variables

Uploaded on Mar 06, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. *What is the Correlation?

  2. A recent newspaper article reported that the number of personal computers being sold is increasing. In addition, the number of athletic shoes being sold is also increasing. They report that there is a strong correlation between these two events? What do you think?

  3. *What is a Correlation? *A relationship between two variables that are being measured. Negative Correlation No Positive Correlation Correlation - the value of one variable will increase and the other will decrease -no apparent relationship between the variables increasing or decreasing -the value of both variables increases

  4. *Order Card Set #1 on your Desk based on the Direction of the Correlation. *(Negative to Positive) *What order did you place the cards in? *Did you have any difficulty with this task?

  5. *Strength of a Correlation? *You can further classify a correlation by determining if it is a strong or weak correlation. Perfect Negative Correlation Strong Negative No Weak Positive Perfection Positive Correlation Weak Negative Strong Positive Correlation

  6. *Order Card Set #2 on your Desk based on the Strength and Direction of the Correlation. *Negative to Positive *How sure are you that you have the cards in the correct order?

  7. *Correlation using Scatter Plots *If you have actual collected data for a situation, you can use a scatter plot to determine the correlation of the variables that are being compared. *Attach the data charts and scatter plot graphs from Card Set #3 to the situations from Card Set #2. Then look at the scatter plots and reorder your cards if needed. *Negative to Positive

  8. *How sure are you that you have the cards in the correct order? *How did you use the scatter plots to change the position of a card, or to confirm that you had it in the correct spot? *Are any of the situations still hard to decide where they fit in relation to the correlations of the other situations?

  9. Does adding a line of best fit help determine the correlation of two events? Would finding the equation of the line of best fit help us determine anything about the correlation?

  10. *Add the line of best fit to each of the scatter plots on your cards and calculate the equation for the line of best fit. Then reorder the cards if needed. *Negative to Positive *To get the equation of a line: *1 pick two points that the line passes through and calculate the slope using the slope formula ?2 ?1 *2 Use the slope an one point in the point slope form ? ?1= ? ? ?1 *3 Change the equation to slope intercept form ? = ?? + ? ?2 ?1

  11. *What do you notice about the slope of the line of best fits that you found and the correlation that you have chosen for each situation? *What problems could arise from relying on the line of best fit to determine the correlation?

  12. *Correlation Coefficient *Defined as a numerical representation of the strength and direction of the relationship between two variables/events. *It measures the amount of variance that you have in your data *It is a number between -1 and 1 that represents how similar the two variables/events are. *The closer a the number is to 1 or -1, the stronger the correlation. The closer the number is to 0, the weaker the correlation.

  13. -1 0 +1 Perfect Negative Correlation Strong Negative No Weak Positive Perfection Positive Correlation Weak Negative Strong Positive Correlation 0.3 Weak Positive Correlation Strong Negative Correlation Strong Positive Correlation No Correlation -0.82 0.7 0 -0.04 Weak Negative Correlation 1.2 Not a correlation #

  14. *Calculate *There are two ways to find the correlation coefficient *By hand using the formula: *Using Technology, we can calculate the correlation coefficient using several different methods. *Using Microsoft Excel Spreadsheets *Using Graphing Calculators

  15. *Graphing Calculator using Linear Regression *Be sure that your Diagnostics is turned on *On newer calculators *Select MODE *Scroll Down to bottom of screen to STAT DIAGNOSTICS and select on

  16. *Example What is the correlation of temperature and ice cream sales? How strong is the correlation?

  17. *Example *Create the x/y table on your calculator *Select STAT on calculator *Under EDIT tab, select 1:Edit *Enter all x values (temperature) under ?1 *Enter all y values (ice cream sales) under ?2 *Calculate the Linear Regression of your data *Select STAT on calculator *Under CALC tab, select 1:LinReg(ax+b) *Xlist: ?1 and Ylist:?2 or LinReg(ax+b) ?1,?2 *a and b are used to get line of best fit in the form of ? = ?? + ? *r is the correlation coefficient

  18. *Example Line of Best Fit: ? = 30.09? 159.47 Correlation Coefficient: 0.96 Strong Positive Correlation

  19. *Using your situation cards from yesterday, input the data into your graphing calculator and use the Linear Regression Feature on to: *Calculate the actual Line of Best Fit *Find the Correlation Coefficient *Reorder the Cards *Negative to Positive

  20. A recent newspaper article reported that the number of personal computers being sold is increasing. In addition, the number of athletic shoes being sold is also increasing. They report that there is a strong correlation between these two events? What would be the best way to determine the answer to this question?

  21. *Comparing More Variables *Our school wants to start offering nutritional cereal for breakfast. They ask you to look at the nutritional rating of different cereal brands and decide which ones we should offer. *They recommend that you start by looking at the correlation of the nutritional rating to each of three variables: fat, sugar, and protein. *What do you think that you can determine by looking at these correlations?

  22. *Sugar and Fat have Negative Correlation with Nutritional Rating *Protein has Positive Correlation with Nutritional Rating *Need Correlation Coefficient to determine which one has the strongest correlation *We CANNOT state that any of theses factors causes the high nutritional rating!

More Related Content