Basic Statistical Principles and Analysis Techniques

univariate statistics n.w
1 / 33
Embed
Share

Explore essential statistical concepts like central tendency, dispersion, standardization, mode, median, mean, frequency distributions, and skewed distributions. Learn how these tools are used to analyze data effectively and make informed decisions in various fields.

  • Statistics
  • Data Analysis
  • Central Tendency
  • Frequency Distributions
  • Skewed Distributions

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Univariate Statistics

  2. Basic Statistical Principles Central tendency Dispersion Standardization

  3. Central tendency Mode Median Mean Skewed distributions

  4. Frequency distributions Show n of cases falling in each category of a variable Starting point for analysis Reveals out of range data Signals missing data to be specified Identifies values to be recoded

  5. Frequency Distribution Example MOST PEOPLE ARE HONEST Cumulative Percent Frequency Percent Valid Percent Valid 1.00 2.00 3.00 4.00 5.00 6.00 Total System 264 378 680 1145 669 171 3307 43 3350 7.9 11.3 20.3 34.2 20.0 5.1 98.7 1.3 100.0 8.0 11.4 20.6 34.6 20.2 5.2 100.0 8.0 19.4 40.0 74.6 94.8 100.0 Missing Total

  6. Frequency Distribution Example MOST PEOPLE ARE HONEST 1400 1200 1000 800 600 400 Frequency 200 0 1.00 2.00 3.00 4.00 5.00 6.00 MOST PEOPLE ARE HONEST

  7. Mode The most common score E.g. (gender): Frequency 123 148 Males Females -Female is the modal category

  8. Median Arrange individual scores from top to bottom and take the middle score E.g. (Quiz scores): Score Frequency 100 1 90 3 80 6 Median = 80 70 3 60 2 50 1

  9. Mean Statistical average (total scores/number of scores) E.g. (Quiz scores): Score Frequency 100 1 90 3 80 6 Median = 80 70 3 Mean = 76.88 60 2 50 1

  10. Skewed distributions Median may be a better indicator of central tendency Example: Typical employee income CEOs make 100 times average worker Outlier distorts the average Median works better Income Frequency $5,000,000 1 Mean= $99,500 $50,000 99 Median = $50,000

  11. The Normal Curve 50% of cases are above the midpoint 50% of cases are below the midpoint

  12. Importance of the Normal Curve Many of the statistical analysis techniques that we ll be talking about assume Normally distributed variables This assumption is: Rarely checked Often violated

  13. Positive and negative skews

  14. Positive Skew Example MY OPINIONS DON'T COUNT MUCH 1000 800 600 400 Frequency 200 0 1.00 2.00 3.00 4.00 5.00 6.00 MY OPINIONS DON'T COUNT MUCH

  15. Negative Skew Example BIG COMPANIES ARE OUT FOR THEMSELVES 1000 800 600 400 Frequency 200 0 1.00 2.00 3.00 4.00 5.00 6.00 BIG COMPANIES ARE OUT FOR THEMSELVES

  16. Correcting for skewed distributions Ways to correct for skewed variables: Square root a positively skewed variable Square a negatively skewed variable

  17. Dispersion How spread out are the scores from the mean? Are they tightly packed around the mean Or Are they spread out?

  18. Dispersion Measures Range Standard Deviation Variance

  19. Range Distance between the top and bottom score E.g., Hi Score = 96, Lo Score = 42, Range = 54 Only tells you about the extremity of the scores These 3 distributions have the same range: 10, 11, 12, 13, 14, 15, 90 10, 85, 86,87,88,89,90 10,48,49,50,51,52,90

  20. Standard Deviation and Variance Both account for the position of all the scores Both measure the spread of the scores

  21. Standard Deviation Small Variance Large Variance (large SD) (small SD)

  22. Standard Deviation and Variance: Measures of Dispersion Standard deviation measure of the width of the dispersion or spread of the scores or size of the average distance of scores from mean The squared value of the standard deviation (sd2) is called the variance

  23. Standardization Converting variables to a uniform scale Mean = 0 Standard deviation = 1 Formula: z score = (score mean)/standard deviation

  24. Standardization and Normal Curve 68% of cases fall within 1 standard deviation of the mean 95% of cases fall within 2 standard deviations of the mean 99% of cases fall within 3 standard deviations of the mean

  25. Functions of Standardization Makes two variables comparable Allows us to compare within groups Allows us to compare across collections Stepping stone to other procedures (e.g., Pearson Correlation Coefficient)

  26. Steps in Statistics How to calculate various statistics 26

  27. Steps in Calculating Standard Deviation Steps: 1. Calculate the mean 2. Subtract mean from each score (deviations) 3. Square all deviations 4. Add up squared deviations 5. Divide sum of squared deviations by N 6. Take the square root of the resulting value

  28. Formula for Standard Deviation Formula averages distance of scores from mean: For a population For a sample used to estimate population sd

  29. Example of Calculation (sd) Scores x-M 16 12 10 6 6 Mean = 10 (50/5) Sum of Squares = 72 72/5 = 14.4 Sq root = 3.79 16-10 = 6 12-10 = 2 10-10 = 0 6-10 = -4 6-10 = -4 Square 36 4 0 16 16

  30. Calculating Variance Same as standard deviation without last step Standard deviation s descriptive utility If standard deviation is 5, the average distance from the mean is 5 Variance is building block for other procedures

  31. Standardizing and Variable Comparability Example Students took two exams: Exam 1 Exam 2 Student A 90 90 Student B 80 100 Student C 80 100 Student D 80 100 Student E 70 10 Mean = 80 80

  32. Standardizing and Variable Comparability Example Exam 1 Z1 Exam 2 Z2 A 90 1.58 90 .28 B 80 0 100 .57 C 80 0 100 .57 D 80 0 100 .57 E 70 -1.58 10 -1.99

  33. Standardizing and Within Group Comparability Person: Amos Burt Cedric Arlene Bertha Carla Height: 5 8 6 1 6 5 5 1 5 4 5 11 z-Height: -.50 .75 1.75 -1.33 -.33 2.00 Population Mean Population SD 5 10 Men Women 5 5 3 4

More Related Content