Improve Data Analytics Skills with Stata at CIES 2017 Workshop

complex data analysis n.w
1 / 19
Embed
Share

Enhance your data analytic abilities using Stata with Early Grade Reading and Mathematics Assessment Data at the upcoming CIES 2017 workshop in Atlanta. Join experts to learn about complex data analysis techniques and practical applications using real-world examples.

  • Data Analysis
  • Stata
  • CIES 2017
  • Workshop
  • Georgia

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Complex Data Analysis Improve your data analytic abilities using Stata and Early Grade Reading and Mathematics Assessment Data Sunday March 5, 8:30 14:45 Georgia 2 (South Tower) CIES 2017 Downtown Sheraton Atlanta, GA Marissa Gargano Sarrynna Sou Susan Edwards Chris Cummiskey Marissa Gargano Sarrynna Sou Susan Edwards Chris Cummiskey www.rti.org 1 RTI International is a registered trademark and a trade name of Research Triangle Institute.

  2. Before we data dive Introductions around the room: Name, Affiliation, Position, Favorite thing to do in ATL. What Stata version do you have (12, 13, 14)? Stata-Use/Comfort Level. Statistical Background Experience with data from EGRA/EGMA. What you hope to get out of this class. 2

  3. Assumption for Participation Currently have your computer with you. Currently have Stata on your computer. Computer has access to the internet. Currently have Microsoft Work and Excel on you computer. Have conducted analysis in Stata. Have a general grasp of statistics. Are not terrified with the word: regression . 3

  4. Workshop Brief: Review the Agenda Two Sections: Tanzania Cross-sectional Survey [Morning] Kenya Impact Evaluation Survey [Afternoon] 4

  5. Data Access Agreements Make sure everyone has signed the data access agreement Make sure everyone has the Workshop Folder copied over to their computer. 5

  6. Install Groups into your Stata. Connect to the internet Open Stata, type the following into your command bar ssc install groups 6

  7. Lets get Started on Simple Survey Statistics. 7

  8. Census vs. Random Sample Random Sample Census Portion of population Entire population Calculate true values No uncertainty No need to be statistically significant! Calculate estimated values Will always have some uncertainty Need for statistical significances To expensive Not practical Cost effective 8

  9. Background: Simple Random Sample vs. School-Based Complex Random Simple Random Sample Complex (Cluster) Sample Sample school student More than 1 stage sampled Directly sample students Only 1 stage sampled Not practical Practical Too expensive (not cost effective) Cost effective 9

  10. Sample Analysis: Two Types Inferential Project the sample to the population Descriptive describe the sample (not the population from which the same came) Cluster Effect Sample Sample 10

  11. Inferential Statistics: It is all about 2 simple things Point Estimates: (i.e. means, percentages) Use Sample Weights To make the sample representative of the population Accounts for over any over/under representation in the sample Minimize bias Precision Estimates: (i.e. standard error) Use sufficient sample size and small cluster size Try to make the point estimates as precise as possible Minimize the uncertainty of the point estimate. 11

  12. Inferential Statistics: Put Point and Precision together Once you have the proper Point Estimate and Precision Estimate, you can create various statistical figures: 95% Confidence Intervals Representative of the population and sufficiently tight. Formal statistical tests Test for difference among groups Test for correlations 12

  13. Inferential Analysis: Failure to account for sample methodology If students were sampled with Analyzed accounting for Not Properly Analyzed Data SRS Properly Analyzed Data the proper sample methodology 70 70 60 60 53.8 Words per Minute Words per Minute 51.2 50 50 45.6 43.4 40 40 30 30 Weighted Not weighted 20 20 Accounts for cluster effect Does not account for cluster effect 10 10 0 0 Public Private* Public Private Mean(orf) Lower 95%CI Upper 95% CI Mean(orf) Lower 95%CI Upper 95% CI * p<0.05 Source: RTI International, Grade 2 Early Grade Reading Assessment (EGRA) and Snapshot of School Management Effectiveness in Indonesia (EdData II), March-April 2014. 13

  14. Stata Coding: If students were sampled with Analyzed accounting for Not Properly Analyzed Data SRS Properly Analyzed Data the proper sample methodology 70 70 60 60 53.8 Words per Minute Words per Minute 51.2 50 50 45.6 43.4 40 40 30 30 svy: mean orf, over(private) mean orf, over(private) 20 20 10 10 0 0 Public Private* Public Private Mean(orf) Lower 95%CI Upper 95% CI Mean(orf) Lower 95%CI Upper 95% CI * p<0.05 Source: RTI International, Grade 2 Early Grade Reading Assessment (EGRA) and Snapshot of School Management Effectiveness in Indonesia (EdData II), March-April 2014. 14

  15. What is svy? svy for survey design stata command used for complex sample analysis. equivalent to SPSS s csaplan (complex sample analysis plan) IF set up correctly: Accounts for the over/under representation in the sample Using the sample weights Accounts for the cluster effect Using the Taylor linearized variance estimation Very easy to use once the svyset as been specified and saved to the dataset. 15

  16. What is svyset? svyset for survey design set up stata command used at data processing stage. tells Stata how the complex sample was drawn (so you can analyze using the svy command). very easy to mess up if: The data processer has little experience with complex sample The data processer has not been involved with the sample methodology and drawing the sample The data processer has not been involved in the questionnaire design, pilot data collection. 16

  17. OK, TOO MUCH THEORY LETS GET TANGIBLE in TZ Open a do-file. Save it to the Tanzania Do Files Folder: <path>\Analysis Workshop\Materials\Tanzania_2013\Do Files\ 1_Analyze_Tanzania-Data_CIES2017_Analysis-Workshop.do 17

  18. In the Do-File Read in the Tanzania Census Data [School Level] use <path>\CIES-2017 Complex Analysis Workshop\Materials\Tanzania_2013\Data\ Census_List-Frame For Tanzania2013-National Survey_PLSE7.dta , clear Read in the Tanzania Sampled Data [Student Level] use ...<path>\CIES-2017 Complex Analysis Workshop\Materials\Tanzania_2013\Data\ PUF_3.Tanzania2014-National_grade2_EGRA-EGMA-SSME_English- Kiswahili.dta , clear 18

  19. 10 Minute Break? 19

Related


More Related Content