Using the IDB Analyzer I
Andrés Sandoval-Hernández – IEA DPC
Workshop on using PISA, PIAAC, TIMSS & PIRLS, TALIS datasets
Ispra, Italy- June 24-27, 2014
Note: These slides were prepared as part of the IEA training portfolio with the collaboration of IEA staff and resource persons.
2
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
4
Retrieving TIMSS DATA
Click on links to download dataset
Download data here:
http://timssandpirls.bc.edu/timss2011/international-database.html
Save downloaded data set under C:\EU_Workshop\Data
Then Run IDB ANALYZER
http://timssandpirls.bc.edu/timss2011/international-database.html
6
IDB Analyzer: Select Countries
Available Participants: All participating countries
Select countries with this button
Next >
7
IDB Analyzer: Merge Data Select files to be merged:
School Background and
Student Background
Select Variables:
All variables from school and student file
Achievement scores and weight variables are selected automatically
Output Files: C:\EU_Workshop\Work\TIMSS_Merged.*
10
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
12
Retrieving PIRLS DATA
Select Study (PIRLS)
Select Year Y2011
Select file type and countries
Download data here:
http://rms.iea-dpc.org/#
http://rms.iea-dpc.org/#
13
Retrieving PIRLS DATA
Select Format (SPSS)
Select desired documentation
Chose Download name
Add to Basket
View Basket
14
Retrieving PIRLS DATA
Check box Click to save/Download data set
Save downloaded data set under C:\EU_Workshop\Data
Then Run IDB ANALYZER
16
IDB Analyzer: Select Data and Countries
Select Directory…
C:\EU_Workshop\Data
Available Participants: All participating countries
Select countries with this button
Next >
17
IDB Analyzer: Merge Data Select files to be merged:
School Background and
Student Background
Select Variables:
All variables from school and student file
Output Files: C:\EU_Workshop\Work\PIRLS_merged.*
Achievement scores and weight variables are selected automatically
Start SPSS
20
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
21
Data preparation for TIMSS and PIRLS 2011 combined
Download data here:
http://timssandpirls.bc.edu/timsspirls2011/international-database.html
http://timssandpirls.bc.edu/timsspirls2011/
Click SPSS Data
23
Data merge in IDB Analyzer
Select Directory: C:\EU_Workshop\Data
Select Study: TIMSS/PIRLS
Select Year and Grade predefined
24
Data merge in IDB Analyzer, cntd.
Select Participants: Austria, Spain, Slovak Republic, Italy, Poland
Next >
25
Data merge in IDB Analyzer, cntd. Select File Types:
School Background
Student Background
Select and move all variables
Define path + name:
C:\EU_Workshop\Work\TIMSS_PIRLS_Merged.*
Start SPSS
27
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
29
Retrieving TALIS DATASET
Download data here:
http://www.oecd.org/edu/school/talis.htm
Click on the link “Complete database”
http://www.oecd.org/edu/school/talis.htm
31
Retrieving TALIS DATASET
Scroll down to the desired countries and click on their names and save
Save downloaded data set under C:\EU_Workshop\Data
Then Run IDB ANALYZER
34
IDB Analyzer: Select Countries
Available Participants: All participating countries
Select countries with this button
Next >
35
IDB Analyzer: Merge Data Select files to be merged:
School Background and
Student Background
Select Variables:
All variables from school and student file
40
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
42
Public Use Files (PUFs)
• Contain 23 of 24 participating countries, includes Cyprus, excludes Australia, available at:
http://vs-web-fs-1.oecd.org/piaac/puf-data
• PUF for Australia (called “CURF”) available on application, not distributed, via: [email protected]
• Extended public-/research-use files might be available from countries directly, e.g. U.S. and Germany
43
Public Use Files (PUFs), cntd.
PUFs available in two standard formats
• SPSS (.sav) for version 11 or later
• Unicode (UTF8) encoded to preserve national strings
• Full dictionary information
• Variable types and formats
• Variable labels
• Value labels (including any missing value labels)
• Missing value definitions (except for strings)
• Variable measurement levels
• Missing values represented numerically (more later)
44
Public Use Files (PUFs), cntd.
PUFs available in two standard formats
• SAS (.sas7bdat) • Standard, compressed data files for Windows environments
• Encoded in Unicode (UTF8)
• Variable types, widths, decimals, and labels assigned
• Each .sas7bdat PUF file is accompanied by an equivalently named .sas file that includes syntax to assign formats • SAS cannot store value labels permanently
• Includes the relevant LIBNAME (in), PROC FORMATS, DATA and FORMATS statements.
• These syntax files can be executed against each individual SAS file in order to display value labels in analytical procedures such as PROC UNIVARIATE, PROC FREQ …
• Missing values represented as SAS special missings (e.g. “.V”)
45
Data difference for PUFs and PDX (PIAAC Data Explorer)
• The set of cases across databases (national master, analysis, public-use) is identical, in general.
• One exception applies to the Canadian PUF
• Some cases were excluded and corresponding weights were loaded onto others in a particular domain to comply with Statistics Canada’s minimum weight standards
• It will not be possible to replicate reported estimates precisely
• This should not have practical relevance and should not affect the agreement of rounded estimates published by the OECD, those produced by the Data Explorer, and those made on the basis of the PUF
46
Data difference for PUFs and PDX, cntd.
• Data underlying the PDX and PUF contain different sets of variables
• Certain variable sets are not informative/useful for analysis in the PDX yet are included in the PUF for secondary analysis
• Each PUF includes a comprehensive set of 1,328 variables
• Of these, only 575 are included in the PDX
• The majority of variables included only in the PUF relate to the individual cognitive item scores and process information
47
Available documentation
Web: http://www.oecd.org/site/piaac/
• Frameworks (BQ, Literacy, Numeracy, PSTRE) • Background questionnaire • International report, national reports • Technical Report • Proficiency level descriptions • PUF codeplan (variables and values) • Derived variable scripts • Webpackages (tables and exhibits) • Compendia for background and cognitive variables • Sections “Publications” and “Public Data & Analysis”
48
Data retrieval for this analysis
PUFs from: http://vs-web-fs-1.oecd.org/piaac/puf-data
Click on SPSS for SPSS files
49
Data retrieval for this analysis
http://vs-web-fs-1.oecd.org/piaac/puf-data/SPSS/
Save downloaded data set under C:\EU_Workshop\Data
Then Run IDB ANALYZER
55
Merge Module
Syntax file is automatically stored in specified directory AND SPSS and the created syntax file are automatically opened.
56
Merge Module
• From the SPSS syntax editor choose: – Run > All
– The syntax is executed and the merged file is saved under the location you have chose in the last step of the IDB Analyzer Merge Module
58
Table of contents
• Data preparation for TIMSS 2011
• Data preparation for PIRLS 2011
• Data preparation for TIMSS and PIRLS 2011 combined
• Data preparation for TALIS 2013
• Data preparation for PIAAC
• Data preparation for PISA 2012
59
PISA
• Where to get the data:
• http://pisa2012.acer.edu.au/downloads.php
Download the respective files
60
PISA creating SPSS files
Step 1:
Replace xxx with a proper file location where you saved the previous files
Step 2:
Select all and run SPSS
Step 3:
Save the active dataset with the file name as a “.sav” SPSS file
Step 4:
Repeat steps 1-3 for all other files
• Open the SPSS control file
61
Merging data files
• Merging data files has to be done with SPSS
• Preparation for the merge:
• Reduce number of countries
Student data file
Cognitive item file
Scored cognitive items file
School data file
Parents data file
62
Merging data files
• Merging data files has to be done with SPSS
• Preparation for the merge:
• Reduce number of countries
• Merge files pairwise
Open the first file and sort by CNT SCHOOLID STIDSTD
Open the second file and sort by CNT SCHOOLID STIDSTD
Merge first and second file
Continue to add files to the previously merged one
Remember all files have to be sorted by CNT SCHOOLID STIDSTD if the variables are present in the data file