Title: Data Sets for Psychometric Modeling
Version: 1.3.0
Description: Collection of data sets from various assessments that can be used to evaluate psychometric models. These data sets have been analyzed in the following papers that introduced new methodology as part of the application section: Jimenez, A., Balamuta, J. J., & Culpepper, S. A. (2023) <doi:10.1111/bmsp.12307>, Culpepper, S. A., & Balamuta, J. J. (2021) <doi:10.1080/00273171.2021.1985949>, Yinghan Chen et al. (2021) <doi:10.1007/s11336-021-09750-9>, Yinyin Chen et al. (2020) <doi:10.1007/s11336-019-09693-2>, Culpepper, S. A. (2019a) <doi:10.1007/s11336-019-09683-4>, Culpepper, S. A. (2019b) <doi:10.1007/s11336-018-9643-8>, Culpepper, S. A., & Chen, Y. (2019) <doi:10.3102/1076998618791306>, Culpepper, S. A., & Balamuta, J. J. (2017) <doi:10.1007/s11336-015-9484-7>, and Culpepper, S. A. (2015) <doi:10.3102/1076998615595403>.
URL: https://tmsalab.github.io/edmdata/, https://github.com/tmsalab/edmdata/
BugReports: https://github.com/tmsalab/edmdata/issues
Depends: R (≥ 4.1.0)
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: true
RoxygenNote: 7.3.2
NeedsCompilation: no
Packaged: 2024-10-02 06:27:02 UTC; ronin
Author: James Joseph Balamuta ORCID iD [aut, cre, cph], Steven Andrew Culpepper ORCID iD [aut, cph], Jeffrey Alan Douglas [aut, cph], Auburn Jimenez [ctb, cph]
Maintainer: James Joseph Balamuta <balamut2@illinois.edu>
Repository: CRAN
Date/Publication: 2024-10-02 06:50:02 UTC

edmdata: Data Sets for Psychometric Modeling

Description

Collection of data sets from various assessments that can be used to evaluate psychometric models. These data sets have been analyzed in the following papers that introduced new methodology as part of the application section: Jimenez, A., Balamuta, J. J., & Culpepper, S. A. (2023) doi:10.1111/bmsp.12307, Culpepper, S. A., & Balamuta, J. J. (2021) doi:10.1080/00273171.2021.1985949, Yinghan Chen et al. (2021) doi:10.1007/s11336-021-09750-9, Yinyin Chen et al. (2020) doi:10.1007/s11336-019-09693-2, Culpepper, S. A. (2019a) doi:10.1007/s11336-019-09683-4, Culpepper, S. A. (2019b) doi:10.1007/s11336-018-9643-8, Culpepper, S. A., & Chen, Y. (2019) doi:10.3102/1076998618791306, Culpepper, S. A., & Balamuta, J. J. (2017) doi:10.1007/s11336-015-9484-7, and Culpepper, S. A. (2015) doi:10.3102/1076998615595403.

Author(s)

Maintainer: James Joseph Balamuta balamut2@illinois.edu (ORCID) [copyright holder]

Authors:

Other contributors:

See Also

Useful links:


Examination for the Certificate of Proficiency in English (ECPE) Item Responses

Description

Examination for the Certificate of Proficiency in English (ECPE) Item Responses

Usage

items_ecpe

Format

An object of class matrix (inherits from array) with 2922 rows and 28 columns.

Details

The subjects answered the following assessment items:

References

Data originated from:

Data used in:


Fraction Subtraction and Addition Assessment Item Responses

Description

Fraction Subtraction and Addition Assessment Item Responses

Usage

items_fractions

Format

An object of class matrix (inherits from array) with 536 rows and 20 columns.

Details

The subjects answered the following assessment items:

References

Data originated from:

Data used in:


Human Connectome Project's Penn Progressive Matrices Fluid Intelligence Assessment

Description

Trial data from Form A of an abbreviated version of the Raven’s Progressive Matrices developed by Gur and colleagues (Bilker et al. 2012). Participants are presented with patterns made up of 2x2, 3x3 or 1x5 arrangements of squares, with one of the squares missing. The participant must pick one of five response choices that best fits the missing square on the pattern. The task has 24 items and 3 bonus items, arranged in order of increasing difficulty. However, the task discontinues if the participant makes 5 incorrect responses in a row.

Usage

items_hcp_penn_matrix

items_hcp_penn_matrix_missing

Format

An object of class matrix (inherits from array) with 1201 rows and 24 columns.

An object of class matrix (inherits from array) with 1201 rows and 24 columns.

Details

The subjects answered the following assessment items:

Data Cleaning

The items_hcp_penn data set includes N = 1,201 observations. However, the observation count was calculated after removing participants that did not respond to any questions (Subject ID's: 117728, 137431, 145531, 236130, and 614439). From there, we generated the items_hcp_penn_missing data set that retains a missing data structure after the participant makes 5 incorrect responses in a row.

References

Data originated from:

Data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

Data used in:


Experimental Matrix Reasoning Test Item Responses

Description

Experimental Matrix Reasoning Test Item Responses

Usage

items_matrix_reasoning

Format

An object of class matrix (inherits from array) with 400 rows and 25 columns.

Details

Items included:

Answer coding

The subjects answered a set of assessment items seeking to determine their matrix reasoning abilities. Subjects that answered with a value between 0 to 7 were marked as incorrect. Subjects who answered a question with 10 selected the correct answer and, thus, were marked as correct.

Notes

From the OpenPsychometrics' code book that accompanied the data, they noted:

  1. The possible answers were presented in two rows of four with a random order for each participant.

  2. The collection of this data was of mediocre quality.

References

Data originated from:

Data used in:


Narcissistic Personality Inventory Item Responses

Description

Narcissistic Personality Inventory Item Responses

Usage

items_narcissistic_personality_inventory

Format

An object of class matrix (inherits from array) with 11243 rows and 40 columns.

Details

Items with their desired option response bolded:

Data pre-processing

We have applied list-wise deletion during pre-processing to remove any observations with missing values from the data set.

Answer coding

The subjects answered a set of assessment items seeking to determine the level of anxiety. Answers given in bold represent the desired response. If a subject matched this response, they were given a 1 inside of the item matrix, otherwise they received a zero.

References

Assessment Design:

Data originated from:

Data used in:


Subset of Early Childhood Longitudinal Study, Kindergarten (ECLS-K)'s Approaches to Learning Item Responses

Description

Subset of Early Childhood Longitudinal Study, Kindergarten (ECLS-K)'s Approaches to Learning Item Responses

Usage

items_ordered_eclsk_atl

Format

An object of class matrix (inherits from array) with 13354 rows and 12 columns.

Details

Items were split between being answered by Parents and Teachers.

Data pre-processing

The Early Childhood Longitudinal Study, Kindergarten (ECLS-K) has been subset down both the number of observations and variables. In particular, only observations without any missing values from a set of reduced variables – given above – are included. If additional data is required, please visit the data download page found in the reference section.

Answer coding

Parents and teachers each answered a set of survey items involving a likert scale that ranged from "1 = never" to "4 = very often" regarding the subject. Within the teacher responses, they also had the option of marking "-7 = no opportunity to observe" option, which was treated as a missing observation. To align with C++, we perform a index shift backward of 1 and, thus, make the scale "0=never" to "3=very often".

References

Data originated from:

Data used in:


Programme for International Student Assessment (PISA) 2012 U.S. Student Questionnaire Problem-Solving Vignettes

Description

Programme for International Student Assessment (PISA) 2012 U.S. Student Questionnaire Problem-Solving Vignettes

Usage

items_ordered_pisa12_us_vignette

Format

An object of class matrix (inherits from array) with 3075 rows and 12 columns.

Details

Obtained from the PISA 2012 questionnaire responses given to 15-year olds, we selected 12 items under three vignettes and four item stems to measure problem-solving strategies applied by students.

Answer Coding

All items were reverse-coded such that responses categories begin with negative responses at 0 and go to 3 for a positive responses.

References

Assessment Design:

Data used in:


Calculus-based probability and statistics course homework problems

Description

Calculus-based probability and statistics course homework problems

Usage

items_ordered_pswc_hw

Format

An object of class matrix (inherits from array) with 288 rows and 29 columns.

Details

Data set contains the subject's responses to calculus-based probability and statistics course homework problems.

References

Data used in:

Data originated from:


Trends in International Mathematics and Science Study 2015 (TIMSS) Grade 8 Student Background Survey Item Responses

Description

Trends in International Mathematics and Science Study 2015 (TIMSS) Grade 8 Student Background Survey Item Responses

Usage

items_ordered_timss15_background

Format

An object of class matrix (inherits from array) with 9672 rows and 16 columns.

Details

From the school background data file, we selected 16 items under two distinct item stems that sought to measure students’ school experiences using a four-level likert-scale.

Answer Coding

Items 8 through 16 were reverse-coded.

References

Data originated from:

Data used in:


Programme for International Student Assessment (PISA) 2012 US Math Assessment

Description

Programme for International Student Assessment (PISA) 2012 US Math Assessment

Usage

items_pisa12_us_math

Format

An object of class matrix (inherits from array) with 4978 rows and 76 columns.

Details

PISA assessment booklets have a spiral pattern. As a result, we've opted to re-order the data by booklet ID

Items alongside of the problem name:

Answer coding

The subjects answered a set of assessment items seeking to determine their knowledge of mathematics. The original PISA 2012 data set included variables that had problems with the following labels:

Note, this data set includes items with four levels only. Further, items are coded with ⁠Score 0⁠ as 0, ⁠Score 1⁠ as 1, ⁠Score 7: N/A⁠ as NA, and ⁠Score 8: Not Attempted⁠ as 0.

References

Assessment Design:

Data originated from:

Data used in:


Elementary Probability Theory Assessment Item Responses

Description

Elementary Probability Theory Assessment Item Responses

Usage

items_probability_part_one_full

items_probability_part_one_reduced

Format

An object of class matrix (inherits from array) with 504 rows and 12 columns.

An object of class matrix (inherits from array) with 431 rows and 12 columns.

Details

Questions wording and answers are from the pks package documentation.

Items with their desired responses bolded:

Data Cleaning

The items_probability_part_one_full data set includes N = 504 observations used in Chen et al. (2021). However, the observations contained missing trial data as NA. Excluding the missing response values from part one respondents yields N = 431. The data set with reduced items is given in items_probability_part_one_reduced.

References

Data originated from:

Data used in:


Revised PSVT:R Item Responses

Description

Revised PSVT:R Item Responses

Usage

items_revised_psvtr

Format

An object of class matrix (inherits from array) with 516 rows and 30 columns.

Details

Data set contains the subject's responses to Revised PSVT:R items. Correct answers are denoted by 1 and incorrect answers are denoted by 0.

References

Assessment Design:

Data originated from:

Data used in:


Last Series of the Standard Progressive Matrices (SPM-LS) Item Responses

Description

Last Series of the Standard Progressive Matrices (SPM-LS) Item Responses

Usage

items_spm_ls

Format

An object of class matrix (inherits from array) with 499 rows and 12 columns.

Details

Items with the correct answer response based off of Table 9 of the Robitzsch (2020) pre-print paper.

Answer coding

The subjects answered a set of assessment items seeking to determine the level of matrix reasoning. Answers given in bold represent the desired response. If a subject matched this response, they were given a 1 inside of the item matrix, otherwise they received a zero.

References

Assessment Design:

Data originated from:

Data used in:


Taylor Manifest Anxiety Scale Item Responses

Description

Taylor Manifest Anxiety Scale Item Responses

Usage

items_taylor_manifest_anxiety_scale

Format

An object of class matrix (inherits from array) with 4468 rows and 50 columns.

Details

Questions alongside of their correct answer is based off of Table 1 of the Taylor (1953) paper.

Items with their desired response bolded:

Data pre-processing

We have applied list-wise deletion during pre-processing to remove any observations with missing values from the data set.

Answer coding

The subjects answered a set of assessment items seeking to determine the level of anxiety. Answers given in bold represent the desired response. If a subject matched this response, they were given a 1 inside of the item matrix, otherwise they received a zero.

References

Assessment Design:

Data originated from:

Data used in:


Examination for the Certificate of Proficiency in English (ECPE) Expert-Derived Q matrix

Description

Examination for the Certificate of Proficiency in English (ECPE) Expert-Derived Q matrix

Usage

qmatrix_ecpe

Format

An object of class q_matrix (inherits from matrix) with 28 rows and 3 columns.

Details

Each entry in the matrix is either 1, if the item uses the skill, or 0, if the item does not use the skill. The skills identified by this matrix are:

The subjects answered the following assessment items:

References

Data originated from:

Data used in:


Fraction Subtraction and Addition Assessment Expert-Derived Q Matrix

Description

Fraction Subtraction and Addition Assessment Expert-Derived Q Matrix

Usage

qmatrix_fractions

Format

An object of class matrix (inherits from array) with 20 rows and 8 columns.

Details

Each entry in the matrix is either 1, if the Item uses the Trait, or 0, if the Item does not use the Trait. The traits identified by this matrix are:

The subjects answered the following assessment items:

References

Data originated from:

Data used in:


Oracle Q Matrices

Description

Pre-generated identified Q matrices used in simulation studies to verify recovery.

Format

A matrix with varying numbers of traits (K) and items (J).

Specifically:

Details

Each entry in the matrix is either 1, if the item uses the skill, or 0, if the item does not use the skill.


Elementary Probability Theory Assessment Expert-Derived Q Matrix

Description

Elementary Probability Theory Assessment Expert-Derived Q Matrix

Usage

qmatrix_probability_part_one

Format

An object of class matrix (inherits from array) with 12 rows and 4 columns.

Details

Each entry in the matrix is either 1, if the item uses the trait, or 0, if the item does not use the trait. The traits identified by this matrix are:

For a detailed overview of items, please see items_probability_part_one_full or items_probability_part_one_reduced.

Identifiability

Note, the expert supplied Q-matrix is not strictly identified. Though, the expert matrix is generically identified.

References

Data originated from:

Data used in:


Strategy Oracle Sets

Description

Pre-generated strategy matrices used in simulation studies to verify recovery.

Format

An array with varying numbers of items (J), traits (K), and strategies (S).

Specifically:

Details

Each entry in a strategy is denoted by either 1, if the item uses the skill under strategy s, or 0, if the item does not use the skill under strategy s.

Note: Each matrix in the strategy was generated independently under the criterion for a strictly identifiable Q matrix.

mirror server hosted at Truenetwork, Russian Federation.