The IRW Data Standard
Data standards are an essential mechanism of increasing reusability and interoperability of data amongst researchers. As such, they play a critical role in increasing the extent to which science is FAIR (findable, accessible, interoperable, reusable). The IRW data standard is meant to be a straightforward way of standardizing item response data such that it can be readily used by interested researchers in a wide variety of psychometric analyses.
Key Elements
The below three features are essential components of an IRW dataset (with some small exceptions to be explained below). Note that the typesetting of the data elements is consistent with that of the IRW.
id
This is a persistent identifer for the focal unit, i.e. the thing being measured. This will typically be a person but in some cases may be, for example, a word (when interest is in establishing some property of the word).item
This is a persistent identifier for the probe being used to measure. There are some cases wherein a unique item identifier cannot be identified; for more information on these cases see the discussion of trials below.resp
This is the item responses. It is coded as a numeric value that can be directly utilized in various psychometric models. Given its centrality, we make a few additional points:Response values that were imputed in the original data have been removed.
Within an item, values are meant to be consistently coded so that higher numbers indicate a consistently meanginful change in a response (i.e., higher numbers indicate stronger agreement). However, values may load on the latent variable in opposite ways across items (i.e., higher numbers may indicate strong agreement for some items and weaker agreement for others).
Other elements
These elements are optional features of IRW datasets. When IRW datasets have these columns, they have been consistently formatted as per the data standards described here.
rt
The response time used to produce an item response is coded in seconds.date
The calendar time at which a response was produced is included. Coding of this variable is done in two ways. In some cases, there were only relative dates (e.g., 30 days into data collection). In that case, we convert to the number of seconds since the first piece of data in the dataset was collected). In cases where more exact information was given (e.g., 1:30PM 03/04/2008), we convert to Unix time (i.e., seconds since JAN 01 1970. (UTC)).qmatrix
When items are classified into a small number of skills (i.e., for the purposes of cognitive diagnostic modeling ), we have included these item-level classifications. Note that the column headers will beqmatrix1
, …,qmatrixN
when there areN
skills.rater
In scenarios wherein the focal unit is being rated by other observers (perhaps along multiple dimensions), an identifier associated with the rater producing the rating.wave
In settings wherein data is collected longitudinally (but precise timing as indate
is not available), we indicate the wave of data collection. Values are ordered such that larger values represent data collected at a later date relative to smaller values. Note that this is also used to indicate pre/post treatment in the case of data collected from randomized controlled trials (RCTs).treat
An indicator of whether a respondent was in a treatment group (1=Yes) in an experimental study (e.g., an RCT or quasiexperiment).cov_
Covariates that are invariant for the foci of measurement (denoted byid
; as an example, common covariates for persons may be gender and age) will be identified via thecov_
prefix. NOTE: This was implemented as of V11.25; data added before that may not be reverse-compatible with this standardization element.itemcov_
Covariates that are invariant for the measurement probe. NOTE: This was implemented as of V16.1; data added before that may not be reverse-compatible with this standardization element.item_family
An identifier for groups of items that potentially have family resemblances such that assumptions related to local independence may be violated. Examples include items with a testlet structure, items that have common features (e.g., ‘how important is it that people wash their hands regularly?’ vesrus ‘how important is it that you wash your hands regularly’), or items that are clones of each other. Each element of a family of items will have a unique identifier (e.g., items in the first testlet have id1
, items in the second testlet have id2
, etc) and items that are not a member of a family will beNA
. NOTE: This was implemented as of V11.25; data added before that may not be reverse-compatible with this standardization element.
Additional Considerations
In some cases, we deviate from the above rules for specific reasons. We describe those here. Some of the below may be considered more experimental features of the IRW; more information about these data will be forthcoming.
Process data: In general, a single row of data in the IRW corresponds to a unque response from an individual to an item. In the context of ‘process data’, more information about a response is available. In such cases, responses have identifers that can be used to understand the sequence of actions that led to a given response.
Trials: Many constructs in cognitive psychology are measured via repeated trials of similar tasks wherein the probes either do not vary or vary in terms of some quantifiable feature of the stimulus. In such cases, the
item
column is uninformative and the columns beginningtrial_
will contain information about the different trials.
Contributing data to the IRW
Constructing an IRW-compliant dataset
Below are critical instructions for formatting data for the IRW.
Numeric values of a response should be meaningful. For example, missing values cannot be coded as numbers (e.g., -9).
Please check that the
id
anditem
identifers have an appropriate number of unique values.Additional data elements (e.g.,
rt
,date
,wave
,rater
,qmatrix
) should all be formatted as specified above.If there are multiple scales available, the responses need to be split into mutiple files (one per scale). If multiple groups are assessed via the same scale, these data can be put into a single file (if desired, a column indicating group membership can be added).
While we have tried to offer generic guidance on formatting data to the IRW standard, there are innumerable idiosyncracies that may merit additional conversation. To discuss specific issues associated with your formatting your data to the IRW standard, please feel free to each out to us at itemresponsewarehouse@stanford.edu
. We would be happy to talk more!
Adding data to the IRW
To add properly formatted data to the IRW repository there are three todo items:
Create a Github issue for this repository that describes any decisions you had to make and also includes a file with data (in CSV format).
Once the IRW team has confirmed that the data is appropriate, submit a pull request so that the code used to format the data gets added here. The pull request should go to the original repository (not your forked version of it) and to the main branch.
Finally, work with the IRW team to ensure that the ‘data index’ page here gets updated with the relevant information.
Completion of these three steps will place the data in the IRW repository. This will be reflected on the Redivis site and the website at our next scheduled update time.