The IRW Data Standard

Data standards are an essential mechanism of increasing reusability and interoperability of data amongst researchers. As such, they play a critical role in increasing the extent to which science is FAIR (findable, accessible, interoperable, reusable). The IRW data standard is meant to be a straightforward way of standardizing item response data such that it can be readily used by interested researchers in a wide variety of psychometric analyses.

Key Elements

The below three features are essential components of an IRW dataset (with some small exceptions to be explained below). Note that the typesetting of the data elements is consistent with that of the IRW.

id This is a persistent identifer for the focal unit, i.e. the thing being measured. This will typically be a person but in some cases may be, for example, a word (when interest is in establishing some property of the word).
item This is a persistent identifier for the probe being used to measure. There are some cases wherein a unique item identifier cannot be identified; for more information on these cases see the discussion of trials below.
resp This is the item responses. It is coded as a numeric value that can be directly utilized in various psychometric models. Given its centrality, we make a few additional points:
- Response values that were imputed in the original data have been removed.
- Within an item, values are meant to be consistently coded so that higher numbers indicate a consistently meanginful change in a response (i.e., higher numbers indicate stronger agreement). However, values may load on the latent variable in opposite ways across items (i.e., higher numbers may indicate strong agreement for some items and weaker agreement for others).

Other elements

These elements are optional features of IRW datasets. When IRW datasets have these columns, they have been consistently formatted as per the data standards described here.

rt The response time used to produce an item response is coded in seconds.
date The calendar time at which a response was produced is included. Coding of this variable is done in two ways. In some cases, there were only relative dates (e.g., 30 days into data collection). In that case, we convert to the number of seconds since the first piece of data in the dataset was collected). In cases where more exact information was given (e.g., 1:30PM 03/04/2008), we convert to Unix time (i.e., seconds since JAN 01 1970. (UTC)).
qmatrix When items are classified into a small number of skills (i.e., for the purposes of cognitive diagnostic modeling ), we have included these item-level classifications. Note that the column headers will be qmatrix1, …, qmatrixN when there are N skills.
rater In scenarios wherein the focal unit is being rated by other observers (perhaps along multiple dimensions), an identifier associated with the rater producing the rating.
wave In settings wherein data is collected longitudinally (but precise timing as in date is not available), we indicate the wave of data collection. Values are ordered such that larger values represent data collected at a later date relative to smaller values. Note that this is also used to indicate pre/post treatment in the case of data collected from randomized controlled trials (RCTs).
treat An indicator of whether a respondent was in a treatment group (1=Yes) in an experimental study (e.g., an RCT or quasiexperiment).
cov_ Covariates that are invariant for the foci of measurement (denoted by id; as an example, common covariates for persons may be gender and age) will be identified via the cov_ prefix. NOTE: This was implemented as of V11.25; data added before that may not be reverse-compatible with this standardization element.
itemcov_ Covariates that are invariant for the measurement probe. NOTE: This was implemented as of V16.1; data added before that may not be reverse-compatible with this standardization element.
item_family An identifier for groups of items that potentially have family resemblances such that assumptions related to local independence may be violated. Examples include items with a testlet structure, items that have common features (e.g., ‘how important is it that people wash their hands regularly?’ vesrus ‘how important is it that you wash your hands regularly’), or items that are clones of each other. Each element of a family of items will have a unique identifier (e.g., items in the first testlet have id 1, items in the second testlet have id 2, etc) and items that are not a member of a family will be NA. NOTE: This was implemented as of V11.25; data added before that may not be reverse-compatible with this standardization element.

Additional Considerations

In some cases, we deviate from the above rules for specific reasons. We describe those here. Some of the below may be considered more experimental features of the IRW; more information about these data will be forthcoming.

Process data: In general, a single row of data in the IRW corresponds to a unque response from an individual to an item. In the context of ‘process data’, more information about a response is available. In such cases, responses have identifers that can be used to understand the sequence of actions that led to a given response.
Trials: Many constructs in cognitive psychology are measured via repeated trials of similar tasks wherein the probes either do not vary or vary in terms of some quantifiable feature of the stimulus. In such cases, the item column is uninformative and the columns beginning trial_ will contain information about the different trials.