Introduction to PROscorerTools

Ray Baser

2023-10-17

Overview

PROscorerTools provides tools to score patient-reported outcome (PRO) measures and other quality of life (QoL) and psychometric instruments. PROscorerTools also provides the building blocks of the functions in the PROscorer package.

PROscorerTools contains several “helper” functions, each of which performs a specific task that is common when scoring PRO-like instruments (e.g., reverse coding items). But most users will find that the scoreScale() function alone can address their scoring needs.

The scoreScale() Function

The workhorse function in PROscorerTools is the scoreScale() function. Its basic job is to take a data frame containing responses to some items, and output a single score for those items.

The scoreScale() function has simple, flexible arguments that enable it to handle nearly all scoring situations.

Features:

Installation and Basic Usage

Install the stable version from CRAN (recommended):

install.packages("PROscorerTools")

If you want to contribute to the development of the PROscorerTools or PROscorer packages, then you can install the development version from GitHub (generally NOT recommended):

devtools::install_github("MSKCC-Epi-Bio/PROscorerTools")

Load PROscorerTools in your R workspace:

library(PROscorerTools)

As an example, we will use the makeFakeData() function to make a data frame of responses to 6 fake items from 20 imaginary respondents. The created data set (named “dat”) has an “id” variable, plus responses to 6 items (named “q1”, “q2”, etc.) from 20 imaginary respondents. There are also missing responses (“NA”) scattered throughout.

dat <- makeFakeData(n = 20, nitems = 6, values = 0:4, id = TRUE)

Below we use the scoreScale function to score the fake responses in “dat”. We use the items argument to tell scoreScale which variables are the items we want to score. We will score the items by summing the responses (type = "sum"). We will save the scores from the fake questionnaire in a data frame named “dat_scored”.

dat_scored <- scoreScale(df = dat, items = 2:7, type = "sum")
dat_scored

By default, scoreScale will score the items for a given respondent as long as the respondent is missing no more than 50% of the items. This can be changed with the okmiss argument. Above, okmiss = 0.50 by default, so a respondent could be missing 3 of the 6 items and still be assigned a score (if missing 4 or more items, they were assigned a score of NA). Below, we again score the items, but this time we allow less than half of the items to be missing to be scored (okmiss = 0.49).

dat_scored <- scoreScale(df = dat, items = 2:7, type = "sum", okmiss = 0.49)
dat_scored

By default, scoreScale will score the items for a given respondent as long as the respondent is missing no more than 50% of the items. This can be changed with the okmiss argument. Above, okmiss = 0.50 by default, so a respondent could be missing 3 of the 6 items and still be assigned a score (if missing 4 or more items, they were assigned a score of NA). Below, we again score the items, but this time we allow less than half of the items to be missing to be scored (okmiss = 0.49).

dat_scored <- scoreScale(df = dat, items = 2:7, type = "sum", okmiss = 0.49)
dat_scored

For more information on the scoreScale function, you can access its “help” page by typing ?scoreScale into R.

Future Development Plans

The PROscorer family of R packages includes PROscorer, PROscorerTools, and FACTscorer. My priorities for developing these 3 packages are:

  1. Streamline how the packages check arguments and processes input to scoreScale and other custom-written scoring functions. The current system gets the job done, but it is not very pretty. I believe that a more elegant, easy-to-use system for performing these tasks (possibly using the assertive package) will speed up the expansion of the PROscorer package (which contains custom scoring functions for specific, commonly-used PROs). I hope to have a stable version of this system in the next major update of PROscorerTools.

  2. Make the unit testing framework of PROscorer and PROscorerTools more comprehensive. Most of the code underlying the PROscorer functions will be already be tested by the PROscorerTools tests; however, I intend to come up with a standard set of tests for PROscorer functions to make it easier for me and others to add unit tests to their scoring functions.

  3. Expand PROscorer with more scoring functions for specific PROs.

  4. Finalize the collaborative infrastructure (e.g., on GitHub) by which users can use PROscorerTools to write scoring functions for their favorite PROs and submit them for inclusion in PROscorer. A major component of this is to add a few instructional vignettes, including a step-by-step guide for writing the scoring functions, guidelines for writing the instrument descriptions, and templates for writing the function documentation.

  5. Update the FACTscorer package. FACTscorer scores the FACT (Functional Assessment of Cancer Therapy) and FACIT (Functional Assessment of Chronic Illness Therapy) family of measures.
    Before writing PROscorerTools, I had completely re-written most of the underlying FACTscorer code to be more foolproof and easier to update in the future. I also wrote an “Instrument Descriptions” vignette, similar to what is included with PROscorer. I will put the finishing touches on the FACTscorer update and release it as soon as the above work is done.

  6. Add capability to generate IRT-based scores for PROs that use that scoring method. I know many researchers that use various PROMIS measures. They would prefer to use the IRT-based scoring method, but find it too difficult to integrate into their research workflow. PROscorer could make IRT-based scores accessible to a much wider group of researchers.

Resources for More Information