How are the data organized?
The PTSD Repository contains over 300 variables extracted from randomized controlled trials (RCTs) of PTSD treatments published from January 1, 1988 through April 1, 2024. It continues to be updated annually.
For ease of
use, the variables are broken up into separate tables (called datasets), each with a focus
designed to “chunk” related variables together.
Each dataset is organized at either the study level
or the arm level. Study-level datasets correspond to
the overall study, whereas arm-level datasets correspond to each
specific treatment condition in the study. For example, an RCT might be
classified at the study-level as a psychotherapy study and at the arm level as
having a Prolonged Exposure (PE) arm, a Cognitive Processing Therapy (CPT) arm,
and a treatment as usual arm. Users are
cautioned to pay attention to whether tables are organized at the study or arm
level. To help, we include information on the primer page for each dataset that identifies if each row is a "study" or an arm (listed as "intervention group in a study". See below for examples.
The organization scheme for tables and coding rules—decisions made about variables and response items—are outlined below. Each of the datasets also includes a series of repeated variables—the variables included in every dataset (also described below).
Understanding Study ID
Each dataset in the PTSD Repository starts with a unique Study ID that helps the user understand key aspects of the study they are viewing. The Study ID always begins with the first author's name and year that the study was published (i.e., AuthorXXXX). If an author has 2 or more studies published in the same year, a lower-case letter is used to distinguish between the studies (i.e., AuthorXXXXa and AuthorXXXXb).
In datasets that include arm-level data, Study ID may also include additional specifiers such as: measure name, intervention arm (or intervention arms being compared), assessment timepoint, and the analysis type. Specifically:
- Measure name: In the outcome datasets, an abbreviation of the measure name is included to differentiate outcomes associated with different measures (i.e., AuthorXXXX_CAPS and AuthorXXXX_PCL). A list of measure abbreviations can be found in the Abbreviations dataset.
- Intervention arm: Each treatment arm is identified
with a capital letter separated by an underscore (i.e., AuthorXXXX_A,
AuthorXXXX_B). For datasets that include between-groups data analysis in which
the conditions are compared, this could also look like AuthorXXXX_CAPS_AvB
(i.e., condition A versus B).
- Assessment timepoint: Each assessment time point
is indicated with a number separated by an underscore. Timepoints are either
coded as “baseline” or represent the number of weeks post-treatment where “0”
represents immediate posttreatment (i.e., AuthorXXXX_A_Baseline, AuthorXXXX_A_0, and
AuthorXXXX_A_4).
- Analysis type: Each study is coded with an underscore
of “ITT” to identify if the analyses are intention to treat (i.e., include
all participants who were randomized) or “Comp” to indicate completer
analyses in which only people who completed the treatment are included (i.e., AuthorXXXX_A_1_ITT and AuthorXXXX_A_1_Comp).
Important Note: Datasets organized at the arm level have
multiple rows per study based on the study characteristics above. For example,
a study that compares PE to CPT would have at least 2 rows corresponding to
the 2 treatment arms. If there were 2 different measures (e.g., an interview and a self-report) administered to
participants within 2 treatment arms, there would be 2 rows for each measure
for each arm—equating to 4 total rows. If participants were measured at 2 different timepoints, that would add 2 more rows, etc. When creating visualizations, users are cautioned to be aware of occurrences in which individual
studies may be contributing more than one data point (i.e. multiple arms of the same study included).
Understanding Treatment Codes
The RCTs in the PTSD Repository are
organized in several ways to allow users to identify and sort those studies
that interest them most.
Data are organized at the study level
by Treatment focus (i.e., PTSD or PTSD plus substance use disorder [SUD]) and Study
class that indicates the type of treatment studied. Then, treatments are coded
at the arm level by Treatment focus, Study class subcode and Treatment name. See below for more detail.
Treatment Focus (study level)
Each RCT in the PTSD Repository
is coded as being a study that is focused on PTSD or on both PTSD and SUD
(PTSD+SUD). PTSD+SUD studies are treatment studies that are designed to target
both PTSD symptoms and SUD symptoms. Studies that are coded as PTSD+SUD also
have a subclass code for each arm to indicate whether the treatments are
integrated or not integrated. Users can choose to access all studies in dataset
or limit results to PTSD studies or PTSD+SUD studies by using a filter.
Treatment Focus (Arm level)
Each arm in the
PTSD Repository is coded as being focused on PTSD, SUD, both PTSD and SUD
(PTSD+SUD), active control, or inactive control.
- PTSD+SUD arms include interventions that target both PTSD and SUD within a single arm.
- Active control arms are those where a treatment is offered that is designed to have some positive effect but not as strong as the intervention it is being compared to.
- Inactive control arms offer no treatment (e.g., waitlist) or only a treatment that is not designed to help mental health symptoms (e.g., an educational intervention).
Study Class (study level)
Each RCT is coded with regard to 8 study classes:
- Pharmacotherapy
- Psychotherapy
- Complementary and Integrative Health (CIH)
- Nonpharmacologic Biological
- Nonpharmacologic Cognitive
- Collaborative Care
- Other Study Class
- Control
There are 3
additional study classes. If a study included more than one type of active
treatment (either within a single treatment arm or in different treatment
arms), the study was coded as:
- Psychotherapy & pharmacotherapy
- Psychotherapy & CIH
- Other Mixed
Study Class Subcodes (arm
level)
To help users view data by
different types of treatments, the specific treatment arms are further
classified into subtypes. See our Reference Guide: Treatments Found in the PTSD Repository for more information about the organization of treatments.
- Psychotherapy subclass: Coded as trauma-focused or non-trauma-focused.
- Pharmacotherapy subclass: Coded as Antidepressant:
Selective serotonin reuptake inhibitor (SSRI); Antidepressant:
Serotonin-norepinephrine reuptake inhibitor (SNRI); Antidepressant: Tricyclic
antidepressant (TCA); Antidepressant: Monoamine oxidase inhibitors (MAOI);
Antidepressant: Other; Antiadrenergic; Antipsychotic; Antianxiety:
benzodiazepines; Antianxiety: nonbenzodiazepines; Phytocannabinoid; Mood stabilizer;
Psychostimulant; Steroids; Other pharmacotherapy; Mixed
- Complementary
and Integrative Health (CIH) subclass: Coded as Acupuncture; Clinical
hypnosis;
Meditation; Massage therapy; Tai
chi/qi gong;
Yoga; Energy therapies; Natural products; Creative
therapies;
Relaxation; Spirituality; Other CIH
If a treatment arm included
more than one type of treatment, both treatment types were coded in separate
variables. These dichotomous variables are labeled according to the same study
class categories noted above. For example, a treatment arm that includes the
psychotherapy CPT and the CIH intervention, relaxation, was coded as both
psychotherapy (Psychotherapy = yes) and CIH (CIH = yes).
- Because of this nuance, PTSD Repository users are cautioned that if they create a visualization based on a specific treatment type using the individual treatment type variables like Psychotherapy or Pharmacotherapy, in a few cases it could include some other treatments as well.
- If users want to identify treatment arms that only include one specific type of treatment, they can limit the studies by first filtering those from the Study Class variable that focuses on the type of treatment of interest (e.g., selecting Psychotherapy as treatment class would ensure that a CPT plus relaxation study would not be included because it would have been coded as “Psychotherapy and CIH”) or by reviewing the treatment arms that are identified by searching psychotherapy and then omitting those that have another type of treatment indicated as well.
Treatment Name (arm level)
Treatments are assigned 2 types of names: the name used by the investigators in the research (Treatment
Name) and the standardized National Center for PTSD (Standardized Treatment Name), which
applies more consistent names across treatments to allow for easier identification
and comparison of treatments. Most users of the PTSD Repository will likely
find the Standardized Treatment Name to be the one they want to use.
What datasets are included in the PTSD Repository?
An easy way to locate a dataset
is to select the “Data Catalog” button in the top navigation (menu icon on the left). If you are not
signed in, look for the “Browse” button on the top left. Then, in the Data Catalog, use the left
navigation or filter, select "View Types" and choose “Datasets.”
Below is a short description of
all datasets included in the PTSD Repository. Click the dataset name to go to
the primer page which provides information about the dataset, a
full list and description of variable columns, and metadata (keywords or
phrases used to describe, tag or organize the data). Included in the description is information
about whether the data are organized at the study or arm level.
Study Characteristics (study
level). The Study Characteristics dataset
includes basic information about study design. In addition to study class
(defined above), the dataset includes clinical setting, whether the study
looked at PTSD symptom clusters, whether there were any subgroup analyses, type
of diagnostic measure used (e.g., diagnostic interview, self-report), and
provider credentials. Use this dataset to better understand the variety of RCT
designs employed, compare studies by study class or to identify gaps in the
literature for potential new RCTs. Visualizations made from this dataset will
be based on the 550RCTs included in the PTSD Repository.
Sample Characteristics
(study level). The Sample Characteristics dataset includes
information on the participants who are included in the studies. This
information is provided at the study level—meaning, data are not broken down
into the specific treatment arms such as the intervention or control groups.
Use this dataset if you want to learn more about the number of participants in
the study; inclusion and exclusion criteria related to substance use and
suicidality; baseline clinical characteristics such as PTSD severity, trauma
type, military status, and comorbidities; and basic demographic information
such as age, gender or race for the sample as a whole. Visualizations made from this table will be based
on the 550RCTs included in the PTSD Repository and answer questions such
as Who Has Been Studied.
Study Interventions (arm level). The Study Interventions dataset includes information about each of 1199 specific treatment
arms that were studied in all 550 RCTs. Each study arm was coded to indicate the
type of intervention or comparison condition. Download our Reference Guide: Treatments Found in the PTSD Repository. This dataset includes the study-level Study Class as
well as individual variables for each category of treatment, coded as Yes or No
for each arm. Study arm treatment category variables are as follows:
- Pharmacotherapy (as well as a subclass such as antidepressant, antianxiety, etc.)
- Psychotherapy (as well as a subclass to identify trauma-focused or non-trauma-focused therapy)
- Complementary and Integrative Health (CIH; as well as a subclass such as relaxation or meditation)
- Nonpharmacologic Biological
- Nonpharmacologic Cognitive
- Collaborative Care
- Other Study Class
- Control
The Study Intervention dataset
also includes information on the format of the treatment (individual, group,
couples, mixed); treatment delivery method (in person, by phone, by video,
technology alone, technology assisted, written or mixed); dose or amount of
treatment and, treatment completion and adherence. Use this dataset to
learn about treatment studies of a particular type, including our story, Which PTSD treatments have been studied?
Datasets related to PTSD outcomes (arm level). There are 4
PTSD outcome datasets. Data is reported separately for continuous and
dichotomous outcome measures. Datasets are further categorized as to
whether the comparisons are made within or between arms.
- PTSD Continuous Outcomes Within Arms. This dataset provides within-arm results for continuous measures. Included is information on score means and variance, and pre-post comparisons (pre-post score difference, statistical test p-value, and study-reported effect sizes). Where possible, the within-arm standardized effect size (analog of Cohen’s d) was calculated. For the calculated effect size, negative values indicate a lower score at follow-up compared to baseline, while positive values indicate a higher score at follow-up. Each treatment arm is presented on its own row. There are also separate rows for studies with more than one measure, time point and analysis type.
- PTSD Dichotomous Outcomes Within Arms. This dataset provides within-arm results for dichotomous outcomes: loss of diagnosis and clinically meaningful response. Included is information on how loss of diagnosis and clinically meaningful response were defined as well as the percent of participants who achieved loss of PTSD diagnosis/clinically meaningful response Each treatment arm is presented on its own row. There are also separate rows for studies with more than one measure, time point and analysis type or when there is more than one definition of diagnostic change or clinically meaningful change.
- PTSD Continuous Outcomes Between Arms. This dataset provides results for between-arm comparisons of continuous measures. Included is information on score differences, p-value for statistical test, and study-reported effect sizes. Where possible, the between-arm standardized effect size was calculated, using Hedges’ g. We calculated Hedges’ g based on the following (in order of preference): 1) adjusted mean difference; 2) follow-up scores; 3) unadjusted mean difference; 4) change scores. For the calculated effect size, information on the basis for calculation is included along with measures of variance. Negative values for Hedges’ g indicate a larger decrease (or lower follow-up score) in the first arm than in the second arm, while positive values indicate the reverse. Each comparison is on a separate row, and pairwise as well as omnibus (multi-arm) comparisons are included. There are also separate rows for each measure, time point and analysis type.
- PTSD Dichotomous Outcomes Between Arms. This dataset provides between-arm results for dichotomous outcomes: loss of diagnosis and clinically meaningful response. Included is information on how loss of diagnosis and clinically meaningful response were defined, p-value for statistical test, and study-reported effect sizes. Each comparison is on a separate row, and pairwise as well as omnibus (multi-arm) comparisons are included. There are also separate rows for studies with more than one measure, time point and analysis type or when there is with more than one definition of diagnostic change or clinically meaningful change.
Other
Outcomes (arm level). The Other Outcomes dataset
includes information on whether the trial includes measures of depression,
anxiety, substance use, sleep, anger, quality of life and functioning. Results
in this dataset are provided for each treatment arm. The name of the measure is
included as well as the between-group effect sizes. Use this
dataset to learn how about the effects of PTSD treatments on other outcomes.
Risk
of Bias (study level). Each study in the PTSD Repository was coded for
risk of bias (ROB) in five specific domains as well as an overall risk of bias
for the study using Cochrane’s
Risk of Bias 2 rating system. Each domain and the
overall rating were coded as high risk, low risk or some concerns.
For more
information about ROB, see our story, Risk of Bias Assessment in the
PTSD Repository.
Harms (arm level). The Harms
dataset includes information on serious adverse events and participant
withdrawals from the studies due to adverse events. Adverse events are reported
as the percent of participants within a treatment group that experienced an
adverse event or who withdrew from the study. Detail on the specific
adverse event or reason for withdrawal is provided when available. This dataset
also reports on the percentage of patients who attempted or completed suicide
(when available).
Other non-data datasets. There are 3 other datasets that are purely informational and
do not contain specific data elements:
- References and Study Identifiers. Identifying information about each study, including citation, PubMed ID, PTSDpubs ID, ClinicalTrials.gov ID, location and funding source.
- Secondary Studies. A list of secondary studies included with primary studies reported in the PTSD Repository. These secondary, referenced studies may or may not have contributed data to the abstraction of the listed primary study.
- Abbreviations. Definitions of abbreviations found throughout the PTSD Repository.
Key Variables Repeated in Each Dataset
Repeated
variables include a subset of variables that generally describe the treatment
and sample; these are included in all datasets described
above. Having some variables repeated in every dataset means that it
is easier to create visualizations that include these variables.
In a few cases there are study-level variables included in
datasets organized by arm. In these cases, the variable name will include
“(Study level)” to remind the user that the data applies to the full RCT, not
any particular arm of the study. For example, the “Percent Female (Study
level)” variable is coded at the study level, so the values correspond to the
overall study, not the specific treatment arm and will thus be identical for
each arm (row) of a study.
Next Steps
Now
that you understand how the data are organized, read our How do I get started working with the PTSD Repository? story to
learn tips on using the platform to create your own visualizations,
download data and more.
- Using
the Data Catalog button, choose "Files and Document" in the View
Type menu to download our Reference Guide: Treatments Found in the PTSD Repository that
explains the classification of treatments included in the studies