How do I get started working with the PTSD Repository?
The PTSD Repository contains data from 496 randomized
controlled trials (RCTs) focused on PTSD treatment. If you want to dig into
the data a bit more, these tips can help you navigate the platform.
Below, we provide quick information for some common tasks, as well as links to
more detailed documentation and step-by-step videos.
On the platform, you can read our stories that use the PTSD Repository data to learn more about
PTSD treatment and clinical trials. You are not able to manipulate the data or run statistical
analyses on the platform, but you can download the data for your own projects.
We
expect visitors to the site may have varied levels of comfort in working with
data. This guide is intended for anyone, with more information for researchers looking to download and analyze data from the PTSD Repository.
Top Tips to Get Started
Create a Profile
Before you get too far, we
encourage you to create an account with Tyler Data & Insights (the platform that hosts the PTSD Repository). While anyone using the
PTS Repository may create visualizations and other content using the data, in
order to save it you will need to sign up with the platform. This is easy and quick to do. You can click the "Sign In" button on the top right and within minutes you will
have a username and password. Or use this link to create an account.
Learn About the PTSD Repository
We offer an overview of the PTSD Repository platform in our "About" stories, highlighting different features for unique audience groups.
As noted,
data was abstracted (taken) from 496 RCTs of PTSD treatment. To help simplify
use on this platform, data was reformatted into multiple datasets.
The data were formatted in such a way that each table (or
dataset) is intended to capture a core characteristic of the studies. For example,
some tables are at the study level highlighting characteristics of the RCT
overall, such as demographics, methodology (e.g., inclusion/exclusion criteria), risk of bias assessment, or reference information. Other tables are at the arm level and provide
detailed information about each intervention arm in the RCTs, such as treatment
specifics or outcomes.
To best work with the data in the PTSD Repository, you first
need to identify which table contains your data of interest. You can learn more about the specific data in
each table in our story, How are the data organized?.
Easily Find What You Need: Data Catalog
Now that you have identified your table of interest, you can go
directly to it. At the top left of the page is a menu icon with a link to our Data Catalog, which is a
“home base” for navigating the site. (If you are not signed in, the
Data Catalog is found using the “Browse” tab at the top left.) The Data Catalog
lists all content on the PTSD-Repository. On the left, there is a “View Types”
menu--choosing “Datasets” will give you a list of all data tables included in
the PTSD Repository. You can also choose to see a list of all “Stories,” "Files and Documents" (e.g. reference materials) or
other content options.
Using the search bar at the top of the data catalog is also
an easy way to locate content.
Read Our Stories
We write Data Stories that help you see and interact with
data in the PTSD Repository. Our stories help you understand the context of
data and see trends using visualizations we created. The combination of text
and visuals you can interact with help explain key themes from PTSD treatment
studies. You can read a preview of all of our Data Stories.
Viewing Our Data
Whether you use the Data
Catalog, Browse or Search options to locate a dataset, you will be taken to its
primer page. Take some time to read through the information on the primer page,
which includes:
- Description: A text overview of the data found in the dataset is given at the top of the primer page
- Metadata (About the Dataset): Information about the dataset includes last update, tags and a button to contact the dataset owner if you have questions
- What’s in this Dataset?: A high-level summary of dataset properties (e.g., number of rows and columns)
- Columns in this Dataset: List of all variables (columns) in the dataset with a description and list of response options where possible
It is important that users pay
attention to dataset information to ensure incorrect conclusions or
“misreading” of data does not occur.
From the primer page, you can
select the “View Data” button in the top right (or lower on the screen, above
the Table Preview) to open the full dataset for viewing.
Understanding row-level organization of the data is
also important. Because data from RCTs in the PTSD Repository are organized into both study-level and arm-level information, there are a few
pointers to remember.
Datasets at the study-level include one row for each RCT. When a dataset includes arm-level data, there are
multiple rows for a single RCT. (See the image on the left for an example of arm-level data.) Each arm represents a treatment intervention
(or control) used in the study. For example, the table to the left includes 1
row for each treatment arm. Some studies only have 2 arms, A and B. Others,
like the Buhmann2016 example here, have as many as 4.
In addition to multiple arms, a single RCT may report
outcomes at multiple assessment points, using different outcome measures (e.g.,
CAPS, PCL, etc.), or using different analyses (e.g., Intention to Treat/ITT vs.
Completer/Comp). In each case, a separate row is included and labeled to allow for
comparison. More detail about this is included where “Study ID” is explained in
our How are the data organized? story.
Create Your Own Visualizations
Once you have a general understanding about the data
included in the PTSD Repository and locate the table you want, you are ready to
get started creating your own visualizations.
Whether you are on a dataset primer page or viewing the full
dataset, there are buttons to “Visualize” or “Create Visualization”. If you are
not signed in, the platform will prompt you to do so to be able to save your
visualization. It’s not necessary to login. If you do not have an account, you
will need to capture the visualization in a screenshot; the visualization will
lose interactive features in this option.
The Support page for how to explore,
visualize and analyze data on the platform offers a number of tutorial
videos. Below is an example on how to
create a stacked bar chart:
Downloading Data
You can download a dataset directly from a primer page (by
choosing the “Export” button on the top right). Another option is to click the
“View Data” button from the dataset primer page. (The “View Data” button is at
the top right and above the Table Preview on the primer page.) Once you open
the data view, you can select the “Export” button on the top right to download
different data formats:
- CSV (Comma Separated Values)
- CSV for Excel (Tab Separated Values)
- CSV for Excel (Europe)
- JSON
- RDF
- RSS (with GeoRSS information if there is a Location column in the dataset)
- TSV for Excel
- XLM
Note that the data were formatted to be machine-readable,
meaning that the structure of each table includes column variables that contain
one value per row. This means that for some variables, there is more than one
row for each RCT. Read more in our story, How are the data organized?.
If you are exporting to Excel (CSV) and notice some
inconsistencies in value options (e.g., added time stamp, 00:00) for some
variables, this Exporting
to CSV article is helpful.
Filtering data prior to downloading
You may want to filter the variables of primary interest to
you. For example, you may be interested in PTSD studies that were specific to
Veteran samples (Column/Variable: Military Status) or included psychotherapies
(Column/Variable: Study Class). Note that this filtering option is available
only if you are signed in.
Once you find the dataset you need, go to the primer page
and select “View Data”. You’ll land on a page that includes a tabular view of
the dataset. Note: You may need to scroll to the right to see the whole
dataset.
Watch this short video on how to filter data within a dataset.
Once you have filtered the dataset, you need to save
the view. Choose “Save As …” from the top left. Once you have saved this
filtered dataset, you can find it within the Data Catalog by clicking “Filtered
view” category in the left navigation.
This help page will introduce you to the platform's "Exploration Canvas" where you will explore and shape your data.
Access via API
Each
dataset on the platform has a corresponding Application Programming Interface, or
API, document that is hosted on dev.socrata.com. This page
contains details on utilizing the API for the particular dataset.
The
API Docs page can be found from either dataset's primer page or data table
page.
Access via Odata
Use OData to open the dataset in a number of tools, such as Tableau. This provides a direct connection to the data that can be refreshed on-demand within the connected application. See the video below to learn how to connect PTSD Repository data with Tableau.
Need More Help?
The Tyler Data & Insights platform that hosts the PTSD Repository is a powerful
tool with many features that are beyond the scope of this story. We
encourage you to explore the PTSD Repository and what it can do. Most of
the features and controls are intuitive and can be learned with just a bit of curiosity and experience. If you're stuck, please do not hesitate to contact us with
questions: ncptsd@va.gov. (Please note "PTSD Repository" in the subject line.)
For full help documentation, see Data Insights and Knowledge Base.