📋 PheKnowVec: Medication Code Set Verification

Task Overview

Description
Verification of the mappings between phenotype source strings and source codes from OMOP common data model vocabularies
Status
In progress 🔨
Team
Code Set Generator: @Tiffany C 

Evaluators:

Domain Expert:
  • Katy Trinkley, PharmD @Katy T 
Spreadsheet
GitHub Repository

Table of Contents


Timeline

 
Verification Task
May 20Aug 23
22T
23F
24S
25S
26M
27T
28W
29T
30F
31S
September 2019
1S
2M
3T
4W
5T
6F
7S
8S
9M
10T
11W
12T
13F
14S
15S
16M
17T
18W
19T
20F
21S
22S
23M
24T
25W
26T
27F
28S
29S
30M
October 2019
1T
2W
3T
4F
5S
6S
7M
8T
9W
10T
11F
12S
13S
14M
15T
16W
17T
18F
19S
20S
21M
22T
23W
24T
25F
26S
27S
28M
29T
30W
31T
November 2019
1F
2S
3S
4M
5T
6W
7T
8F
9S
10S
11M
12T
13W
14T
15F
16S
17S
18M
19T
20W
21T
22F
23S
24S
25M
26T
27W
28T
29F
30S


Project Description


Background

  • "Phenotypes are the measurable biological, behavioral and clinical markers of a condition or disease.  The process of deriving research-grade phenotypes from clinical data using computer-executable algorithms is called computational phenotyping (phenotyping for short)” (PMID: 27506131)

Computational phenotyping approaches have great potential to aid in diagnosis, prognosis, therapeutic decision-making, and identification of mechanisms or novel biomarkers. Currently, these methods have limited:
  • Generalizability because they are tailored to specific source vocabularies or hospital systems.
  • Translational relevance because they primarily rely on clinical data, which requires additional mapping to incorporate, for example, molecular or physiologic data.
  • Scalability because creating definitions is a time-consuming, iterative process requiring both domain expertise and robust external validation.

Objective: Develop a method (PheKnowVec: Phenotype Knowledge Vectors) for deriving, implementing, and validating computational phenotypes that addresses the aforementioned limitations by:
  • Mapping standardized clinical terminology concepts to linked open data.
  • Using embedding methods, which convert large complex heterogeneous data into scalable compressed vectors without semantic information loss.

Phenotypes
We will implement all phenotypes appropriate for use with pediatric and adult populations from the eMERGE network's Phenotype KnowledgeBase (n=9). Additional information on the phenotypes listed below can be found here.