TY - JOUR
T1 - Columbia Open Health Data, clinical concept prevalence and co-occurrence from electronic health records
AU - Ta, Casey N.
AU - Dumontier, Michel
AU - Hripcsak, George
AU - Tatonetti, Nicholas P.
AU - Weng, Chunhua
PY - 2018/11/27
Y1 - 2018/11/27
N2 - Columbia Open Health Data (COHD) is a publicly accessible database of electronic health record (EHR) prevalence and co-occurrence frequencies between conditions, drugs, procedures, and demographics. COHD was derived from Columbia University Irving Medical Center's Observational Health Data Sciences and Informatics (OHDSI) database. The lifetime dataset, derived from all records, contains 36,578 single concepts (11,952 conditions, 12,334 drugs, and 10,816 procedures) and 32,788,901 concept pairs from 5,364,781 patients. The 5-year dataset, derived from records from 2013-2017, contains 29,964 single concepts (10,159 conditions, 10,264 drugs, and 8,270 procedures) and 15,927,195 concept pairs from 1,790,431 patients. Exclusion of rare concepts (count
AB - Columbia Open Health Data (COHD) is a publicly accessible database of electronic health record (EHR) prevalence and co-occurrence frequencies between conditions, drugs, procedures, and demographics. COHD was derived from Columbia University Irving Medical Center's Observational Health Data Sciences and Informatics (OHDSI) database. The lifetime dataset, derived from all records, contains 36,578 single concepts (11,952 conditions, 12,334 drugs, and 10,816 procedures) and 32,788,901 concept pairs from 5,364,781 patients. The 5-year dataset, derived from records from 2013-2017, contains 29,964 single concepts (10,159 conditions, 10,264 drugs, and 8,270 procedures) and 15,927,195 concept pairs from 1,790,431 patients. Exclusion of rare concepts (count
KW - ESTIMATING DISEASE PREVALENCE
KW - UNITED-STATES
KW - STATISTICS
KW - KNOWLEDGE
UR - https://springernature.figshare.com/articles/dataset/5-year_data_set_paired_concept_counts/6731309/1
UR - https://springernature.figshare.com/articles/dataset/5-year_dataset_paired-concept_deviations/7148075/1
UR - https://springernature.figshare.com/articles/dataset/5-year_data_set_single_concept_count/6731318/1
UR - https://springernature.figshare.com/articles/dataset/5-year_dataset_single_concept_deviations/7148081/1
UR - https://springernature.figshare.com/articles/dataset/Lifetime_dataset_single_concept_deviations/7148078/1
UR - https://springernature.figshare.com/articles/dataset/Lifetime_data_set_single_concept_count/6731315/1
UR - https://springernature.figshare.com/articles/dataset/Lifetime_dataset_paired-concept_deviations/7148084/1
UR - https://springernature.figshare.com/articles/dataset/Lifetime_data_set_paired_concept_counts/6731321/1
UR - https://springernature.figshare.com/articles/dataset/Concepts/6731312/1
U2 - 10.1038/sdata.2018.273
DO - 10.1038/sdata.2018.273
M3 - Article
C2 - 30480666
SN - 2052-4463
VL - 5
JO - Scientific data
JF - Scientific data
M1 - 180273
ER -