Informatics | Center for Precision Medicine

An automated approach to calculating the daily dose of tacrolimus in electronic health records.

Xu H, Doan S, Birdwell KA, Cowan JD, Vincz AJ, Haas DW, Basford MA, Denny JC. An automated approach to calculating the daily dose of tacrolimus in electronic health records. AMIA Joint Summits on Translational Science proceedings AMIA Summit on Translational Science. 2010(2010). 71-5. PMID: 21347153 [PubMed] PMCID: PMC3041548

Clinical research often requires extracting detailed drug information, such as medication names and dosages, from Electronic Health Records (EHR). Since medication information is often recorded as both structured and unstructured formats in the EHR, extracting all the relevant drug mentions and determining the daily dose of a medication for a selected patient at a given date can be a challenging and time-consuming task.

Electronic medical records for genetic research: results of the eMERGE consortium.

Kho AN, Pacheco JA, Peissig PL, Rasmussen L, Newton KM, Weston N, Crane PK, Pathak J, Chute CG, Bielinski SJ, Kullo IJ, Li R, Manolio TA, Chisholm RL, Denny JC. Electronic medical records for genetic research: results of the eMERGE consortium. Science translational medicine. 2011 Apr 20;3(3). 79re1. PMID: 21508311 [PubMed] PMCID: PMC3690272 NIHMSID: NIHMS478009.

Clinical data in electronic medical records (EMRs) are a potential source of longitudinal clinical data for research. The Electronic Medical Records and Genomics Network (eMERGE) investigates whether data captured through routine clinical care using EMRs can identify disease phenotypes with sufficient positive and negative predictive values for use in genome-wide association studies (GWAS). Using data from five different sets of EMRs, we have identified five disease phenotypes with positive predictive values of 73 to 98% and negative predictive values of 98 to 100%.

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Jiang M, Chen Y, Liu M, Rosenbloom ST, Mani S, Denny JC, Xu H. A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. Journal of the American Medical Informatics Association : JAMIA. 18(18). 601-6. PMID: 21508414 [PubMed] PMCID: PMC3168315

The authors' goal was to develop and evaluate machine-learning-based approaches to extracting clinical entities-including medical problems, tests, and treatments, as well as their asserted status-from hospital discharge summaries written using natural language. This project was part of the 2010 Center of Informatics for Integrating Biology and the Bedside/Veterans Affairs (VA) natural-language-processing challenge.

Detecting drug interactions from adverse-event reports: interaction between paroxetine and pravastatin increases blood glucose levels.

Tatonetti NP, Denny JC, Murphy SN, Fernald GH, Krishnan G, Castro V, Yue P, Tsao PS, Tsau PS, Kohane I, Roden DM, Altman RB. Detecting drug interactions from adverse-event reports: interaction between paroxetine and pravastatin increases blood glucose levels. Clinical pharmacology and therapeutics. 2011 Jul;90(90). 133-42. PMID: 21613990 [PubMed] PMCID: PMC3216673 NIHMSID: NIHMS329087.

The lipid-lowering agent pravastatin and the antidepressant paroxetine are among the most widely prescribed drugs in the world. Unexpected interactions between them could have important public health implications. We mined the US Food and Drug Administration's (FDA's) Adverse Event Reporting System (AERS) for side-effect profiles involving glucose homeostasis and found a surprisingly strong signal for comedication with pravastatin and paroxetine.

Analyses of longitudinal, hospital clinical laboratory data with application to blood glucose concentrations.

Schildcrout JS, Haneuse S, Peterson JF, Denny JC, Matheny ME, Waitman LR, Miller RA. Analyses of longitudinal, hospital clinical laboratory data with application to blood glucose concentrations. Statistics in medicine. 2011 Nov 30;30(30). 3208-20. PMID: 21948391 [PubMed] PMCID: PMC3339442 NIHMSID: NIHMS371558.

Electronic medical record (EMR) systems afford researchers with opportunities to investigate a broad range of scientific questions. In contrast to purposeful study designs, however, EMR data acquisition procedures typically do not align with any specific hypothesis. Subsequent investigations therefore require detailed characterization of clinical procedures and protocols that underlie EMR data, as well as careful consideration of model choice. For example, many intensive care units currently implement insulin infusion protocols to better control patients' blood glucose levels.

PASTE: patient-centered SMS text tagging in a medication management system.

Stenner SP, Johnson KB, Denny JC. PASTE: patient-centered SMS text tagging in a medication management system. Journal of the American Medical Informatics Association : JAMIA. 19(19). 368-74. PMID: 21984605 [PubMed] PMCID: PMC3341792

To evaluate the performance of a system that extracts medication information and administration-related actions from patient short message service (SMS) messages.

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study.

Kho AN, Hayes MG, Rasmussen-Torvik L, Pacheco JA, Thompson WK, Armstrong LL, Denny JC, Peissig PL, Miller AW, Wei WQ, Bielinski SJ, Chute CG, Leibson CL, Jarvik GP, Crosslin DR, Carlson CS, Newton KM, Wolf WA, Chisholm RL, Lowe WL. Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study. Journal of the American Medical Informatics Association : JAMIA. 19(19). 212-8. PMID: 22101970 [PubMed] PMCID: PMC3277617

Genome-wide association studies (GWAS) require high specificity and large numbers of subjects to identify genotype-phenotype correlations accurately. The aim of this study was to identify type 2 diabetes (T2D) cases and controls for a GWAS, using data captured through routine clinical care across five institutions using different electronic medical record (EMR) systems.

Predicting clopidogrel response using DNA samples linked to an electronic health record.

Delaney JT, Ramirez AH, Bowton E, Pulley JM, Basford MA, Schildcrout JS, Shi Y, Zink R, Oetjens M, Xu H, Cleator JH, Jahangir E, Ritchie MD, Masys DR, Roden DM, Crawford DC, Denny JC. Predicting clopidogrel response using DNA samples linked to an electronic health record. Clinical pharmacology and therapeutics. 2012 Feb;91(91). 257-63. PMID: 22190063 [PubMed] PMCID: PMC3621954 NIHMSID: NIHMS346495.

Variants in ABCB1 and CYP2C19 have been identified as predictors of cardiac events during clopidogrel therapy initiated after myocardial infarction (MI) or percutaneous coronary intervention (PCI). In addition, PON1 has recently been associated with stent thrombosis. The reported effects of these variants have not yet been replicated in a real-world setting.

Modeling drug exposure data in electronic medical records: an application to warfarin.

Liu M, Jiang M, Kawai VK, Stein CM, Roden DM, Denny JC, Xu H. Modeling drug exposure data in electronic medical records: an application to warfarin. AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium. 2011(2011). 815-23. PMID: 22195139 [PubMed] PMCID: PMC3243123

Identification of patients' drug exposure information is critical to drug-related research that is based on electronic medical records (EMRs). Drug information is often embedded in clinical narratives and drug regimens change frequently because of various reasons like intolerance or insurance issues, making accurate modeling challenging. Here, we developed an informatics framework to determine patient drug exposure histories from EMRs by combining natural language processing (NLP) and machine learning (ML) technologies.

Detecting abbreviations in discharge summaries using machine learning methods.

Wu Y, Rosenbloom ST, Denny JC, Miller RA, Mani S, Giuse DA, Xu H. Detecting abbreviations in discharge summaries using machine learning methods. AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium. 2011(2011). 1541-9. PMID: 22195219 [PubMed] PMCID: PMC3243185

Recognition and identification of abbreviations is an important, challenging task in clinical natural language processing (NLP). A comprehensive lexical resource comprised of all common, useful clinical abbreviations would have great applicability. The authors present a corpus-based method to create a lexical resource of clinical abbreviations using machine-learning (ML) methods, and tested its ability to automatically detect abbreviations from hospital discharge summaries.

RSS: