Innovation: Endo and Stone: Big Data informatics and health records

Big Data, informatics, and Electronic Health Records

Health Record Graph

Big data from electronic health record datasets are another focus area for Dr. Hsi.  He is building tools to structure the electronic health records for kidney stone research.  He has an interest in epidemiology and patient outcomes.  These bioinformatics approaches will enable collaborative research to examine difficult-to-answer questions on how and why kidney stones form.  An institutional resource that is available as the electronic health records repository is the Vanderbilt Research Derivative and Synthetic Derivative.

With the informatics team at Vanderbilt, Dr. Hsi developed a natural language processing method to accurately identify kidney stone patients and kidney stone composition within text data within over 125 million notes from the Vanderbilt de-identified electronic health record[HR1].  The project was also able to replicate known disease associations of kidney stone disease.  The team is now developing methods to extract kidney stone outcomes from the electronic health record.

Changing stone compositions

Kidney stone patients often have repeat stone composition data over time, providing longitudinal information.  This figure shows changing stone compositions among the kidney stone patient population the Vanderbilt EHR. 

proportions of stone compositions

This figure shows the proportions of stone compositions and combinations of stone types within the Vanderbilt EHR (UpSet plot)