Austin Shih
Senior, Applied Mathematics (Data Science and Economics minors), Vanderbilt University
Mentors: Yu Shyr and Chih-Yuan Hsu
During his internship, Shih focused on the use of computer simulations to create new methods to determine sample size for running a randomized Phase III trial for treatment crossover with respect for all three kinds of endpoints (continuous, categorical, and survival).
Jiming Yu
Advisor: Matt Shotwell
BS, Mathematics, Baylor University
Nicholas "Nick" Micheletti
MS thesis abstract:
The standard cost-effectiveness acceptability curve is a tool often employed in cost-effectiveness analysis that is calculated by determining one-sided bootstrapped p-values for a test of a hypothesis on incremental net monetary benefits, and mapping them over a range of acceptable resources. With the recent development of the second generation p-value comes an opportunity to innovate upon the often used cost-effectiveness acceptability curve. The second generation acceptability curve is constructed by comparing bootstrapped incremental net monetary benefit confidence intervals to pre-specified null hypothesis intervals. This comparison creates second generation p-values over a range of acceptable resources. We present the process of constructing a second generation acceptability curve and explore methods to depict the curve. The means of interpreting and understanding the second generation acceptability curve are also discussed. Various simulation studies are performed in order to explore the properties of the second generation acceptability curves, from changes to the distribution of costs or effectiveness measures, to other simulation parameters such as like null hypothesis interval width and sample size. We further provide example comparisons of the standard and second generation curves. The second generation curve is also employed in an applied example motivated by a real-world data set from a cancer cost-effectiveness study; specifically we follow a paper that looks at how cancer data sets are handled by cost-effectiveness analysis methods, and how they manage in handling uncertainty and inconclusiveness. Ultimately, the second generation acceptability curve is shown to provide improvements to the standard cost-effectiveness acceptability curve in its ability to depict regions of approximate equivalence and inconclusiveness.
Advisor: Andrew Spieker
BS Mathematics & BA Ancient Studies, Washington University
At NIH Center for Cancer Research since 2024. Currently Biostatistician.
Kenneth Liao
Liao previously worked as a research analyst in VUMC’s Genetic Medicine division. He’s interested in biostatistics because he is really drawn to being able to help people through avenues in statistics and mathematics.
MS thesis abstract:
There has been a growing criticism of hypothesis testing. Neuroimaging analysis almost exclusively focuses on hypothesis testing. Recent research has developed methods to use confidence sets instead, to draw conclusions, but has only focused on particular parameters or effect sizes that do not generalize to all statistics, such as noncontinuous and nonfunctional data. Here, we use the robust effect size index (RESI) framework, which is generally defined across many different types of models. We use RESI to develop a general approach to effect size-based inference for neuroimaging data using confidence sets derived from simultaneous confidence intervals (SCI) using bootstrapping from the pbj (parametric bootstrap joint) R package. From a given effect size threshold, this approach will identify regions of the brain that belong to the null set (areas where the effect size is less than the threshold) or the target set (areas where the effect size is greater than the threshold), with a prespecified confidence level. We then conduct simulations to evaluate this approach, using a parametric standard error normalization, and one method without any normalization. The coverage, average interval width, and the maximum interval width are reported to evaluate the SCIs. This approach can be applied to areas of research that focus on multivariate outcomes, such as genomics and imaging.
Thesis: Deriving Confidence Sets for Effect Sizes Using Simultaneous Confidence Intervals
Advisor: Simon Vandekar
BS, Statistics (minor in Ecology and Evolutionary Biology), University of Tennessee–Knoxville
At University of Minnesota since 2024. Currently Biostatistician.
Elisa Yazdani
MS, Biostatistics, Vanderbilt University
Elisa Yazdani
Additional projects:
- Harnessing Big Data to Arrest the HIV/HCV/Opioid Syndemic in the Rural and Urban South (Peter Rebeiro, supervisor; 2023–present)
- Developing Sigmoid Models for Alzheimer's Disease Progression (Dandan Liu, supervisor; 2022)
Honors include the Commodore Award in Biostatistics, 2023, "for enriching the department and graduate program through ingenuity, dedication, and altruism."
Activities include: Biostatistics Graduate Student Association treasurer, 2023; first-year liaison, 2022–2023
Over the course of three internships – one at Vanderbilt and two at Duke – Yazdani worked on a variety of projects, studying neurodevelopment in animals, cognitive diseases, electrophysiology, and the impacts of drugs. “Paternal THC Exposure in Rats Causes Long-Lasting Neurobehavioral Effects in the Offspring,” a paper on one of her projects at Duke, was published in the Neurotoxicology & Teratology Journal.
Thesis: A Pipeline for High-Throughput Drug Screening
Advisor: Amir Asiaee
BA, Chemistry and Psychology (minor in Mathematics), Washington & Jefferson College
At Vanderbilt University Medical Center since 2023. Currently Biostatistician.
Megan Hall
MS thesis abstract:
Ansa cervicalis stimulation (ACS) has been proposed as a treatment option for obstructive sleep apnea. Airflow, CPAP pressure, and ACS stimulation status were measured continuously during drug-induced sleep endoscopy. Data was divided into breaths and experimental variables were summarized across each breath. Airflow was summarized in three ways: Vimax, mean flow, and middle third mean flow. The treatment effects of ACS were ∆Pcrit and ∆Popen. ∆P is the difference in pressure between ACS and no stimulation required to reach set levels of airflow. Airflow is zero for ∆Pcrit and the average flow for non-flow limited breaths for ∆Popen. Breaths were divided into experimental conditions, which are a set of successive breaths a unique patient, nasal pressure and stimulation status. Current experimental protocol allows three breaths per experimental condition. ∆P estimates were compared via sampling first N breaths per experimental condition and a simulation of the same sampling method. ∆P estimates were also compared via bootstrap sampling the number of patients. For the breath sampling, ∆Pcrit sub-sample estimates approached the full-sample estimates at three breaths per experimental condition, while ∆Popen estimates did not. For the simulation, bias between the true ∆P and the simulated ∆P estimate decreased as number of breaths sampled increased. Some ∆P estimates (∆Popen for Vimax, ∆Pcrit for mean and middle third mean flow) do not plateau at the true ∆P, indicating bias in the ∆P calculation. Estimates tended to plateau at or before six breaths per condition. Standard error decreased as number of breaths sampled increased for the simulation, but standard error across flow measures stabilized by four breaths per condition for breath sampling. The number of patients needed to obtain stable estimates of ∆Pcrit and ∆Popen was small for Vimax and mean flow but greater for middle third mean flow. Because sampling three breaths per condition may yield biased estimates of ∆P, increasing the number of breaths per condition to six may be beneficial. Few patients were needed to obtain stable estimates of ∆P but increasing the number of patients decreased standard errors for all flow measures.
Thesis: Sampling Considerations in Intensive Longitudinal Data
Advisor: Matt Shotwell
BS, Biochemistry (minors in Mathematics and Applied Statistics), Western Kentucky University
At University of Kentucky since 2024. Currently Biomedical Data Science Assistant.
Nate Dowd
Research interests include: statistical modeling, causal inference, machine learning, R, data visualization
Thesis: Strategies and Considerations for Receiver Operating Characteristic (ROC) Regression Modeling
Advisor: Andrew Spieker
BS, Biology (minor in Statistics), Colby College
At Frist Center for Autism and Innovation since 2024. Currently Research Statistician.
Lydia Yao
BS, Data Science and Mathematical Statistics, Purdue University