Programming Experience

R-Studio

With over four years of experience in R and RStudio, I specialize in data cleaning, visualization, and statistical modeling.

Highlighted projects (as seen on Github):

  • Deepfake Detection Data – Applied regression and group comparison models to detect patterns in human responses to AI-generated stimuli.

  • Life Sampling Dataset – Processed and analyzed millions of longitudinal observations to identify developmental and aging trends.

  • Non-Profit Demographics – Built analyses and visualizations of mentorship and diversity data to inform program planning.

Libraries: tidyverse, ggplot2, dplyr, readr, lme4
Methods: Linear regression, multilevel regression, ANOVA, ANCOVA, MANOVA

Link to R Portfolio

Python

Experienced in using Python for machine learning, data analysis, and forecasting across research and applied projects.

Highlighted projects (as seen on Github):

  • Aviation Safety Forecasting – Applied time-series models (SARIMA) to predict trends in NASA ASRS accident reports.

  • Mentorship Demographics Forecasting – Built machine learning pipelines (Ridge regression, one-hot encoding) to forecast mentor/mentee participation.

Libraries: pandas, numpy, scikit-learn, matplotlib, seaborn
Methods: Regression modeling, classification, cross-validation, time-series forecasting, data visualization

Link to Python Portfolio

SQL

Skilled in using SQL for data cleaning, querying, and extracting actionable insights from large, complex datasets.

Highlighted projects (as seen on Github):

  • COVID-19 Trends Analysis (Our World in Data) – Used SQL to aggregate and compare global case counts, vaccination rates, and mortality trends across countries, highlighting the use of SQL for public health monitoring and time-series reporting.

Skills: Joins, subqueries, window functions, CTEs, aggregate functions
Applications: Business intelligence, customer behavior insights, public health analytics

Link to SQL Portfolio