Data scientist with over 6 years of experience in higher education and institutional research, specializing in data science, data analysis, statistical modeling, and data visualization.
M.S. in Mathematical Sciences, University of West Florida, 2018
- GPA: 3.97/4.00
B.S. in Statistics and B.A in Mathematics, University of Florida, 2016
- GPA: 3.85/4.00
- Completed a logistic regression analysis for the Health, Leisure, and Sports (HLS) Facility, discovering a strong correlation between gym usage and fall-to-fall retention among first-time-in-college (FTIC) students. Findings were utilized in HLS promotional materials.
- Designed and deployed interactive Tableau dashboards to track graduation progress for various student cohorts, saving countless hours of time previously spent on manual processes.
- Performed a k-means cluster analysis to select peer and aspirant institutions for SACSCOC accreditation, significantly reducing manual review time.
- Trained and mentored a data analyst, gradually handing over former responsibilities and facilitating a speedier and more efficient fulfillment of data requests.
- Developed and maintained an R package for Institutional Research, streamlining and modularizing routine departmental functions to enhance efficiency and accuracy. Tools: R, SQL Server, Oracle.
- Wrote R and Python scripts that automated the collection of approximately weekly progress metrics related to graduation metrics for relevant student cohorts, saving countless hours.
- Developed an algorithm using Markov Chains to project fall headcounts up to five years into the future as part of the required annual Accountability Plan reporting.
- Obtained certification for TerminalFour to maintain and update the departmental website, quickly learning new skills to assume responsibilities from a sudden vacancy.
- Fulfilled ad-hoc requests in a timely and accurate manner, highlighting key insights with differing methods according to the intended audience and establishing methods to carry out routine or automated updates if necessary.
R, Python
ggplot2, Shiny, Tableau
Regression, Hypothesis Testing
Classification, Clustering, Time Series, H2O AutoML
SQL (Microsoft SQL Server and Oracle SQL Developer), MongoDB
Excel, Word, Access, PowerPoint
Alteryx
TerminalFour, Amazon Web Services, Ubuntu, Nginx, Docker
HTML, CSS, JavaScript
AWS EC2
GitHub, GitLab
Apache Spark, sparklyr package in R
Proficient with JSON, YAML, RDS, XML, Parquet, Feather, Arrow, and other file formats
Experience with data wrangling, data cleaning, and ETL processes using tidyverse packages and other related tools.
IPEDS, US News, College Scorecard, Department of Education, Common Data Set
State University System Performance Based Funding Model
Experience with assessment processes and accreditation requirements such as SACSCOC and HLC
Designing, administering, and analyzing surveys for student feedback and alumni tracking
Knowledge of data governance principles, privacy laws (e.g., FERPA), and data management policies
Involvement in enrollment forecasting, retention analysis, and strategic enrollment management
Involvement in performing statistical analyses for academic research projects and writing methodology sections for academic papers and white papers
Member of the Association for Institutional Research (AIR) and have given multiple presentations at the national and regional forums
Patterns of ovarian cancer and uterine cancer mortality and incidence in the contiguous USA
Science of the Total Environment · Dec 20, 2019
Summary: A comprehensive analysis of the patterns of ovarian and uterine cancer mortality and incidence across the contiguous United States.
Against Common Assumptions, the World's Shark Bite Rates Are Decreasing
Journal of Marine Biology · Feb 6, 2019
Summary: This study provides evidence that contrary to popular belief, shark bite rates around the world are decreasing.
Email: [email protected]
LinkedIn: linkedin.com/in/jonathanelee1993