Carolyn Davis

Data Scientist

(907) 750-1653



Available Work Locations: Remote

Military Veteran

Inactive Top Secret

Bachelor's Degree

Core Technologies: Python, MySQL, Spark, Tableau, Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, Anaconda, Jupyter Notebooks, Git/GitHub

Core Competencies: Data Storytelling, Applied Statistics, Machine Learning, Natural Language Processing, Classification, Regression, Clustering, Time Series Analysis, Anomaly Detection

Hire Me Because

I possess the desire to solve real-world problems with machine learning, data science, and practical experience. My passion for data insight is evident in every project I undertake.

My Capstone Project: Stroke Prediction

This project analyzes health data to identify the features most indicative of stroke occurrence. After exploring the key drivers of stroke, the dataset is used to develop and evaluate predictive models that outperform a baseline classification prediction for stroke occurrence. The classification models being tested and evaluated include Logistic Regression, Naive Bayes, Random Forest, K-Nearest Neighbors, and Decision Tree. The top models are compared using a Receiver Operating Characteristic (ROC) curve, which shows the diagnostic ability of a binary classifier as its discrimination threshold is varied. The best-performing model will then be identified as the top predictor of stroke occurrence given the engineered features.
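As a rough illustration of the ROC comparison described above, here is a minimal pure-Python sketch. It is not the project's code: the labels, the scores, and the names `roc_points`, `auc`, `model_a`, and `baseline` are all made up for the example. In practice, scikit-learn's `roc_curve` and `roc_auc_score` would likely do this work.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs as the decision threshold sweeps from high to low."""
    pos = sum(y_true)
    neg = len(y_true) - pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Area under the ROC curve using the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Hypothetical toy data: 1 = stroke, 0 = no stroke.
y_true = [0, 0, 1, 1, 0, 1, 0, 1]

# Hypothetical predicted probabilities; the baseline gives every case the same score.
model_scores = {
    "model_a": [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.5, 0.7],
    "baseline": [0.5] * 8,
}

aucs = {name: auc(roc_points(y_true, s)) for name, s in model_scores.items()}
best = max(aucs, key=aucs.get)

for name, value in aucs.items():
    print(f"{name}: AUC = {value:.3f}")
print("best:", best)
```

On this toy data the constant baseline scores an AUC of 0.5 and `model_a` scores 0.875, so `model_a` would be selected. A model is only worth keeping if its curve sits clearly above the baseline diagonal.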

“With technology, I am most passionate about discovering previously unseen relationships in the data and using methods of visualization to paint an insightful picture.”