I am an excellent problem solver, I bring good energy, and I’ve got thick skin! I enjoy receiving constructive feedback, learning new skills, and staying sharp on my current skills. I have performed every phase of the data science pipeline from acquire to preparation to exploration to modeling and delivery of reports.
Available Work Locations: Remote • San Antonio, TX
Core Technologies: Python, MySQL, Spark, Tableau, Pandas, NumPy, Matplotlib, Seaborn, SciKitLearn, Anaconda, Jupyter Notebooks, Git/GitHub
Core Competencies: Data Storytelling, Applied Statistics, Machine Learning, Natural Language Processing, Classification, Regression, Clustering, Time Series Analysis, Anomaly Detection
Hire Me Because
My Capstone Project: Steam Hours-Played Predictor
The Steam Hours-Played Predictor is a classification model that predicts how much time a player will invest on a given game hosted by Steam, a digital platform for distributing PC games. The purpose of this project was to find games that will be highly-played based on key features such as genre, publisher, developer, and initial price. This can help determine what video games will be worth investing time into for content creators, streamers, and gaming companies that provide entertainment online. The project resulted in a XGBoost model that detected highly-played games with 88% precision and 38% recall above the baseline model.