I am eager to apply my newly acquired skill set across the full data science pipeline. With 10 years’ experience in government analytics, I bring a well-rounded perspective on a project’s high-level impact on an organization, as well as a strong grasp of the technical details. I am also an effective communicator who delivers actionable insights at project completion.
Available Work Locations: San Antonio, TX
Core Technologies: Python, MySQL, Spark, Tableau, Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, Anaconda, Jupyter Notebooks, Git/GitHub
Core Competencies: Data Storytelling, Applied Statistics, Machine Learning, Natural Language Processing, Classification, Regression, Clustering, Time Series Analysis, Anomaly Detection
Hire Me Because
My Capstone Project: Predicting data center hard drive failures using Spark and Python
The goal of this project was to classify hard drive models as having high or low reliability. Working in a team of four, my project partners and I used data provided by Backblaze to identify the primary drivers of early failure in the hard drives that Backblaze uses to store customer data in its data center. Together we developed a model to predict early failures using the SMART stats features found in the data. We then used our findings to make clear recommendations on hard drive reliability based on a given drive’s model type, manufacturer, and other criteria. Our deliverables included presentation slides, an analysis notebook, and a reliability index of hard drive models.