John Salas

Data Scientist

(210) 782-1788



Available Work Locations: San Antonio, TX

Core Technologies: Python, MySQL, Spark, Tableau, Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, Anaconda, Jupyter Notebooks, Git/GitHub

Core Competencies: Data Storytelling, Applied Statistics, Machine Learning, Natural Language Processing, Classification, Regression, Clustering, Time Series Analysis, Anomaly Detection

Hire Me Because

I excel at using all of the tools in the data science toolkit. However, my main talent is the exceptional insight I gain from the data I analyze.

My Capstone Project: Predicting Hard Drive Reliability

In a team of four, my project partners and I used data provided by Backblaze to identify the primary drivers of early hard drive failure. We developed an SVM classifier capable of predicting 76% of early-failing hard drives. Based on the insights gained from the data, we also showed that the drivers of early failure include data capacity, manufacturer, and model type.

“With technology, I am most passionate about the insights I can gain from a data set and the impact those insights will have.”