I love working with data! I am inherently curious and love problem solving. My education and industry experience has provided me the tools to drive better decision making with data.
Available Work Locations: San Antonio, TX
Core Technologies: Python, MySQL, Spark, Tableau, Pandas, NumPy, Matplotlib, Seaborn, SciKitLearn, Anaconda, Jupyter Notebooks, Git/GitHub
Core Competencies: Data Storytelling, Applied Statistics, Machine Learning, Natural Language Processing, Classification, Regression, Clustering, Time Series Analysis, Anomaly Detection
Hire Me Because
My Capstone Project: Houston, we have a salary: Predicting Salaries for Texas Government Employees
Utilizing machine learning techniques, find key drivers of salaries for current Texas government employees. The data was acquired from Texas Tribune’s website and cleaned as well as prepared for analysis and modeling using Pandas, Seaborn and Scipy Stats. Regression models were created using Scikit learn to predict salaries for current employees. The best performing model was the Polynomial Regression, which outperformed the baseline by 24% and had an R-squared of 38%.