Data Science Primer

Data Science Primer

Daniel D. Gutierrez, AMULET Analytics

Data science involves understanding and preparing the data, defining the statistical learning model, and following the Data Science Process. Statistical learning models can assume many shapes and sizes, depending on their complexity and the application for which they are designed. The first step is to un­derstand what questions you are trying to answer for your organization. The level of detail and com­plexity of your questions will increase as you be­come more comfortable with the data science process.

In this session, I will cover the most important steps in the data science process – a general formula followed by data scientists in striving to achieve best practices with a data science project: under­standing the goal of the project, data access, data munging, exploratory data analysis, feature engi­neering, model selection, model validation, data visualization, communicate the results and deploy the solution to production.

About the Speaker

Daniel D. Gutierrez is a practicing data scientist through his Santa Monica, Calif. consulting firm AMULET Analytics. Daniel also serves as Managing Editor for insideBIGDATA.com where he keeps a pulse on this dynamic industry.

He is also an educator and teaches classes in data science, machine learning and R for universities and large enterprises. Daniel holds a BS degree in mathematics and computer science from UCLA.