My name is Nik and I created this blog as a way to chronicle my cross-career journey (descent?) into data science and Big Data. One of the biggest questions when I started was “Where do I even begin?” I figure there’s others like me out there, so I’ve compiled this list of resources to help us out. Let me know if you know of any others!
101 – Start Here
- An Introduction to Data Science by Jeffrey Stanton, Syracuse University
- Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
- Harvard University Extension School’s Data Science Course
- Statistics in a Nutshell (In a Nutshell (O’Reilly))
201 – Wax On / Wax Off
- Mining of Massive Data Sets by Anand Rajaraman and Jeff Ullman, Stanford University
- Elements of Statistical Learning by Hastie, Tibshirani, Friedman, Stanford University
- Learn R & Become a Data Analyst (Datacamp Course)
301 – Life Lessons
- Kaggle Competitions
- Building Machine Learning Systems with Python
- Analyzing Big Data With Twitter (Berkeley Course)
- Natural Language Processing with Python
401 – Semi-Finals
- NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence
- Intro to Hadoop and Mapreduce
- Programming Collective Intelligence: Building Smart Web 2.0 Applications
- How You Should Go About Learning NoSQL
501 – Sweep the Leg