Machine Learning Bootcamp
Empower your team with state-of-the-art skills to discover hidden patterns in your data
-
An innovative curriculum provides your team with state-of-the-art machine learning skills actually used in practice
-
Build hands-on machine learning skills via in person classroom or live virtual experience
-
Jumpstart your team's advanced analytics journey using Python - no previous Python experience required
Answering the call for advanced analytics
Is machine learning shaping the future of your organization?
While data has always been used in business, things have changed. Functions like HR, Product Management, Customer Service, etc., are embracing advanced analytics to drive better business outcomes.
Do you want your team to be a part of this data-driven future?
It’s hard to avoid all the social media posts, magazine articles, or news clips trumpeting how machine learning is permanently changing the way organizations operate – and changing the expectations of organizations.
Machine learning for ANY team - regardless of role/background
Imagine a team of Product Managers that could answer the following question with data, "What feature usage(s) are highly predictive of a sticky customer?" How much value would they bring to their organization?
Training from a top-rated instructor
Machine Learning Bootcamp
The Machine Learning Bootcamp empowers your team with skills like random forests and k-means clustering to discover new insights.
If your team is new to Python, Dave’s free Python Crash Course will teach them everything they need to know before the bootcamp.
This training focuses on a practical subset of machine learning skills, so your team can hit the ground running and deliver insights ASAP.
The bootcamp is often bundled with additional courses (see below) to increase your team’s capabilities.
The outcome?
Your team will have the knowledge and hands-on skills to use machine learning to find hidden patterns in your data, including crafting predictive models and performing cluster analyses.
-
A well-defined set of skills for real-world machine learning insights
-
Your team will build real-world skills via 12 hands-on labs
-
Courses can be taught using Python in Excel if that works better for your team
-
Courses can be taught in-person or virtually. Choose what works best.
-
Taught by Dave Langer, globally recognized data science instructor
-
Bundle additional courses to expand your teams capabilities
What professionals have to say
Bring the same high-quality training experiences of national conferences to your team.
Training delivered in person or live virtual - whichever works best.
“Very good course as an intro to machine learning. I feel that with what I learned today I can put these skills into practice at work.”
— David Green, EMWD
“Fantastic intro ML course that’s presented in an engaging way. The content was easy to understand and the labs were easy to follow along. I’ve left the course wanting to dive deeper into the topic.”
— Jessica Liu, O-I Glass
“MIND BLOWN…not by the difficulty of the class, but by how EASY Dave makes machine learning within the reach of aspiring Data Scientists.
Easily the highlight of this year’s conference for me. I feel empowered to bring this material back to the job, put it to use, and teach it to others.”
— Chet Phelps, Health Solutions
“Best training and instructor I’ve had. Organized, clear, good pace, helpful examples, and an engaging and fun instructor.”
— Alex Kurtz, Sourceability
“I am so glad to have started the conference in Dave’s class. He set a wonderful tone for what is yet to come. I hope my other courses measure up!”
— Christina Mitchell, Naphcare
“Great class! Engaging instructor. Wish I would have had more time this week to attend his other sessions.”
— Matthew Royalt, Southern Star Central Gas Pipeline
Machine Learning Bootcamp Curriculum
3 days. 12 hands-on labs.
Bootcamp can be taught with Python in Excel.
-
What is Machine Learning?
Data Analyst, Teacher
Why Decision Trees?
-
Course Datasets
Exploratory Data Analysis (EDA)
Hands-on Lab #1
-
Classification Tree Intuition
Overfitting Intuition
Gini Impurity
Split Quality
Splitting Categorical Data
Splitting Numeric Data
Classification Trees with Python
Hands-On Lab #2
-
Under/Overfitting
The Bias-Variance Tradeoff
Supervising the Data
Model Tuning
Classification Tree Pruning
Measuring Awesomeness
Splitting the Dataset
Modeling Tuning with Python
Hands-On Lab #3
-
Feature Engineering Intuition
Data Leakage
Decision Boundaries
Engineering Numeric Features
Engineering Categorical Features
Engineering Date-Time Features
Missing Data
Hands-On Lab #4
-
Regression Tree Fundamentals
Numeric Feature MSE
Categorical Feature MSE
Feature Evaluation
Tuning Regression Trees
Imputation with Regression Trees
Hands-On Lab #5
-
Bad, Tree! Bad!
Ensembles
Bagging
Feature Randomization
Random Forests with Python
Hands-On Lab #6
-
Feature Importance
Tuning Random Forests
Model Testing
-
Additional Resources
Wanna Kaggle?
-
The following is the 3-day curriculum. The curriculum can be expanded by bundling additional courses (see below).
Dave’s free Python Crash Course is available for teams new to Python.
Introduction to Machine Learning - Days 1 & 2
-
Course Expectations
What is Cluster Analysis?
Cluster Analysis Use Cases
The Challenge of Clustering Data
-
The Iris Dataset
The Hand-Written Digits Dataset
The Heart Dataset
-
Hierarchical, Partitional, and Overlapping Clustering
Prototype Clusters
Density-Based Clusters
-
Introducing K-Means
The K-Means Algorithm
Euclidian Distance
The Problem with Outliers
Data Standardization
K-Means Caveats
Hands-On Lab #1
-
Evaluating Clusters
Cluster Cohesion
Evaluating Cohesion with the Elbow Method
The Silhouette Coefficient
Evaluating Clusters using the Silhouette Score
Hands-on Lab #2
-
Introducing DBSCAN
The DBSCAN Algorithm
DBSCAN Caveats
-
Considerations for Optimizing DBSCAN
Calculating min_samples
Choosing the eps Value
Introducing Nearest Neighbors
Evaluating eps with the Elbow Method
DBSCAN vs K-Means
Hands-On Lab #3
-
Introducing Dimensionality Reduction
Principal Component Analysis (PCA)
PCA Concepts
Hands-On Lab #4
-
The Problem with Categories
Encoding Categorical Data
Factor Analysis of Mixed Data (FAMD)
-
Supervised Learning Resources
Cluster Analysis Resources
-
Cluster Analysis - Day 3
Course Add-Ons
Expand your team’s capabilities by bundling additional courses into your bootcamp.
All courses can be taught using Python in Excel.
-
Visual Data Analysis
This 1-day hands-on course teaches how to use data visualizations the way Data Analysts/Scientists do - to get to the “why” of what’s happening. This course focuses on topics useful to any team, including Distribution Analysis, Correlation Analysis, Multivariate Analysis, and Time Series Analysis..
-
Data Wrangling for Machine Learning
This 1-day hands-on course focuses on techniques for producing the best quality data for use in crafting valuable machine learning models. Topics include data profiling, wrangling string data, and engineering date-time features. This course expands upon the topics covered in the Introduction to Machine Learning course.
-
Text Analytics
This 1-day hands-on course is an introduction to the tools an techniques of transforming text data into a form suitable for analytics. Examples include clustering documents and sentiment analysis. Topics include tokenization, stemming, lemmatization, TF-IDF, and cosine similarity.
FAQs
-
Yes! The bootcamp can be delivered virtually or in-person with your team.
-
While the courses do include mathematics, it is at a level accessible to a broad audience. For example, no knowledge of calculus or statistics is required.
-
All the courses use Python as the programming language. Dave's free Python Crash Course is available for teams new to Python.
-
Everything taught in the bootcamp is 100% compatible with Python in Excel. Courses can be taught using Python in Excel for the hands-on labs.
-
The Introduction to Machine Learning online course is available and covers the first two days of the bootcamp.
At this time, there is no online course for cluster analysis.