Key Points

Introduction

Machine learning is a set of tools and techniques that use data to make predictions.
Artificial intelligence is a broader term that refers to making computers show human-like intelligence.
Deep learning is a subset of machine learning.
All machine learning systems have limitations to be aware of.

Scikit-Learn is a Python library with lots of useful machine learning functions.
Scikit-Learn includes a linear regression function.
Scikit-Learn can perform polynomial regressions to model non-linear data.

Classifiers built with supervised techniques require labelled data to train on.
We usually split our data into training and test sets with 80% of the data used for training and 20% used to test it.
We must be careful that our training and test sets have similar characteristics.
Decision trees are a simple method of classification that can work well on simpler data.
Hyper parameters are parameters which affect the behaviour of training a model.
Tree depth in a decision tree is an example of a hyper parameter.
Support vector machines work well on more complex data with non-linear boundaries.
Support vector machines use a “kernel trick” to transition data into a higher dimensional space.

Clustering is a form of unsupervised learning.
Unsupervised learning algorithms don’t need training.
Kmeans is a popular clustering algorithm.
Kmeans is less useful when one cluster exists within another, such as concentric circles.
Spectral clustering can overcome some of the limitations of Kmeans.
Spectral clustering is much slower than Kmeans.
Scikit-Learn has functions to create example data.

PCA is a linear dimensionality reduction technique for tabular data
t-SNE is another dimensionality reduction technique for tabular data that is more general than PCA

Perceptrons are artificial neurons which build neural networks.
A perceptron takes multiple inputs, multiplies each by a weight value and sums the weighted inputs. It then applies an activation function to the sum.
A single perceptron can solve simple functions which are linearly separable.
Multiple perceptrons can be combined to form a neural network which can solve functions that aren’t linearly separable.
We can train a whole neural network with the back propagation algorithm. Scikit-learn includes an implementation of this algorithm.
Training a neural network requires some training data to show the network examples of what to learn.
To ensure the whole dataset can be used in training and testing we can train multiple times with different subsets of the data acting as training/testing data. This is called cross validation.
Deep learning neural networks are a very powerful modern machine learning technique. Scikit-Learn does not support these but other libraries like Tensorflow do.
Several companies now offer cloud APIs where we can train neural networks on powerful computers.

This course has only touched on a few areas of machine learning and is designed to teach you just enough to do something useful.
Machine learning is a rapidly evolving field and new tools and techniques are constantly appearing.