The following links point to a set of tutorials on many aspects of statistical data mining, including the foundations of probability, the foundations of statistical data analysis, and most of the classic machine learning and data mining algorithms.

These include classification algorithms such as decision trees, neural nets, Bayesian classifiers, Support Vector Machines and cased-based (aka non-parametric) learning. They include regression algorithms such as multivariate polynomial regression, MARS, Locally Weighted Regression, GMDH and neural nets. And they include other data mining operations such as clustering (mixture models, k-means and hierarchical), Bayesian networks and Reinforcement Learning.

- Decision Trees
- Information Gain
- Probability for Data Miners
- Probability Density Functions
- Gaussians
- Maximum Likelihood Estimation
- Gaussian Bayes Classifiers
- Cross-Validation
- Neural Networks
- Instance-based learning (aka Case-based or Memory-based or non-parametric)
- Eight Regression Algorithms
- Predicting Real-valued Outputs: An introduction to regression
- Bayesian Networks
- Inference in Bayesian Networks (by Scott Davies and Andrew Moore)
- Learning Bayesian Networks
- A Short Intro to Naive Bayesian Classifiers
- Short Overview of Bayes Nets
- Gaussian Mixture Models
- K-means and Hierarchical Clustering
- Hidden Markov Models
- VC dimension
- Support Vector Machines
- PAC Learning
- Markov Decision Processes
- Reinforcement Learning
- Biosurveillance: An example
- Elementary probability and Naive Bayes classifiers
- Spatial Surveillance
- Time Series Methods
- Game Tree Search Algorithms, including Alpha-Beta Search
- Zero-Sum Game Theory
- Non-zero-sum Game Theory
- Introductory overview of time-series-based anomaly detection algorithms
- AI Class introduction
- Search Algorithms
- A-star Heuristic Search
- Constraint Satisfaction Algorithms, with applications in Computer Vision and Scheduling
- Robot Motion Planning
- HillClimbing, Simulated Annealing and Genetic Algorithms