This section describes more advanced statistical methods. This includes the discovery and exploration of complex multivariate relationships among variables. Links to appropriate graphical methods are also provided throughout. Basic statistics are described in the previous section.
It is difficult to order these topics in a straight-forward way. I have chosen the following (admittedly arbitrary) headings.
Under predictive models, we have generalized linear models (include logistic regression, poisson regression, and survival analysis), discriminant function analysis (both linear and quadratic), and time series modeling.
Latent Variable Models
Cluster Analysis includes partitioning (k-means), hierarchical agglomerative, and model based approaches. Tree-Based methods (which could easily have gone under predictive models!) include classification and regression trees, random forests, and other partitioning methodologies.
Try the Kaggle R Tutorial on Machine Learning which includes an exercise with Random Forests.