1)

Indicate whether we would generally expect the performance of a flexible statistical learning method to be better or worse than an inflexible method for the following scenarios

2)

Explain for each scenario below whether it is a classification or regression problem, whether we’re interested in inference or prediction, and what \(n\) and \(n\) are.

3)

Sketch typical (squared) bias, variance, training error, test error, and Bayes (or irreducible) error curves on a single plot, as we go from less flexible statistical learning methods towards more flexible approaches.

The x-axis is the amount of flexibility in the model, and the y-axis represents the values for each curve.

As flexibility of a model increases:

4)

5)

What are the advantages and disadvantages of a very flexible approach? When is flexibility preferred over infexibility and vice versa?

A flexible approach may be closer to the ‘true’ model and provide a better fit to the data and provide a more accurate prediction. However if too flexible a model is used it may overfit the data. It’s predictive powers are then reduced as its variance increases.