n = 2809, with data collected at ages 14-15 used to predict outcomes at ages 16-17. The data came from a longitudinal study in Australia.
The authors used several interesting techniques to deal with the unbalanced data and applied predictor selection to prune unnecessary variables.
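These notes do not record which techniques the authors actually used, so the sketch below is only a generic illustration of the two ideas, not their method. It assumes a binary outcome coded 0/1, uses random oversampling of the minority class as one common way to handle unbalanced data, and uses a simple correlation filter as a stand-in for predictor selection. The function names, the 0.1 threshold, and the toy data layout are all hypothetical.

```python
import random
from statistics import mean, pstdev


def oversample_minority(rows, labels, seed=0):
    """Randomly duplicate minority-class rows until both classes have equal size.

    Assumes labels are 0/1. This is one common rebalancing approach,
    not necessarily what the study's authors did.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    if not minority:
        # Nothing to resample from; return unchanged copies.
        return list(rows), list(labels)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    keep = list(range(len(labels))) + extra
    return [rows[i] for i in keep], [labels[i] for i in keep]


def abs_correlation(xs, ys):
    """Absolute Pearson correlation; 0.0 if either input is constant."""
    sx, sy = pstdev(xs), pstdev(ys)
    if sx == 0 or sy == 0:
        return 0.0
    mx, my = mean(xs), mean(ys)
    cov = mean((x - mx) * (y - my) for x, y in zip(xs, ys))
    return abs(cov / (sx * sy))


def select_predictors(rows, labels, names, threshold=0.1):
    """Keep predictors whose |correlation| with the label reaches the threshold.

    The threshold is an arbitrary illustrative value.
    """
    kept = []
    for j, name in enumerate(names):
        column = [row[j] for row in rows]
        if abs_correlation(column, labels) >= threshold:
            kept.append(name)
    return kept
```

In practice both steps would normally be fitted on the training split only, so that duplicated rows and the selected predictor set do not leak information into the evaluation data.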