Applied Computing Teaching Labs

Undergraduate teaching demos

These activities turn common data science mistakes into classroom experiments. Students change the setup, see the result, then explain what the evidence supports.

Lab 1

Model Evaluation and Data Leakage

Students compare a random split with a group held out split and decide which score supports a claim about new sites, cohorts, or sensors.

Random split score 0.86
Group held out score 0.54
Leakage risk High

Try this

Use the controls to make the gap between the random split score and the group held out score larger or smaller. Which score would you trust for a new site, and why?

Lab 2

Machine Learning

Students adjust model complexity and data quality, then explain why a model can look impressive in training but fail to generalize.

Training accuracy 91%
Test accuracy 78%
Overfit gap 13%

Try this

Increase model complexity and watch the training and test scores. At what point does the model start memorizing instead of learning a useful pattern?