Data spending
Date created: 2022-09-20
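Data spending is the idea that you allocate your data deliberately, like money in a budget: each observation is "spent" on one purpose. If you use the same data for multiple purposes, for example both fitting a model and evaluating it, you risk accentuating bias or compounding the effects of methodological errors.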
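- Data leakage is what happens with improper data spending: information from data reserved for one purpose (e.g. evaluation) influences another (e.g. training).
- Data splitting is the most common strategy for spending data, e.g. a train/test split.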
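
A train/test split can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a reference implementation; the function name `split_once`, the 80/20 ratio, and the seed are assumptions chosen for the example.

```python
import random


def split_once(rows, test_fraction=0.2, seed=0):
    """Shuffle rows and split them into disjoint (train, test) lists.

    Hypothetical helper for illustration: every row lands in exactly
    one of the two lists, so it is "spent" only once.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(rows)                  # copy so the input is not mutated
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


rows = list(range(100))  # stand-in for 100 observations
train, test = split_once(rows, test_fraction=0.2, seed=42)

# The budget check: no observation is used for both purposes.
assert set(train).isdisjoint(test)
assert len(train) + len(test) == len(rows)
```

The test rows are set aside and used only for the final evaluation. Anything that looks at them earlier, such as fitting a preprocessing step on all rows before splitting, spends them twice.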