Examining the dataset driving machine learning systems