KL Divergence: The Information Theory Metric that Revolutionized Machine Learning