Rethinking LLM Benchmarks: Measuring True Reasoning Beyond Training Data towardsdatascience.com Post date November 7, 2024 No Comments on Rethinking LLM Benchmarks: Measuring True Reasoning Beyond Training Data External Tags Apple, benchmark, GenAI, LLM, reasoning
A Benchmark and Taxonomy of Categorical Encoders medium.com Post date March 29, 2024 No Comments on A Benchmark and Taxonomy of Categorical Encoders External Tags benchmark, categorical-data, categorical-encoding, data-science, machine-learning
Optimizing Pandas Code: The Impact of Operation Sequence medium.com Post date March 18, 2024 No Comments on Optimizing Pandas Code: The Impact of Operation Sequence External Tags benchmark, data-science, pandas, performance, python
Benchmarking Pytest with CICD Using GitHub Action towardsdatascience.com Post date March 5, 2024 No Comments on Benchmarking Pytest with CICD Using GitHub Action External Tags benchmark, cicd-tools, github-actions, programming, pytest
Apple M3 Machine Learning Speed Test towardsdatascience.com Post date January 9, 2024 No Comments on Apple M3 Machine Learning Speed Test External Tags apple-silicon, artificial-intelligence, benchmark, macbook-pro, machine-learning
Benchmarking Rust Compiler Settings with Criterion towardsdatascience.com Post date December 15, 2023 No Comments on Benchmarking Rust Compiler Settings with Criterion External Tags benchmark, criterion, programming, rust, software-engineering
MLX vs MPS vs CUDA: a Benchmark towardsdatascience.com Post date December 15, 2023 No Comments on MLX vs MPS vs CUDA: a Benchmark External Tags Apple, benchmark, cuda, deep learning, pytorch
Temporal Graph Benchmark towardsdatascience.com Post date December 9, 2023 No Comments on Temporal Graph Benchmark External Tags benchmark, graph, machine-learning, temporal-graph, thoughts-and-theory
Please Use Streaming Workload to Benchmark Vector Databases towardsdatascience.com Post date December 1, 2023 No Comments on Please Use Streaming Workload to Benchmark Vector Databases External Tags benchmark, data-engineering, streaming, Vector Database, vector-search