Comparing Performance of Big Data File Formats: A Practical Guide towardsdatascience.com Post date January 17, 2024 No Comments on Comparing Performance of Big Data File Formats: A Practical Guide External Tags apache-spark, big-data, data storage, data-analysis, data-engineering
Delta Lake — Partitioning, Z-Order and Liquid Clustering towardsdatascience.com Post date November 8, 2023 No Comments on Delta Lake — Partitioning, Z-Order and Liquid Clustering External Tags apache-spark, big-data, data-engineering, delta-lake, programming
5 Lessons Learned from Testing Databricks SQL Serverless + DBT towardsdatascience.com Post date October 17, 2023 No Comments on 5 Lessons Learned from Testing Databricks SQL Serverless + DBT External Tags apache-spark, cloud computing, data-engineering, databricks, SQL
Memory Management in Apache Spark: Disk Spill medium.com Post date September 15, 2023 No Comments on Memory Management in Apache Spark: Disk Spill External Tags apache-spark, data-engineering, data-science
Distributed Llama 2 on CPUs towardsdatascience.com Post date August 2, 2023 No Comments on Distributed Llama 2 on CPUs External Tags apache-spark, generative-ai, llama 2, LLM, nlp