PySpark Explained: User-Defined Functions medium.com Post date July 15, 2024 External Tags data-engineering, data-science, pyspark, spark, tips-and-tricks
Top 30 PySpark Interview Questions and Answers feeds.feedburner.com Post date July 10, 2024 External Tags blockchain, CSV, Dataframe, dataframes, functions, Interview Questions, Lambda, pyspark, python, RDD, spark, SQL
Optimizing Sigma Rules in Spark with the Aho-Corasick Algorithm towardsdatascience.com Post date June 20, 2024 External Tags cybersecurity, sigma, spark, spark-streaming, tips-and-tricks
Performance Insights from Sigma Rule Detections in Spark Streaming medium.com Post date June 1, 2024 External Tags cybersecurity, data-science, spark, streaming
Performant IPv4 Range Spark Joins towardsdatascience.com Post date January 25, 2024 External Tags cybersecurity, data-engineering, hands-on-tutorials, spark, SQL
Unleashing the Power of SQL Analytical Window Functions: A Deep Dive into Fusing IPv4 Blocks towardsdatascience.com Post date January 10, 2024 External Tags analytics, cybersecurity, hands-on-tutorials, spark, SQL
Spark vs Presto: A Comprehensive Comparison feeds.feedburner.com Post date December 28, 2023 External Tags ai, big-data, data analytics, data-science, dataset, Datasets, Presto, spark, Use Cases
Parallelising Python on Spark: Options for concurrency with Pandas towardsdatascience.com Post date November 18, 2023 External Tags databricks, machine-learning, parallel-computing, python, spark
Optimizing Output File Size in Apache Spark medium.com Post date August 11, 2023 External Tags big-data, data-science, optimization, spark
Mr. Pavan’s Data Engineering Journey Drives Business Success feeds.feedburner.com Post date June 24, 2023 External Tags career in data science, career transition, career-advice, data engineering career, data-engineering, Hadoop, Microsoft, spark, Success Story