Himansu Sekhar – Medium

Himansu Sekhar

Published in
road to data engineering

Stream Data from Kinesis to Databricks with Pyspark

Streaming with AWS Kinesis and Databricks

Jan 5, 2021

Stream Data from Kinesis to Databricks with Pyspark

Jan 5, 2021

Published in
road to data engineering

Databricks Notebook Promotion using Azure DevOps

Productionize Databricks Notebooks

Jan 3, 2021

Databricks Notebook Promotion using Azure DevOps

Jan 3, 2021

Published in
road to data engineering

Spark Performance Optimization Series: #3. Shuffle

Apache Spark optimization techniques for better performance

Dec 29, 2020

Spark Performance Optimization Series: #3. Shuffle

Dec 29, 2020

Published in
road to data engineering

Spark Performance Optimization Series: #2. Spill

Apache Spark optimization techniques for better performance

Dec 28, 2020

Spark Performance Optimization Series: #2. Spill

Dec 28, 2020

Published in
road to data engineering

Spark Performance Optimization Series: #1. Skew

In Spark cluster data is typically read in as 128 MB partitions which ensures even distribution of data. However, as the data is…

Dec 27, 2020

Spark Performance Optimization Series: #1. Skew

Dec 27, 2020

Kubernetes Architecture,Hands On!

You need to learn Kubernetes right now!!!! But Why??? Well Kubernetes can deploy hundreds of containers with just one command and it is…

Nov 25, 2020

Kubernetes Architecture,Hands On!

Nov 25, 2020

Himansu Sekhar

Himansu Sekhar

Data Engineering | DevOps | DataOps | Distributed Computing

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech