Spark RDD to Task mapping
Understanding how RDD are converted to Task
Understanding how RDD are converted to Task
How to make schema compatibility check part of your build time.
How to inter-convert avro schema to class programatically.
How to define Avro Schema with Default values
Avro serialization de-serialization using Confluent Schema registry
Where is Big Data
used and how has the architecture of systems evolved in the light of Big Data
Whats the difference between reduceByKey and CombineByKey in Spark
Whats the difference between groupByKey vs ReduceByKey in Spark
Use combine by key and use map transformation to find Max value for all the keys in Spark
Group by key and in the Map Partition apply some custom logic on the aggregated values of the key in Spark