Friday, January 31, 2020

Hands-on MKE(MapR Kubernetes Ecosystem ) 1.0 release

Goal:

MKE(MapR Kubernetes Ecosystem ) 1.0 has been released.
Basically it puts Spark and Drill into Kubernetes environment in this release.
Below is the architecture from the documentation Operators and Compute Spaces
This article shares the step-by-step commands used to install and configure a MKE 1.0 env.

Thursday, December 12, 2019

How to use nodeSelector to constrain POD csi-controller-kdf-0 to only be able to run on particular Node(s)

Goal:

This article explains how to use nodeSelector to constrain POD csi-controller-kdf-0 to only be able to run on particular Node(s).

How to create a MapR PACC using mapr-setup.sh to submit a Spark sample job

Goal:

This article shares the detailed steps on how to create a MapR Persistent Application Client Container(PACC) using mapr-setup.sh to submit a Spark sample job towards a secured MapR cluster.

Friday, December 6, 2019

Spark Streaming sample scala code for different sources

Goal:

This article shares some sample Spark Streaming scala code for different sources -- socket text, text files in MapR-FS directory, kafka broker and MapR Event Store for Apache Kafka(MapR Streams).
These are wordcount code which can be run directly from spark-shell.

Wednesday, December 4, 2019

How to mount a PersistentVolume for Static Provisioning using MapR CSI in GKE

Goal:

This article explains the detailed steps on how to mount a PersistentVolume for Static Provisioning using MapR Container Storage Interface(CSI) in Google Kubernetes Engine(GKE).

How to submit REST requests to a distributed Kafka Connect cluster

Goal:

This article shares the examples of curl commands to submit REST requests to a distributed Kafka Connect cluster.

Thursday, November 28, 2019

Understanding different modes in kafka-connect using an example

Goal:

This article is to help understand different modes in kafka-connect using an example.
The example will stream data from a mysql table to MapR Event Store for Apache Kafka(aka "MapR Streams") using different modes of kafka-connect -- incrementing, bulk, timestamp and timestamp+incrementing .

Wednesday, November 27, 2019

Friday, November 15, 2019

Popular Posts