Stream Processing Using Apache Spark and Kafka

Thank you all for your overwhelming response to our “Stream Processing using Apache Spark and Apache Kafka session” in “Apache Spark Hands-On” series, which happened on June 15, 2016 8:00 pm IST

Key takeaways- 

+ Introduction to Apache Spark
+ Introduction to stream processing
+ Understanding RDD (Resilient Distributed Datasets)
+ Understanding Dstream
+ Kafka Introduction
+ Understanding Stream Processing flow
+ Real time hands-on using CloudxLab
+ Questions and Answers

Continue reading “Stream Processing Using Apache Spark and Kafka”

Apache Spark Introduction

Thank you all for your overwhelming response to our Apache Spark Introduction session in “Apache Spark Hands-On” series, which happened on April 28, 2016 8:00 pm IST

Presented By
Sandeep Giri

Sandeep Giri

Key takeaways for this webinar were

+ Introduction to Apache Spark
+ Introduction to RDD (Resilient Distributed Datasets)
+ Loading data into an RDD
+ RDD Operations – Transformation
+ RDD Operations – Actions
+ Hands-on demos using CloudxLab
+ Questions and Answers

Continue reading “Apache Spark Introduction”

CloudxLab Introduction

What is CloudxLab?

CloudxLab is a cloud based virtual lab for practicing Big Data (Hadoop, Spark etc), Machine Learning and Deep Learning technologies.

Origins

While training students on Big Data technologies at KnowBigData, we realized that our learners were facing a lot of trouble downloading and configuring virtual machines (VM) provided by major Hadoop vendors. Most often, these virtual machines were slow and would not allow for use of any other application on the same computer.

Moreover, working on a VM did not give a real world experience as one is still dealing with only one machine instead of a cluster of machines which is the whole idea of Big Data technologies which are primarily based on distributed computing.

This is how CloudxLab was conceptualized in an effort to resolve these pain points of learners. The video below will help understand how one of our clients – Simplilearn – is using CloudxLab to provide a better learning experience to their course takers.

Continue reading “CloudxLab Introduction”