Learn from industry experts how to load and save data with Spark, apply compression, and handle various file formats.
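To make this concrete, here is a minimal PySpark sketch of loading and saving data in a couple of formats with compression; the file paths are placeholders, not part of the course material.

```python
# A minimal PySpark sketch of loading and saving data in different
# formats with compression; the file paths are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("load-save-demo").getOrCreate()

# Read a CSV file with a header row, letting Spark infer column types.
df = spark.read.csv("data/input.csv", header=True, inferSchema=True)

# Save the same data as JSON, compressed with gzip.
df.write.mode("overwrite").option("compression", "gzip").json("out/json")

# Save as Parquet, a columnar format compressed with snappy by default.
df.write.mode("overwrite").parquet("out/parquet")

# Read the Parquet data back; the schema is stored with the files.
parquet_df = spark.read.parquet("out/parquet")
parquet_df.show(5)
```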
Whenever you request a page from a web server, the server records that request in a file called a log.
The logs of a web server are a gold mine for insights into user behaviour. Data scientists usually look at the logs first to understand how users behave. But because logs are humongous in size, processing them takes a distributed framework like Hadoop or Spark.
As part of this project, you will learn to parse the text data stored in the logs of a web server using Apache Spark.
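As a taste of what the project covers, here is a minimal PySpark sketch of parsing access logs; the log path and the Common Log Format regex are assumptions about how the server writes its logs.

```python
# A minimal PySpark sketch of parsing web server access logs; the log
# path and the Common Log Format regex below are assumptions.
import re
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("log-parser").getOrCreate()

# One regex group per field of the Common Log Format:
# host, identity, user, timestamp, request, status, size.
LOG_PATTERN = re.compile(
    r'^(\S+) (\S+) (\S+) \[([^\]]+)\] "([^"]*)" (\d{3}) (\S+)')

def parse_line(line):
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    host, _, _, timestamp, request, status, size = match.groups()
    return (host, timestamp, request, int(status))

logs = spark.sparkContext.textFile("hdfs:///data/access.log")
parsed = logs.map(parse_line).filter(lambda rec: rec is not None)

# Count requests per HTTP status code, a typical first question to ask.
status_counts = parsed.map(lambda rec: (rec[3], 1)) \
                      .reduceByKey(lambda a, b: a + b)
print(status_counts.collect())
```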
In this project, we will learn how to build a real-time analytics dashboard using Apache Spark Streaming, Kafka, Node.js, Socket.IO, and Highcharts.
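The Spark side of such a pipeline can be sketched with Structured Streaming reading from Kafka; the broker address and topic name below are assumptions, and the job needs the spark-sql-kafka-0-10 package on its classpath. The Node.js, Socket.IO, and Highcharts layers, which consume the aggregated counts, are omitted here.

```python
# A minimal sketch of the Spark side of the dashboard pipeline, using
# Structured Streaming to read from Kafka; the broker address and the
# topic name "page_views" are assumptions for illustration.
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, window

spark = SparkSession.builder.appName("dashboard-feed").getOrCreate()

# Subscribe to the Kafka topic the web application writes events to.
events = (spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "localhost:9092")
          .option("subscribe", "page_views")
          .load())

# Count events per 10-second window; a downstream process would push
# these counts to the Node.js/Socket.IO layer for the Highcharts UI.
counts = (events
          .select(col("timestamp"))
          .groupBy(window(col("timestamp"), "10 seconds"))
          .count())

# Write the running counts to the console for demonstration purposes.
query = (counts.writeStream
         .outputMode("complete")
         .format("console")
         .start())
query.awaitTermination()
```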
There are many big data solution stacks.
The first and most powerful stack is Apache Hadoop and Spark together: Hadoop provides storage for structured and unstructured data, and Spark provides the computational capability on top of it (see the sketch after this comparison).
The second option is to use a NoSQL database such as Cassandra or MongoDB. The third is to use a cloud platform such as Google Compute Engine or Microsoft Azure; in that case you would have to upload your data to Google or Microsoft, which may not always be acceptable to your organization.
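To make the first stack concrete, here is a minimal sketch of Spark computing over data stored in HDFS; the HDFS path is a placeholder.

```python
# A minimal sketch of the Hadoop + Spark stack: HDFS stores the data
# and Spark computes over it; the HDFS path is a placeholder.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("hdfs-demo").getOrCreate()

# Read a text file directly from HDFS and run a distributed word count.
lines = spark.sparkContext.textFile("hdfs:///data/corpus.txt")
word_counts = (lines.flatMap(lambda line: line.split())
               .map(lambda word: (word, 1))
               .reduceByKey(lambda a, b: a + b))
print(word_counts.take(10))
```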
In this post, we will understand the basics of: