Enrollments closing soon for Post Graduate Certificate Program in Applied Data Science & AI By IIT Roorkee | 3 Seats Left

  Apply Now
  • Topic
    12 Concepts | 46 Questions | 4 Assessments | 9,671 Learners

    Learn Big Data with Hadoop from the industry experts.

    Instructor: Sandeep Giri
  • Topic
    12 Concepts | 15 Questions | 9,258 Learners

    In this chapter, we learn the basics of Big Data which include various concepts, use-cases and understanding of the eco-system.

    This chapter doesn't require any knowledge of programming or technology. We believe it is very useful for every to learn the basics of Big Data. So, jump in!

    Happy Learning!

    Instructor: Sandeep Giri
  • I

    Topic
    7 Concepts | 15 Questions | 8,409 Learners

    As everyone knows, Big Data is a term of fascination in the present-day era of computing. It is in high demand in today’s IT industry and is believed to revolutionize technical solutions like never before.

    Upon learning the big data concepts, we will get a vivid picture of the need for clusters of machines (distributed systems), and appreciate the use of this architecture in solving critical problems associated with storing and processing humungous data. In addition, we will get an idea of system design concepts, which aid us in designing scalable and resilient systems - the most desirable kind of …

    Instructor: Cloudxlab
  • Topic
    16 Concepts | 72 Questions | 1 Assessment | 8,229 Learners

    Learn Big Data with Apache from the Industry Experts.

    Instructor: Sandeep Giri
  • Topic
    30 Concepts | 5 Questions | 18 Assessments | 7,738 Learners

    Welcome to a course in Scala Foundations.

    As part of this course, you will learn how to write programs using Scala.

    Scala is a programming language like Java or Python. The syntax is much like Python while under the hood it compiles to Java. It also comes with both an interactive interpreter and a compiler.

    Further, Scala is designed for scalable computing where the code could be sent to data and codes is also run in parallel.

    Scala is used in the enterprise world and has gain a lot of traction.

    Instructor: Sandeep Giri
  • Topic
    7 Concepts | 6,160 Learners

    Learn to code in Java from the Industry Experts.

    Instructor: Sandeep Giri
  • Topic
    1 Concept | 8 Questions | 1 Assessment | 6,037 Learners

    Learn how to do sentiment analysis in Hive from the industry experts.

    Instructor: Sandeep Giri
  • M

    Topic
    4 Concepts | 5 Questions | 23 Assessments | 5,841 Learners

    This chapter covers different NumPy constructs and functions along with Overview of Pandas, Matplotlib and Linear Algebra which is normally used in Machine Learning projects

    Instructor: Sandeep Giri
  • Topic
    9 Concepts | 5,838 Learners

    Learn to filter, sort, multiple reducers in MapReduce.

    Instructor: Sandeep Giri
  • Topic
    5 Concepts | 6 Assessments | 5,292 Learners

    Welcome to this project on Churning the Emails Inbox with Python. In this project, you will use Python to access the data from files and process it to achieve certain tasks. You will explore the MBox email dataset, and use Python to count lines, headers, subject lines by emails and domains. Know your way on how to work with data in Python.

    Skills you will develop:

    1. Python
    2. File Handling in Python
    Instructor: Abhinav Singh
  • Topic
    12 Concepts | 8 Questions | 4,787 Learners

    Spark Streaming is an extension of the core Spark API that enables scalable, high-throughput, fault-tolerant processing of live data streams. Learn Spark Streaming from the industry experts.

    Instructor: Sandeep Giri
  • Topic
    6 Concepts | 3 Questions | 4 Assessments | 4,692 Learners

    MapReduce is Framework as well as a paradigm of computing. By the way of map-reduce, we are able to break-down complex computation into distributed computing.

    As part of this chapter, we are going to learning how to build MapReduce programmes using Java.

    Please make sure you work along with the course instead of just sitting back and watching.

    Happy Learning!

    Instructor: Sandeep Giri
  • Topic
    8 Concepts | 4,493 Learners

    Learn to load and save data using Spark, compression, and how to handle various file formats using Spark from the industry experts.

    Instructor: Sandeep Giri
  • Topic
    1 Concept | 4 Questions | 4,402 Learners

    Whenever you make a request to a web server for a page, it records it in a file which is called logs.

    The logs of a webserver are the gold mines for gaining insights in the user behaviour. Every data scientists usually look at the logs first to understand the behaviour of the users. But since the logs are humongous in size, it takes a distributed framework like Hadoop or Spark to process it.

    As part of this project, you will learn to parse the text data stored in logs of a web server using the Apache Spark.

  • Topic
    6 Concepts | 9 Questions | 4,306 Learners

    Learn NoSQL, complex objects and relations from the industry experts.

    Instructor: Sandeep Giri