Application Deadline 8 Jan

Application Deadline 15th Dec

12 months






IIT Roorkee


About the Course

This Executive PG Certificate Program in Data Science is an online course. This course covers some of the most trending and latest technologies in the market. The cutting edge content provided through this course will help you launch a career in the field of Data Science

Additionally, this course comes with our cloud lab access to gain the much needed hands-on experience to solve the real-world problems.

Upon successfully completing the course, you will get the certificate from IIT Roorkee which you can use for progressing in your career and finding better opportunities.

Program Highlights

Executive PG Certificate from IIT Roorkee

Executive PG Certificate from IIT Roorkee

Certificate of Completion by IIT Roorkee

1 Week Immersion Program

1 Week Immersion Program

A On-Campus immersion Program at IIT-Roorkee

Placement Eligibility Test

Placement Eligibility Test

Proctored Exams with opportunity to get Placed

Hands-On Project

18+ Hands-On Project

Work on real world projects to get an hands-on experience

Timely Doubt Resolution

Timely Doubt Resolution

Get access to community of learners via our discussion forum

Access to Cloud Lab

Access to Cloud Lab

Lab comes pre-installed with all the software you will need to learn and practice.

Application Deadline 08th Jan'22


What is the certificate like?

  • Why IIT Roorkee?

    IIT Roorkee is ranked first among all the IITs AND 20th position globally in citations per faculty. Established in 1847, it's one of the oldest technical institutions in Asia. IIT Roorkee fosters a very strong entrepreneurial culture. Some of their alumni are highly successful as entrepreneurs in the new age digital economy.

  • Why CloudxLab?

    CloudxLab is a team of developers, engineers, and educators passionate about building innovative products to make learning fun, engaging, and for life. We are a highly motivated team who build fresh and lasting learning experiences for our users. Powered by our innovation processes, we provide a gamified environment where learning is fun and constructive. From creative design to intuitive apps we create a seamless learning experience for our users. We upskill engineers in deep tech - make them employable & future-ready.



Among the IITs in the ‘Citations per Faculty’ parameter

*QS World Rankings

India Today


Ranked Engineering College

*India Today 2020



Ranked for IITs

*NIRF 2020



Ranked Best Global Universities in India

*QS World Rankings

Hands-on Learning

hands-on lab

  • Gamified Learning Platform
    Making learning fun and sustainable

  • Auto-assessment Tests
    Learn by writing code and executing it on lab

  • No Installation Required
    Lab comes pre-installed softwares and accessible everywhere

Mentors / Faculty

Instructor Raksha Sharma

Raksha Sharma

Faculty CSE Dept

IIT Roorkee

Instructor Sanjeev Manhas

Sanjeev Manhas

Faculty ECE Dept

IIT Roorkee

Mentor Venkat Karun

Venkat Karun

Staff Software Engineer


Dr. M.L. Virdi

Dr. M.L. Virdi

Senior Research Scientist


Instructor Sandeep Giri

Sandeep Giri

Founder at CloudxLab

Past: Amazon, InMobi, D.E.Shaw

Instructor Abhinav Singh

Abhinav Singh

Co-Founder at CloudxLab

Past: Byjus

Instructor Praveen

Praveen Pavithran

Co-Founder at Yatis

Past: YourCabs, Cypress Semiconductor


Foundation Courses

1. Programming Tools and Foundational Concepts
1. Linux for Data Science
2. Getting Started with Git
3. Python Foundations
4. Machine Learning Prerequisites(Including Numpy, Pandas and Linear Algebra)
5. Getting Started with SQL
6. Statistics Foundations

Course on Machine Learning

1. Machine Learning Applications & Landscape
1. Introduction to Machine Learning
2. Machine Learning Application
3. Introduction to AI
4. Different types of Machine Learning - Supervised, Unsupervised
2. Building end-to-end Machine Learning Project
1. Machine Learning Projects Checklist
2. Get the data
3. Launch, monitor, and maintain the system
4. Explore the data to gain insights
5. Prepare the data for Machine Learning algorithms
6. Explore many different models and short-list the best ones
7. Fine-tune model
3. Classification
1. Training a Binary classification
2. Multiclass,Multilabel and Multioutput Classification
3. Performance Measures
4. Confusion Matrix
5. Precision and Recall
6. Precision/Recall Tradeoff
7. The ROC Curve
4. Training Models
1. Linear Regression
2. Gradient Descent
3. Polynomial Regression
4. Learning Curves
5. Regularized Linear Models
6. Logistic Regression
5. Support Vector Machines
1. Linear SVM Classification
2. Nonlinear SVM Classification
3. SVM Regression
6. Decision Trees
1. Training and Visualizing a Decision Tree
2. Making Predictions
3. Estimating Class Probabilities
4. The CART Training Algorithm
5. Gini Impurity or Entropy
6. Regularization Hyperparameters
7. Instability
7. Ensemble Learning and Random Forests
1. Voting Classifiers
2. Bagging and Pasting
3. Random Patches and Random Subspaces
4. Random Forests
5. Boosting and Stacking
8. Dimensionality Reduction
1. The Curse of Dimensionality
2. Main Approaches for Dimensionality Reduction
3. PCA
4. Kernel PCA
5. LLE
6. Other Dimensionality Reduction Techniques

Course on Business Analytics

Analytics Tools
1. PowerBI
2. Tableau
3. Market Basket Analysis
4. RFM segementation

Course on Big Data with Hadoop and Spark

1. Introduction to Hadoop
1. Introduction
2. Distributed systems
3. Big Data Use Cases
4. Various Solutions
5. Overview of Hadoop Ecosystem
6. Spark Ecosystem Walkthrough
2. Foundation & Environment
1. Understanding the CloudxLab
2. Getting Started - Hands on
3. Hadoop & Spark Hands-on
4. Understanding Regular Expressions
5. Setting up VM
3. Zookeeper
1. ZooKeeper - Race Condition
2. ZooKeeper - Deadlock
3. How does election happen - Paxos Algorithm?
4. Use cases
5. When not to use
1. Why HDFS?
2. NameNode & DataNodes
3. Advance HDFS Concepts (HA, Federation)
4. Hands-on with HDFS (Upload, Download, SetRep)
5. Data Locality (Rack Awareness)
1. Why YARN?
2. Evolution from MapReduce 1.0
3. Resource Management: YARN Architecture
4. Advance Concepts - Speculative Execution
6. MapReduce Basics
1. Understanding Sorting
2. MapReduce - Overview
3. Word Frequency Problem - Without MR
4. Only Mapper - Image Resizing
5. Temperature Problem
6. Multiple Reducer
7. Java MapReduce
7. MapReduce Advanced
1. Writing MapReduce Code Using Java
2. Apache Ant
3. Concept - Associative & Commutative
4. Combiner
5. Hadoop Streaming
6. Adv. Problem Solving - Anagrams
7. Adv. Problem Solving - Same DNA
8. Adv. Problem Solving - Similar DNA
9. Joins - Voting
10. Limitations of MapReduce
8. Analyzing Data with Pig
1. Pig - Introduction
2. Pig - Modes
3. Example - NYSE Stock Exchange
4. Concept - Lazy Evaluation
9. Processing Data with Hive
1. Hive - Introduction
2. Hive - Data Types
3. Loading Data in Hive (Tables)
4. Movielens Data Processing
5. Connecting Tableau and HiveServer 2
6. Connecting Microsoft Excel and HiveServer 2
7. Project: Sentiment Analyses of Twitter Data
8. Advanced - Partition Tables
9. Understanding HCatalog & Impal
10. NoSQL and HBase
1. NoSQL - Scaling Out / Up
2. ACID Properties and RDBMS Story
3. CAP Theorem
4. HBase Architecture - Region Servers etc
5. Hbase Data Model - Column Family Orientedness
6. Getting Started - Create table, Adding Data
7. Adv Example - Google Links Storage
8. Concept - Bloom Filter
9. Comparison of NOSQL Databases
11. Importing Data with Sqoop and Flume, Oozie
1. Sqoop - Introduction
2. Sqoop Import - MySQL to HDFS
3. Exporting to MySQL from HDFS
4. Concept - Unbounding Dataset Processing or Stream Processing
5. Flume Overview: Agents - Source, Sink, Channel
6. Data from Local network service into HDFS
7. Example - Extracting Twitter Data
8. Example - Creating workflow with Oozier
12. Introduction to Spark
1. Apache Spark ecosystem walkthrough
2. Spark Introduction - Why Spark?
13. Scala Basics
1. Introduction, Access Scala on CloudxLab
2. Variables and Methods
3. Interactive, Compilation, SBT
4. Types, Variables & Values
5. Functions
6. Collections
7. Classes
8. Parameters
14. Spark Basics
1. Apache Spark ecosystem
2. Why Spark?
3. Using the Spark Shell on CloudxLab
4. Example 1 - Performing Word Count
5. Understanding Spark Cluster Modes on YARN
6. RDDs (Resilient Distributed Datasets)
7. General RDD Operations: Transformations & Actions
8. RDD lineage
9. RDD Persistence Overview
10. Distributed Persistence
15. Writing and Deploying Spark Applications
1. Creating the SparkContext
2. Building a Spark Application (Scala, Java, Python)
3. The Spark Application Web UI
4. Configuring Spark Properties
5. Running Spark on Cluster
6. RDD Partitions
7. Executing Parallel Operations
8. Stages and Tasks
16. Common Patterns in Spark Data Processing
1. Common Spark Use Cases
1. Example 1 - Data Cleaning (Movielens)
1. Example 2 - Understanding Spark Streaming
2. Understanding Kafka
3. Example 3 - Spark Streaming from Kafka
4. Iterative Algorithms in Spark
5. Project: Real-time analytics of orders in an e-commerce company
17. Data Formats & Management
1. XML
3. How to store many small files - SequenceFile?
4. Parquet
5. Protocol Buffers
6. Comparing Compressions
7. Understanding Row Oriented and Column Oriented Formats - RCFile?
18. DataFrames and Spark SQL
1. Spark SQL - Introduction
2. Spark SQL - Dataframe Introduction
3. Transforming and Querying DataFrames
4. Saving DataFrames
5. DataFrames and RDDs
6. Comparing Spark SQL, Impala, and Hive-on-Spark
19. Machine Learning with Spark
1. Machine Learning Introduction
2. Applications Of Machine Learning
3. MlLib Example: k-means
4. SparkR Example
Months of Blended Training
Days of Lab Access


Which Executive PG Certificate Program by IIT Roorkee should you choose?

Topics available Executive PG Certificate Program in Data Science Executive PG Certificate Program in AI (Machine Learning & Deep Learning) Executive PG Certificate Program in Machine Learning Operations
Foundations (Python, Linux, Git, SQL, Analytics)
Machine Learning
Deep Learning Foundations - Artificial Neural Networks
Advanced Deep Learning
Business Analytics
Big Data (Hadoop & Spark)
DevOps Foundations - Deploy Model to Production
Advanced DevOps
View Details View Details

Apply Now

Application Process

  • Step 1. Submit the application form and SOP(Statement of Purpose)
    Register by filling the application form. Admission Test in the next step will be immediately after filling the form.

  • Step 2. Admission Test
    Online admission test of 60 minutes containing 20 multiple choice questions to assess your quantitative aptitude and programming skills.

  • Step 3. Personal Interview
    Online personal interview for 45 minutes with a course mentor. The interviewer will ask questions based on your Statement of Purpose, work experience(if any) and your motivation and aspirations to join the course.

  • Step 4. Join The Program
    If selected, the admission office will send the letter of acceptance. Submit the admission fees in due time to confirm the seat

  • Note: Admission test should be taken immediately after submitting the application using the link displayed post application submission.

Eligibility Criteria

    1. Anybody in their final year of undergraduate degree or has completed their undergraduation is eligible to apply for the course
    1. Must have studied Mathematics in 12th standard


    1. 10% Scholarships are available for students, women from STEM background and unemployed
    1. 5% Scholarship available for IIT Alumni and CloudxLab Alumni.

PS: Details to avail the scholarship will be sent post application-submission and only one scholarship applicable per learner

Certification Guideline

You will be required to have at least 60% attendance in live sessions, complete at least 75% of the course, 1 Capstone Project and 6 Guided projects - Analyse emails from Python, Sentiment Analysis (Hive) from Hadoop, Log Parsing from Spark, 3 mandatory projects from Machine Learning. All the above requirements need to be met within the deadline of the course (12 Months) to be eligible for the certificate from IIT Roorkee.

No Cost EMI at


Or Program Fee 4,999*

  • 12 Months Program
  • 365 Days of Online Lab Access
  • 24*7 Support
  • Application Deadline 08th Jan'22
  • Certificate from IIT Roorkee
Sold Out »

  • Note: Additional 5% off for one-time payment
  • Placement Assistance

    Placement Eligibility Test

    Placement Eligibility Test

    We have around 300+ recruitment partners who will be interviewing you based on your performances in PET

    Dedicated Job Portal

    Dedicated Job Portal

    Opportunities from companies who approach us asking for our learner profiles will be posted on our job portal to providevisibility to your profile

    Career Guidance Webinars

    Career Guidance Webinars

    Career Guidance Webinars from seasoned industry experts



    Frequently Asked Questions

    Will I get support?

    Yes! Please feel free to ask your questions on CloudxLab forum and our community and team of experts will answer your questions. We believe forum will add better perspectives, ideas, and solutions to your questions.

    Can I get a certificate for the projects completed?

    We have created a set of Guided Projects on our platform. You may complete these guided projects and earn the certificate for free. Check it out here

    Can I expect any placement support?

    Yes, we do offer placement assistance that includes career guidance, resume building tips and mock interviews. Each participant will receive staunch support from the industry mentors, who also direct you through various placement opportunities within the industry. Above all, we are partnered with leading MNC’s that offer placement opportunities to our participants.

    Do I need to install any software before starting this course?

    No, we will provide you with access to our online lab and BootML so that you do not have to install anything on your local machine

    What is the validity of course material?

    We understand that you might need course material for a longer duration to make most out of your subscription. You will get lifetime access to the course material so that you can refer to the course material anytime.

    What if I miss a class?

    You will never lose any lecture. You can view the recorded session of the class in your LMS.

    What is the refund policy?

    We provide a 100% fee refund if the request is raised within the first 2 instructor-led sessions. Please contact us at to request a refund within the stipulated time. Thereafter, no refund is provided.

    I have some more questions. Can I talk to someone?

    Absolutely! Please contact us here

    Will there be Options to Pay using EMI/Installments

    Yes, you can choose to pay by installments on the payment page.

    How will the payment for the course be made?

    The course fee will be paid in two parts:

    1. Admission fee - Which is a small percentage of the total course fee which helps to reserve your seat for the program

    2. Installments - The remaining course fee will be paid in 11 Installments paid monthly for 11 months as part of No Cost EMI

    If I have more questions during the week days apart from the live sessions, how can I get it cleared?

    Teaching assistants will be helping you during the weekdays to ensure a seamless learning experience. You will be able to have a session at your and the teaching assistants convenience and get your queries cleared over WhatsApp, email or call. There is no limit to the same as well.

    What all will be asked during the personal interview?

    The Interviewer during the personal interview will be asking you questions regarding the Statement of Purpose (SOP) you have filled out in your application form, your motivation for joining this course and what you find most interesting in this field.

    Is there an admission fee for the PG Certification Programs?

    No, there are no additional charges for admission. But, if you are choosing to pay by No Cost EMI, there will be a fixed admission charge that is to be paid. This amount will be subtracted from the total course fees.