Advanced Certification in Data Science & AI by E&ICT Academy (IIT Roorkee)

Curriculum

11+

Months of Blended Training

330

Days of Lab Access

29+

Projects

13K+

Learners

Download Curriculum

Foundation: 1. Linux for Data Science

2. Getting Started with Git

3. Python Foundations

4. Machine Learning Prerequisites(Including Numpy, Pandas and Linear Algebra)

5. Getting Started with SQL

6. Statistics Foundations
1. Introduction to Machine Learning and Deep Learning: In this topic, we will cover concepts like different types of Machine Learning algorithms (Supervised, Unsupervised, Reinforcement) and challenges in Machine Learning. We will see examples of solving the problems using the traditional approach and why Machine Learning algorithms give far better accuracy than the traditional approach. This topic will give you a brief introduction to both Machine Learning and Deep Learning world.
2. Data Preprocessing, Regression - Build end-to-end Machine Learning Project: We will start the course by learning concepts in Machine Learning. In this topic, we will build a machine learning model to predict housing pricing in California. By the end of this project, you will understand how to build machine learning pipelines to build a model. We will also cover concepts like data cleaning, preparing data for machine learning algorithms, exploring many different models, short-list the best one and fine-tuning the selected model
3. Classification: In this topic, we will train a model on the MNIST dataset to recognize handwritten digits. We will also learn various performance measures in classification like Confusion Matrix, Precision and Recall, and ROC Curve.
4. Machine Learning Algorithms: In this topic, we will learn various Machine Learning algorithms and concepts like Unsupervised Learning, Ensemble Learning, and Dimensionality Reduction
5. Introduction to Artifical Neural Networks with Keras: We will start the Deep Learning course with Artificial Neural Networks. We will learn about biological neurons, multilayer perceptrons, and back-propagation. We will implement a multilayer perceptron using Keras and visualize the runs and graphs using Tensorboard
6. Training Deep Neural Networks: In this topic, we will learn various challenges deep neural networks face while training like vanishing and exploding gradients. We will learn various techniques to solve these problems like reusing pre-trained layers, using faster optimizers and avoiding overfitting by regularization.
7. Custom Models and Training with TensorFlow: In this topic, we will dive deeper into TensorFlow and its lower level Python API. These lower-level Python APIs are useful when we need extra control like writing custom loss function, layers and many more.
8. Loading and Preprocessing Data with TensorFlow: Deep Learning systems are usually trained on very large datasets that may not fit in the RAM. In this topic, we will learn TensorFlow's Data API which helps in ingesting dataset and preprocessing it efficiently.
9. Deep Computer Vision using Convolutional Neural Network: In this topic, we will learn how Convolutional Neural Networks - CNNs achieve superhuman performance on complex visual tasks. Today CNNs power image search services, self-driving cars, automatic video classification systems and more. We will learn CNNs basic building blocks and how to implement them using TensorFlow and Keras
10. Processing Sequences Using RNNs and CNNs: Predicting the future is something we do all the time like predicting stock prices. In this topic, we will learn how Recurrent Neural Networks - RNN predict the future, the problem they face like limited short-term memory and solutions to these problems - LSTM (Long Short-Term Memory) and GRU cells
11. Natural Language Processing Concepts and RNNs: Using Natural Language Processing we build systems that can read and write natural language. In this topic, we will learn different NLP techniques and generate Shakespearean text using a Character RNN.
12. Representation Learning & Generative Learning Using autoencoders and GANs: Autoencoders are artificial neural networks capable of learning dense representations of input data without any supervision. For example, we could train an autoencoder on pictures of faces and it can then generate new faces. In this topic, we will learn different types of autoencoders and generative models.
13. Reinforcement Learning: Reinforcement Learning is one of the most exciting fields of Machine Learning. Using Reinforcement Learning AlphaGo(system) defeated the world champion at the game of Go. Reinforcement Learning is an area of Machine Learning aimed at creating agents capable of taking actions in an environment in a way that maximizes rewards over time. In this topic, we will learn various concepts in Reinforcement Learning and experiment with OpenAI Gym.
1. Introduction: 1. Introduction

2. Distributed systems

3. Big Data Use Cases

4. Various Solutions

5. Overview of Hadoop Ecosystem

6. Spark Ecosystem Walkthrough
2. Foundation & Environment: 1. Understanding the CloudxLab

2. Getting Started - Hands on

3. Hadoop & Spark Hands-on

4. Understanding Regular Expressions

5. Setting up VM
3. Zookeeper: 1. ZooKeeper - Race Condition

2. ZooKeeper - Deadlock

3. How does election happen - Paxos Algorithm?

4. Use cases

5. When not to use
4. HDFS: 1. Why HDFS?

2. NameNode & DataNodes

3. Advance HDFS Concepts (HA, Federation)

4. Hands-on with HDFS (Upload, Download, SetRep)

5. Data Locality (Rack Awareness)
5. YARN: 1. Why YARN?

2. Evolution from MapReduce 1.0

3. Resource Management: YARN Architecture

4. Advance Concepts - Speculative Execution
6. MapReduce Basics: 1. Understanding Sorting

2. MapReduce - Overview

3. Word Frequency Problem - Without MR

4. Only Mapper - Image Resizing

5. Temperature Problem

6. Multiple Reducer

7. Java MapReduce
7. MapReduce Advanced: 1. Writing MapReduce Code Using Java

2. Apache Ant

3. Concept - Associative & Commutative

4. Combiner

5. Hadoop Streaming

6. Adv. Problem Solving - Anagrams

7. Adv. Problem Solving - Same DNA

8. Adv. Problem Solving - Similar DNA

9. Joins - Voting

10. Limitations of MapReduce
8. Analyzing Data with Pig: 1. Pig - Introduction

2. Pig - Modes

3. Example - NYSE Stock Exchange

4. Concept - Lazy Evaluation
9. Processing Data with Hive: 1. Hive - Introduction

2. Hive - Data Types

3. Loading Data in Hive (Tables)

4. Movielens Data Processing

5. Connecting Tableau and HiveServer 2

6. Connecting Microsoft Excel and HiveServer 2

7. Project: Sentiment Analyses of Twitter Data

8. Advanced - Partition Tables

9. Understanding HCatalog & Impal
10. NoSQL and HBase: 1. NoSQL - Scaling Out / Up

2. ACID Properties and RDBMS Story

3. CAP Theorem

4. HBase Architecture - Region Servers etc

5. Hbase Data Model - Column Family Orientedness

6. Getting Started - Create table, Adding Data

7. Adv Example - Google Links Storage

8. Concept - Bloom Filter

9. Comparison of NOSQL Databases
11. Importing Data with Sqoop and Flume, Oozie: 1. Sqoop - Introduction

2. Sqoop Import - MySQL to HDFS

3. Exporting to MySQL from HDFS

4. Concept - Unbounding Dataset Processing or Stream Processing

5. Flume Overview: Agents - Source, Sink, Channel

6. Data from Local network service into HDFS

7. Example - Extracting Twitter Data

8. Example - Creating workflow with Oozier
1. Introduction: 1. Apache Spark ecosystem walkthrough

2. Spark Introduction - Why Spark?
2. Scala Basics: 1. Introduction, Access Scala on CloudxLab

2. Variables and Methods

3. Interactive, Compilation, SBT

4. Types, Variables & Values

5. Functions

6. Collections

7. Classes

8. Parameters

3. Spark Basics: 1. Apache Spark ecosystem

2. Why Spark?

3. Using the Spark Shell on CloudxLab

4. Example 1 - Performing Word Count

5. Understanding Spark Cluster Modes on YARN

6. RDDs (Resilient Distributed Datasets)

7. General RDD Operations: Transformations & Actions

8. RDD lineage

9. RDD Persistence Overview

10. Distributed Persistence
4. Writing and Deploying Spark Applications: 1. Creating the SparkContext

2. Building a Spark Application (Scala, Java, Python)

3. The Spark Application Web UI

4. Configuring Spark Properties

5. Running Spark on Cluster

6. RDD Partitions

7. Executing Parallel Operations

8. Stages and Tasks
5. Common Patterns in Spark Data Processing: 1. Common Spark Use Cases

1. Example 1 - Data Cleaning (Movielens)

1. Example 2 - Understanding Spark Streaming

2. Understanding Kafka

3. Example 3 - Spark Streaming from Kafka

4. Iterative Algorithms in Spark

5. Project: Real-time analytics of orders in an e-commerce company
6. Data Formats & Management: 1. XML

2. AVRO

3. How to store many small files - SequenceFile?

4. Parquet

5. Protocol Buffers

6. Comparing Compressions

7. Understanding Row Oriented and Column Oriented Formats - RCFile?
7. DataFrames and Spark SQL: 1. Spark SQL - Introduction

2. Spark SQL - Dataframe Introduction

3. Transforming and Querying DataFrames

4. Saving DataFrames

5. DataFrames and RDDs

6. Comparing Spark SQL, Impala, and Hive-on-Spark
8. Machine Learning with Spark: 1. Machine Learning Introduction

2. Applications Of Machine Learning

3. MlLib Example: k-means

4. SparkR Example

Projects

Project 1.
Churn Email Inbox with Python

Churn the mail activity from various individuals in an open source project development team.

Project 2.
MapReduce and Python

Solve various problems using MapReduce and Python.

Project 3.
Sentiment Analysis

Sentiment analysis of "Iron Man 3" movie using Hive and visualizing the sentiment data using BI tools such as Tableau.

Project 4.
Spark application

Write end-to-end Spark application starting from writing code on your local machine to deploying to the cluster.

Project 5.
Parse Apache Access Logs using Spark.

The logs of a webserver are the gold mines for gaining insights in the user behaviour. So learn to parse the text data stored in logs of a web server using the Apache Spark.

Project 6.
Predicting the median housing prices in California

In this project we will build a machine learning model to predict housing prices. We will learn various data manipulation, visualization and cleaning techniques using various libraries of Python like Pandas, Scikit-Learn and Matplotlib.

Project 7.
Classifying handwritten digits in MNIST dataset

The MNIST dataset is considered as "Hello World!" of Machine Learning. Write your first classification logic. Starting with Binary Classification learn Multiclass, Multilabel, Multi-output classification and different error analysis techniques.

Project 8.
Noise removal from the images

Build a model that takes a noisy image as an input and outputs the clean image.

Project 9.
Build a spam classifier

Build a model to classify email as spam or ham. First, download examples of spam and ham from Apache SpamAssassin’s public datasets and then train a model to classify email.

Project 10.
Predict which passengers survived in the Titanic shipwreck

The sinking of the RMS Titanic is one of the most infamous shipwrecks in history. In this project, you build a model to predict which passengers survived the tragedy.

Project 11.
Image Classification with Pre-trained InceptionV3 Network

This project aims to impart the knowledge of how to access the pre-trained models from TensorFlow 2, and appreciate its powerful classification capacity by making the model predict the class of an input image.

Project 12.
Host an Image Classification App on Heroku

Heroku is a cloud platform for the deployment and management purposes of web applications. You will learn how to deploy DeepLearning based Flask web app on Heroku.

Project 13.
Predicting Noisy Images using KNN Classifier

We will train a KNN classifier to predict MNIST images from their noisy version.

Project 14.
Build an Image Classifier in Fashion MNIST dataset

Build a CNN from scratch to classify FashionMNIST data using Tensorflow2, Matplotlib and Python.

Project 15.
Deploy Machine Learning models to Production using Flask

Learn how to deploy a machine learning model as a web application using the Flask framework.

Project 16.
Sentiment Analysis using IMDB dataset

Create a sentiment analysis model with the IMDB dataset using TensorFlow 2.

Project 17.
Mask R-CNN with OpenCV for Object Detection

Learn how to read a pre-trained TensorFlow model for object detection using OpenCV.

Project 18.
Image Classification with Pre-trained Keras models

Learn how to access the pre-trained models(here we get pre-trained ResNet model) from Keras of TensorFlow 2 to classify images.

Project 19.
Build cats classifier using transfer learning

Build a basic neural network to classify if a given image is of cat or not using transfer learning technique with Python and Keras.

Project 20.
Art Generation Project

Use TensorFlow 2 to generate an image that is an artistic blend of a content image and style image using Neural Style Transfer.

Project 21.
Credit Card Fraud Detection using Machine Learning

Learn how to over-sample the dataset with imbalanced classes using the SMOTE technique and how to use the thus obtained data to build a fraudulent transaction classifier.

Project 22.
Image Stitching using OpenCV and Python (Creating Panorama Project)

As you know, the Google photos app has stunning automatic features like video making, panorama stitching, collage making, and many more. In this project, we will understand how to make a panorama stitching using OpenCV with Python.

Project 23.
NYSE Stock Closing Price Prediction using TensorFlow 2

Predict stock market closing prices for a firm using GRU, a state-of-art deep learning algorithm for sequential data, with Keras and Python.

Project 24.
Introduction to Transfer Learning (Cat vs Non-cats Project)

Apply the idea of Transfer Learning to build an image classifier with Tensorflow2, and use it to predict the class of an input image - whether it is a cat or a non-cat.

Project 25.
Building Cat vs Non-Cat Image Classifier using NumPy

Use Python and Numpy to build a Logistic Regression Classifier from scratch, and apply it to predict the class of an input image - whether it is a cat or a non-cat.

Project 26.
Iris Flowers Classification using Deep Learning & Keras

Use Python and Tensorflow 2 Keras to build a dense deep neural network classifier to predict the classes of flowers in the Iris dataset.

Project 27.
Classify Clothes from Fashion MNIST Dataset

Build a model to classify clothes into various categories in Fashion MNIST dataset.

Project 28.
Face recognition

Identify person from digital image or video.

Project 29.
Game - OpenGym

Project using OpenGym on Reinforcement Learning.

Sold Out

Application Process

Step 1. Submit the application form and SOP(Statement of Purpose)
Register by filling the application form
Step 2. Reviewing the application
The admission team will review the application and respond with the application status in 48 hours
Step 3. Join The Program
Confirmation of seat is subject to the payment

No Cost EMI at

164/Month

Or Program Fee 1799

11 Months of Blended Learning
330 Days of Online Lab Access
24*7 Support
Batch Starts on 03^rd April, 2021
Certificate from E&ICT Academy, IIT Roorkee

This course is sold out

Checkout New AI course by IIT Roorkee »

Testimonials

“Sessions were great, pace was also very good. Each of the steps were explained well multiple times to ensure everyone understands the concepts. Thanks Sandeep!”

Nitin Nigam

“Thanks a lot,it was great course! I'm happy that you lead in this path to AI/ML/DL.I hope to continue to collaborate with you in future.”

Domenico Flioravanti

“Thank you so much Sandeep for all your great sessions. It will help in our career a lot. Your session is very much explanatory and understandable. Kudos to you.Thanks for all your hard work and time. Definitely, we will recommend all our friends and colleagues to attend your different course.Thanks a ton”

Hemanta Lenka

“I have been using CloudxLab for a while now, and they are amazing! The best part about using CloudxLab is that you do not need to wait for someone to tell you whether what you did was right or not, it is done automatically on the go. The training materials are of top notch quality. If you get stuck, they have a huge community of trainers and learners to help you out with all your doubts. They have a course structure for everyone, whether you are new to programming or are a seasoned programmer, they have something to offer you. And they are affordable too! I would recommend CloudxLab all the time.”

Rajtilak Bhattacharjee

“This course is suitable for everyone. Me being a product manager had not done hands-on coding since quite some time. Python was completely new to me. However, Sandeep Giri gave us a crash course to Python and then introduced us to Machine Learning. Also, the CloudxLab’s environment was very useful to just log in and start practising coding and playing with things learnt. A good mix of theory and practical exercises and specifically the sequence of starting straight away with a project and then going deeper was a very good way of teaching. I would recommend this course to all.”

Kamal Upadhyay

“It has been a wonderful learning experience with CXL. This is one of the courses that will probably stay with me for a significant amount of time. The platform provides a unique opportunity to try hands-on simultaneously with the coursework in an almost real-life coding example. Besides, learning to use algebra, tech system and Git is a good refresher for anyone planning to start or stay in technology. The course covers the depth and breadth of ML topics. I specifically like the MNIST example and the depth to which it goes in explaining each and every line of code. Would definitely recommend the instructor-led course.”

Pratik Sonthalia

“This is one of the best-designed course, very informative and well paced. The killer feature of machine/deep learning coursed from CloudxLab is the live session with access to labs for hands-on practices! With that, it becomes easy following any discourse, even if one misses the live sessions(Read that as me!). Sandeep(course instructor) has loads of patience and his way of explaining things are just remarkable. I might have better comments to add here, once I learn more! Great Jobs guys!”

Dhyan Prem

Senior Software Developer at Decision Resources Group

Related Course

Advanced Certification in Machine Learning & AI from - EICT, IIT Roorkee

08 Months Online Program

View Details

Frequently Asked Questions

What is the application process for the program and what are the timelines?: Applications have already started for this course. You can start the application process by submitting the application form and eligibility quiz. Then our admission committee will take a call on approval and revert back. After making the payment you will get access to the self-paced content, then live classes of Batch will start from 28 March 2021.
How does the EMI Payment model work?: The EMI payment starts from April 2021. The monthly EMI payments should be cleared before the 5th of every month. Failure to make the payments will lead to the removal of course access from your account
What is the format of the course?: The content will be a mix of interactive self-paced lectures and live instructor-led training from industry leaders as well as renowned faculty from IIT Roorkee. Linux, Python and Big Data will be provided as self-paced module before the live classes start. Additionally, the program comprises 24*7 support dedicated to solving your academic queries and reinforcing learning. The discussion forum on the CloudxLab website will also facilitate peer-to-peer interactions.
Do I need to install any software before starting this course?: No, we will provide you with the access to our online lab and BootML so that you do not have to install anything on your local machine
What if I miss any classes in between?: Recording of every session will be available just after the class, even if you miss the class you can go through the recording and ask your doubts either on our discussion forum or in the next class.
What are the expected career options after pursuing this course?: Someone who has successfully completed this course is expected to be able to solve problems more efficiently using some of the latest technologies in the industry. Learners who have completed this course will be a perfect fit for VLSI, Semiconductor, or similar industries.
What are the prerequisites for this course?: Basic knowledge of any programming language and Linux will help you in understanding the concepts faster. We will provide access to our self-paced courses on Python and Linux once you sign up for this course.
What is the eligibility criteria to earn the certificate for the course: You will be required to have at least 60% attendance in live sessions, complete at least 75% of the course content, and complete 1 Capstone Project and 8 Guided projects - Analyse emails from Python, Sentiment Analysis (Hive) from Hadoop, Log Parsing from Spark, 3 mandatory projects from Machine Learning, and 2 mandatory projects from Deep Learning. All the above requirements need to be met within the deadline of the course (11 Months) to be eligible for the certificate from E&ICT Academy, IIT Roorkee.
What is the validity of the course material?: We understand that you might need course material for a longer duration to make most out of your subscription. You will get lifetime access to the course material so that you can refer to the course material anytime.
Is there any EMI option available?: Yes, for further details please drop a mail to reachus@cloudxlab.com
May I directly interact with the instructor?: Yes, every learner can directly ask their questions and discuss his/her query during any of the class lectures. You can also post your query on our discussion forum.
What is the refund policy?: We provide a 100% fee refund if the request is raised within the first 2 instructor-led sessions. Please contact us at reachus@cloudxlab.com to request a refund within the stipulated time. Thereafter, no refund is provided.

Advanced Certification in Data Science & AI By E&ICT Academy, IIT Roorkee

Program Partner - CloudxLab

Learn Python, NumPy, Pandas, Scikit-learn, TensorFlow 2.0, Spark, Hadoop, Regression, Classification, SVM, Random Forests, CNNs, RNNs, Reinforcement Learning, GANs and More

03rd April

11 months

Online

29+

E&ICT, IIT Roorkee

13,500+

E&ICT Academy, IIT Roorkee

About the Course

Program Highlights

Certificate of Completion by E&ICT Academy, IIT Roorkee

11 Months of Blended Learning

Work on about 29+ projects to get hands-on experience

Timely Doubt Resolution

Best In Class Curriculum

Cloud Lab Access

Batch Starts on 03rd April, 2021

Our Students Work At

Certificate

What is the certificate like?

Why E&ICT, IIT Roorkee?

Why Cloudxlab?

Programming Languages and Tools

Hands-on Learning

Gamified Learning Platform

Auto-assessment Tests

No Installation Required

Mentors / Faculty

Raksha Sharma

Sanjeev Manhas

Venkat Karun

Sandeep Giri

Abhinav Singh

Praveen Pavithran

Curriculum

Months of Blended Training

Days of Lab Access

Projects

Learners

Machine Learning & Deep Learning

Course on Big Data with Hadoop

Course on Big Data with Spark

Projects

Project 1.Churn Email Inbox with Python

Project 2.MapReduce and Python

Project 3. Sentiment Analysis

Project 4. Spark application

Project 5. Parse Apache Access Logs using Spark.

Project 6. Predicting the median housing prices in California

Project 7. Classifying handwritten digits in MNIST dataset

Project 8. Noise removal from the images

Project 9. Build a spam classifier

Project 10. Predict which passengers survived in the Titanic shipwreck

Project 11. Image Classification with Pre-trained InceptionV3 Network

Project 12. Host an Image Classification App on Heroku

Project 13. Predicting Noisy Images using KNN Classifier

Project 14. Build an Image Classifier in Fashion MNIST dataset

Project 15. Deploy Machine Learning models to Production using Flask

Project 16. Sentiment Analysis using IMDB dataset

Project 17. Mask R-CNN with OpenCV for Object Detection

Project 18. Image Classification with Pre-trained Keras models

Project 19. Build cats classifier using transfer learning

Project 20. Art Generation Project

Project 21. Credit Card Fraud Detection using Machine Learning

Project 22. Image Stitching using OpenCV and Python (Creating Panorama Project)

Project 23. NYSE Stock Closing Price Prediction using TensorFlow 2

Project 24. Introduction to Transfer Learning (Cat vs Non-cats Project)

Project 25. Building Cat vs Non-Cat Image Classifier using NumPy

Project 26. Iris Flowers Classification using Deep Learning & Keras

Project 27. Classify Clothes from Fashion MNIST Dataset

Project 28. Face recognition

Project 29. Game - OpenGym

Sold Out

Application Process

Step 1. Submit the application form and SOP(Statement of Purpose)

Step 2. Reviewing the application

Step 3. Join The Program

No Cost EMI at

Advanced Certification in Data Science & AI
By E&ICT Academy, IIT Roorkee

03^rd April

Batch Starts on 03^rd April, 2021

Project 1.
Churn Email Inbox with Python

Project 2.
MapReduce and Python

Project 3.
Sentiment Analysis

Project 4.
Spark application

Project 5.
Parse Apache Access Logs using Spark.

Project 6.
Predicting the median housing prices in California

Project 7.
Classifying handwritten digits in MNIST dataset

Project 8.
Noise removal from the images

Project 9.
Build a spam classifier

Project 10.
Predict which passengers survived in the Titanic shipwreck

Project 11.
Image Classification with Pre-trained InceptionV3 Network

Project 12.
Host an Image Classification App on Heroku

Project 13.
Predicting Noisy Images using KNN Classifier

Project 14.
Build an Image Classifier in Fashion MNIST dataset

Project 15.
Deploy Machine Learning models to Production using Flask

Project 16.
Sentiment Analysis using IMDB dataset

Project 17.
Mask R-CNN with OpenCV for Object Detection

Project 18.
Image Classification with Pre-trained Keras models

Project 19.
Build cats classifier using transfer learning

Project 20.
Art Generation Project

Project 21.
Credit Card Fraud Detection using Machine Learning

Project 22.
Image Stitching using OpenCV and Python (Creating Panorama Project)

Project 23.
NYSE Stock Closing Price Prediction using TensorFlow 2

Project 24.
Introduction to Transfer Learning (Cat vs Non-cats Project)

Project 25.
Building Cat vs Non-Cat Image Classifier using NumPy

Project 26.
Iris Flowers Classification using Deep Learning & Keras

Project 27.
Classify Clothes from Fashion MNIST Dataset

Project 28.
Face recognition

Project 29.
Game - OpenGym