Duration
Format
Certificate
Computing systems have fueled the growth of AI. Improvements in deep-learning algorithms have inevitably gone hand-in-hand with the improvements in the hardware accelerators. Our ability to train increasingly complex AI models and achieve low-power, real-time inference depends on the capabilities of computing systems.
In recent years, the metrics used for optimizing and evaluating AI algorithms are diversifying: along with accuracy, there is increasing emphasis on the metrics such as energy efficiency and model size. Given this, researchers working on deep learning can no longer afford to ignore the computing system. Instead, the knowledge of the potential and limitations of computing systems can provide invaluable guidance to them in designing the most efficient and accurate algorithms.
This course aims to inform students, practitioners and researchers in deep-learning algorithms about the potential and limitations of various processor architectures for accelerating the deep learning algorithms. At the same time, it seeks to motivate and even challenge the engineers and professionals in the architecture domain to optimize the processors according to the needs of deep-learning algorithms.
Course contents: This course discusses AI acceleration on various computing systems, such as FPGAs, mobile/desktop GPUs, smartphones, ASICs, DSPs and CPUs. It explains the architecture of several commercial AI accelerators, viz., Microsoft's Brainwave, Qualcomm's Hexagon DSP, NVIDIA's desktop GPUs and Tensor cores, NVIDIA's Jetson GPU, Intel's Xeon Phi, Intel Habana Labs' Goya and Gaudi, Google's Tensor Processing Unit (TPU) version 1 to 4, Cerebras' Wafer Scale Engine, Alibaba's HanGuang Processor, Groq’s Tensor Streaming Processor (TSP), Untether’s TsunAImi Processor and Graphcore's Intelligence Processing Unit (IPU). Further, it discusses how Facebook optimizes AI services in its data center and how it optimizes its mobile app. The course also discusses several research-grade accelerators, such as memristor-based accelerators. Overall, the course teaches about AI accelerators at levels ranging from smartphone, desktop, server to data-center.
Apart from performance and energy metrics, this course also discusses hardware reliability and security techniques for deep-learning algorithms and accelerators. A few real-life applications that benefit from AI-accelerators are reviewed, such as autonomous driving and brain implants. The course draws from recent research papers to showcase the state-of-art in these fields. To make the course self-sufficient, a reasonable amount of background is presented on both computer architecture and CNNs.
This course is at the intersection of deep learning algorithms, computer architecture, and chip design, and thus, is expected to be beneficial for a broad range of learners.
Certificate of Completion by IIT Roorkee
Learn from IIT Roorkee professors and Industry Experts
Proctored Exams with Deep Learning models with opportunity to get Placed
Get an hands-on experience with our Guided Projects
Get access to community of learners via our discussion forum
Lab comes pre-installed with all the software you will need to learn and practice.
IIT Roorkee is ranked first among all the IITs AND 20th position globally in citations per faculty. Established in 1847, it's one of the oldest technical institutions in Asia. IIT Roorkee fosters a very strong entrepreneurial culture. Some of their alumni are highly successful as entrepreneurs in the new age digital economy.
CloudxLab is a team of developers, engineers, and educators passionate about building innovative products to make learning fun, engaging, and for life. We are a highly motivated team who build fresh and lasting learning experiences for our users. Powered by our innovation processes, we provide a gamified environment where learning is fun and constructive. From creative design to intuitive apps we create a seamless learning experience for our users. We upskill engineers in deep tech - make them employable & future-ready.
Among the IITs in the ‘Citations per Faculty’ parameter
*QS World Rankings
Ranked Engineering College
*India Today 2020
Ranked for IITs
*NIRF 2020
Ranked Best Global Universities in India
*QS World Rankings
Faculty at ECE Dept and Center for AI and DS
IIT Roorkee
Dr. Sparsh Mittal is currently working as an assistant professor at ECE Dept at IIT Roorkee, India. He is also a joint faculty at Center for AI and DS at IIT Roorkee. He received the B.Tech. degree from IIT, Roorkee, India and the Ph.D. degree from Iowa State University (ISU), USA. He has worked as a Post-Doctoral Research Associate at Oak Ridge National Lab (ORNL), USA and as an assistant professor at CSE, IIT Hyderabad. He was the graduating topper of his batch in B.Tech and his BTech project received the best project award. He has received a fellowship from ISU and a performance award from ORNL.
He has published more than 100 papers at top venues and his research has been covered by technical websites such as InsideHPC, HPCWire, Phys.org, and ScientificComputing. He is an associate editor of Elsevier's Journal of Systems Architecture. He has given invited talks at ISC Conference at Germany, New York University, University of Michigan and Xilinx (Hyderabad). In Stanford's list of world's top researchers, in the field of Computer Hardware & Architecture, he was ranked as number 107 (for whole career) and as number 3 (for year 2019 alone).
We have around 300+ recruitment partners who will be interviewing you based on your performances in PET
Sessions will be conducted to guide you on creating the perfect resume and professional profile to get noticed by recruiters
Career Guidance Webinars from seasoned industry experts
You will be required to complete 100% of the course content within 90 days of enrollment to be eligible for the certificate.
The candidate should have an idea of what is deep learning, especially the basics of CNNs and RNNs. Background in computer architecture or embedded-system is preferred, although not mandatory.
Senior Software Developer at Decision Resources Group
220+ Hours of Online Self-Paced Training
80+ Hours of Online Self-Paced Training
The candidate should have an idea of what is deep learning, especially the basics of CNNs and RNNs. Background in computer architecture or embedded-system is preferred, although not mandatory.
Someone who has successfully completed this course is expected to be able to solve problems more efficiently using some of the latest technologies in the industry. Learners who have completed this course will be a perfect fit for VLSI, Semiconductor, or similar industries.
If you are unhappy with the product for any reason, let us know within 7 days of purchasing or upgrading your account, and we'll cancel your account and issue a full refund. Please contact us at reachus@cloudxlab.com to request a refund within the stipulated time. We will be sorry to see you go though!
No, we will provide you with the access to our online lab and BootML so that you do not have to install anything on your local machine
We understand that you might need course material for a longer duration to make most out of your subscription. You will get lifetime access to the course material so that you can refer to the course material anytime.