Enrollments Open for Certification Course on Artificial Intelligence and Deep Learning by IIT RoorkeeApply Now
Welcome to the tutorial on Linux Basics.
As part of this Linux tutorial, you will learn how to work in Linux Console. Most of the command and concepts are same in other Unix based systems.
Whether you are building a server-side application, creating an API, working data science, Linux is a very important skill. With Linux, you can almost automate anything.
This tutorial is very hands-on, it would make you do things in real-time in CloudxLab.
In this chapter, we learn the basics of Big Data which include various concepts, use-cases and understanding of the eco-system.
This chapter doesn't require any knowledge of programming or technology. We believe it is very useful for every to learn the basics of Big Data. So, jump in!
Welcome to a course in Scala Foundations.
As part of this course, you will learn how to write programs using Scala.
Scala is a programming language like Java or Python. The syntax is much like Python while under the hood it compiles to Java. It also comes with both an interactive interpreter and a compiler.
Further, Scala is designed for scalable computing where the code could be sent to data and codes is also run in parallel.
Scala is used in the enterprise world and has gain a lot of traction.
MapReduce is Framework as well as a paradigm of computing. By the way of map-reduce, we are able to break-down complex computation into distributed computing.
As part of this chapter, we are going to learning how to build MapReduce programmes using Java.
Please make sure you work along with the course instead of just sitting back and watching.
Welcome to this project on Churning the Emails Inbox with Python. In this project, you will use Python to access the data from files and process it to achieve certain tasks. You will explore the MBox email dataset, and use Python to count lines, headers, subject lines by emails and domains. Know your way on how to work with data in Python.
Skills you will develop:
Whenever you make a request to a web server for a page, it records it in a file which is called logs.
The logs of a webserver are the gold mines for gaining insights in the user behaviour. Every data scientists usually look at the logs first to understand the behaviour of the users. But since the logs are humongous in size, it takes a distributed framework like Hadoop or Spark to process it.
As part of this project, you will learn to parse the text data stored in logs of a web server using the Apache Spark.
Learn how to write Spark applications.