End to End ML Project - Explore the dataset

Now we will explore the dataset.

  • Use the info method to get more information on the dataset

    housing.<<your code goes here>>
  • Get a better understanding of the mean, standard deviation, maximum value, and other summary statistics of the dataset by using the describe method

    housing.<<your code goes here>>
  • Plot histograms of all the features using the hist method

    housing.<<your code goes here>>(bins=50, figsize=(20,15))
  • Plot a histogram of the median income attribute of the dataset

    housing["median_income"].<<your code goes here>>
  • Divide the median income attribute into labeled bins using the cut method, and then plot a histogram of the binned attribute

    housing["income_cat"] = pd.<<your code goes here>>(housing["median_income"],
                                   bins=[0., 1.5, 3.0, 4.5, 6., np.inf],
                                   labels=[1, 2, 3, 4, 5])
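
To see how the binning in the last step behaves, here is a minimal sketch on a handful of made-up income values. The Series below is a hypothetical stand-in for housing["median_income"], not data from the actual dataset; the bin edges and labels are the ones used above. By default, pandas treats each bin as closed on the right, so a value of exactly 1.5 falls into category 1 and a value of exactly 6.0 falls into category 4.

```python
import numpy as np
import pandas as pd

# Hypothetical sample values standing in for housing["median_income"]
median_income = pd.Series([0.9, 1.5, 2.7, 3.0, 4.2, 5.1, 6.0, 8.3, 15.0])

# Same bin edges and labels as in the exercise; bins are right-inclusive by default
income_cat = pd.cut(median_income,
                    bins=[0., 1.5, 3.0, 4.5, 6., np.inf],
                    labels=[1, 2, 3, 4, 5])

# Count how many sample values landed in each category, in category order
counts = income_cat.value_counts().sort_index()
print(counts)
```

On the real dataset, the same per-category counts are what a histogram of housing["income_cat"] would display.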

