Login using Social Account
     Continue with GoogleLogin using your credentials
Task
The dataset (Location: /cxldata/datasets/project/bikes.csv) contains the hourly rental bike demand data. The goal of the model is to estimate the bike demand in future given the parameters as observed in the past.
We will be following this example step-by-step in this assessment.
Input data available:
instant: record index
dteday : date
season: season (1: springer, 2: summer, 3: fall, 4: winter)
yr: year (0: 2011, 1:2012)
mnth: month (1 to 12)
hr: hour (0 to 23)
holiday: weather day is holiday or not (extracted from [Web Link])
weekday: day of the week
workingday: if day is neither weekend nor holiday is 1, otherwise is 0.
weathersit:
1: Clear, Few clouds, Partly cloudy, Partly cloudy
2: Mist + Cloudy, Mist + Broken clouds, Mist + Few clouds, Mist
3: Light Snow, Light Rain + Thunderstorm + Scattered clouds, Light Rain + Scattered clouds
4: Heavy Rain + Ice Pallets + Thunderstorm + Mist, Snow + Fog
temp: Normalized temperature in Celsius. The values are derived via (tt_min)/(t_maxt_min), t_min=*8, t_max=+39 (only in hourly scale)
atemp: Normalized feeling temperature in Celsius. The values are derived via (tt_min)/(t_maxt_min), t_min=*16, t_max=+50 (only in hourly scale)
hum: Normalized humidity. The values are divided to 100 (max)
windspeed: Normalized wind speed. The values are divided to 67 (max)
casual: count of casual users
registered: count of registered users
cnt: count of total rental bikes including both casual and registered
Steps that will be followed:
Importing the libraries
Defining some utility functions
Loading the data
Cleaning the data
Adding derived features
isWorking: Neither a workingday nor a holiday
monthCount: count of the number of months from the beginning of the dataset
xformHr: transform by shifting the hours by 5 hrs
dayCnt: count of the days from the beginning of the dataset
xformWorkHr: transforming the hour dataset to make the non-working days to have hours from 25 to 48
cntDeTrended: De-trended count values
Analyzing the dataset
Dividing the dataset into training and test dataset
Training several models and analyzing their performance
Selecting a model and evaluating using test dataset
Improving the model by finding the best hyper-parameters and features
Analyzing the residuals
Taking you to the next exercise in seconds...
Want to create exercises like this yourself? Click here.
No hints are availble for this assesment
Answer is not availble for this assesment
Loading comments...