Month: November 2018

My Tech World

Day 11-100 Days MLCode: Regularization in ML

Regularization  Today’s we’ll focus on the regularization in the Machine Learning. In the last blog, we discussed the overfitting problem. In the case of a Polynomial feature, we can control the overfitting issue by reducing the degree of Polynomial features. However, In the case of linear regression, overfitting can be controlled by controlling ( limiting) the…
Read more


November 20, 2018 0

Day 10-100 Days MLCode: Polynomial Regression

This is the 10th day of #100DaysofMLCode challenge and we are going to cover the Polynomial Regression. As per wiki, Polynomial Reg. defined as Polynomial regression is a form of regression analysis in which the relationship between the independent variable x and the dependent variable y is modelled as an nth degree polynomial in x. In real life, we can not fit all the data by just drawing…
Read more


November 19, 2018 0

Day 9-100 Days MLCode: Gradient Descent – Part 2

In the blog, we are going to continue study about Gradient descent. In our previous blog, we have discussed the Batch Gradient descent, here we are going to discuss Stochastic Gradient Descent and Mini Batch. Stochastic Gradient Descent We have seen that for the Batch Gradient descent, we have to calculate the Gradient Descent for…
Read more


November 18, 2018 0

Day 8-100 Days MLCode: Gradient descent

In this blog, we are going to discuss the Gradient Descent. This is a continuation of our #100DaysofML code challenge. You can find the previous blog here. When we are dealing with training the model, there can be two approaches to optimize the cost function of the model. A Direct Closed approach: In this case,…
Read more


November 17, 2018 1

Day 7 – 100 Days – SPAM Detector Part 2

In this blog, we’ll continue to develop the Spam Detector classifier which we started in the previous blog. we have already prepared the Train and Test data in our last blog, lets now work on cleaning the data for the training . Data Transformation Let’s develop a subroutine to convert the HTML emails to text…
Read more


November 16, 2018 0

Day 6-100 Days Of ML Code: SPAM Classifier – Part 1

Today lets create the SPAM Classifier. Spam classifier will detect whether an email is Spam or ham. We are going to use the Spam and Ham from Apache SpamAssassin’s dataset. Let’s download the dataset first. This blog is part of my #100daysofMLCode challenge and you can find all my previous blog here. import tarfile from…
Read more


November 15, 2018 0

Day 5-100 Days Of ML Code: Multiclass Classification

Binary Classifiers which we have created on Day 4 can distinguish between two classes, however, Multiclass Classifiers can differentiate between more than two class. Algorithms like Random Forest, Naive Bayes, and K-Nearest Neighbors are capable to handle the multiclass labels while other famous classification algorithms like SVM ( Support Vector Machine), Simple Gradient Descent cannot handle the…
Read more


November 14, 2018 0

Day 4-100 Days Of ML Code: Classification

Classification is one the most common supervise learning task to predict the class type of the data. In day 4 of our #100DaysOfMLCode Challenge, we’ll try to prepare the small classifier. Problem statement: Predict the Soil Type of forest based on the input detail from the dataset. We are using the Forest Cover Type Dataset…
Read more


November 13, 2018 0

DAY 3-100 DAYS OF ML CODE: Model Evaluation and Fine -Tuning

We are going to evaluate our model in this blog. This is a continuation of the previous blog, #100DaysOfMLCode Challenge and we are going to created and evaluate the model In the first blog, we did data analysis and you can find the detail here. After that in the next blog, we prepared a pipeline…
Read more


November 12, 2018 0

Day 2-100 Days Of ML Code: Data Cleansing & Linear Regression

Today we are going to cleanse the data which we did analysis in the last post and create a small Linear Regression Model to perform Linear Regression on the cleansed data. Prepare Data for Machine Learning: Let’s first divide the entire data into two sets, training and other for testing. We are going to use SciKit-Learn to…
Read more


November 11, 2018 0