Welcome to Vallabh’s Portfolio

This website exists primarily to share my data science projects and learnings with anyone who’s willing to listen to my ramblings, but it may evolve over time. You may contact me through my Linkedin Profile or by mail at vallabh.reddyb@gmail.com.

Journey

About Me

My name is Vallabh Reddy. I’m a Data Scientist and developer with a penchant for cognitive stimuli. I’ve worked in Data Science, ML, and consulting for around 8 years. I’m a supply chain SME with 5 years of work at Amazon’s global exports division under my belt where I helped the company realize operating efficiencies and cost saving opportunities through the use of ML techniques and statistical simulations.

I play squash, strategy games, and built my own PC. I love reading up on natural and man-made systems like the economy and what makes them tick.

Portfolio

Below is a list of complete and ongoing public Data Science projects of mine.

Apparel Image Classification using Convolutional Neural Networks
Multilingual Toxic Comments Classification with only English training data - RoBERTa NLP Transformer
USA Foreign Trade Analysis
Kickstarter Campaign Success Prediction

1. Apparel Image Classification using Convolutional Neural Network

Github does not represent certain features of a jupyter notebook well, such as the intra notebook anchors, so here’s a link to open the notebook through NBViewer.

Here is the Github link for the same notebook.

Introduction

The internet has only been getting more expansive. And this expanse brings unstructured data, like images. These are difficult to organize because the subjects in an image are not readily interpretable by machines. But deep learning innovation has picked up momentum and it is much easier to build models which are able to make sense of image data and classify their contents.

The objective of this project was to build an image classifier that could recognize the category of apparel in the photo fed into it. Something that would be of import to online retailers who are flooded with swarms of images from their 3rd party seller or from the customers reviewing the product. This technology could inspect the image to confirm that the image does contain the product itself or a part of the product, otherwise the image could be flagged and inspected for irrelevance.

Dataset

The model is trained and tested on the Fashion MNIST dataset. The dataset is provided by the research branch of Zalando, a European e-commerce company. The dataset is made up of 28x28 grayscale images of 10 categories of apparel. There are 60,000 observations in the training set and 10,000 observations in the test set. The mapping for the 10 categories of apparel is given below.

Label	Description
0	T-shirt/top
1	Trouser
2	Pullover
3	Dress
4	Coat
5	Sandal
6	Shirt
7	Sneaker
8	Bag
9	Ankle boot

Here’s a look at a few sample images.

Sample Images

Sample Images

The dataset has no data quality issues and each category has an equal number of observations.

Dilbert

Methodology

Using Keras for Python, the model is built of Convolutional Neural Networks(CNN) which are especially effective in image comprehension. This is because each neuron in these Convolutional layers have a component called kernel which is trained to look for certain patterns in the image, and pass the output called feature map on to the next layer to be built upon. In this way, layer after layer the patterns that the network can detect get more complex.

Let’s take a look at kernels from the first Convolutional layer.

Kernels

1st Convolutional Layer Kernels

We can see that these are not too complicated, they are usually straght lines or an unintelligible colleciton of pixels. Let us take our first image, which is a shirt and then look at the feature maps that neurons from the first convolutional layer.

Shirt

Sample Shirt Image

First Layer

Feature maps of the 1st Convolutional layer

In certain images the vertical sides of the shirt are emphasized whereas in others, horizontal bottoms and tops are focused.

We mentioned earlier that as we get into deeper layers, the activation patterns change. So let’s look at the 4th layer.

Fourth Layer

Feature maps of the 4th Convolutional layer

We can see that the activations are much more abstract here. It might make sense that in order to differentiate apparel categories of different shapes, the network learns to look for a collection of abstract features subtle enough to differentiate each other.

The notebook contains several architectures of CNNs and their performance is compared. I used the GPU implementation of TensorFlow since this increases the speed of deep learning model training exponentially, but the drawback is that the results are not perfectly reproducible and setting a seed does not work for the randomizing operations performed by the GPU.

Results

I was able to achieve a 91-93% accuracy of classifications on the test set using a 13 layer network.