Multi-label Image Classification
Desc - Movie genre classification using movie posters
Overview
This is a tutorial from Analytics Vidhya
Suppose a dataset of images contains more than two categories, i.e. it is not a simple either/or problem.
Each image in the dataset belongs to only one category.
Example: a dataset of dog, cat, rabbit and parrot images, where each image shows only a dog, a cat, a rabbit or a parrot.
The above is known as multi-class image classification. In multi-label image classification, by contrast, a single image can belong to several categories at once.
Question: Can we predict the genre of a movie by looking at the movie poster? And of course a movie can belong to more than one genre.
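Because a movie can carry several genres, its target is naturally a vector with one 0/1 entry per genre rather than a single class index. The snippet below is a minimal sketch of that encoding; the `GENRES` list and the `encode_genres` helper are hypothetical names made up for illustration and are not taken from the tutorial or the repo.

```python
# Hypothetical genre vocabulary, chosen only for illustration.
GENRES = ["Action", "Comedy", "Drama", "Romance"]


def encode_genres(movie_genres, vocabulary=GENRES):
    """Return a multi-hot vector: 1 where the genre applies, 0 otherwise."""
    present = set(movie_genres)
    return [1 if genre in present else 0 for genre in vocabulary]


# A movie with a single genre has exactly one 1 in its target vector.
single = encode_genres(["Drama"])
# A movie with two genres has two 1s in the same vector.
multi = encode_genres(["Action", "Comedy"])

print(single)  # [0, 0, 1, 0]
print(multi)   # [1, 1, 0, 0]
```

A target vector can therefore contain any number of 1s, which is what separates this setup from the one-category-per-image case.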
The key is in the output layer - use a sigmoid activation instead of softmax. With softmax, as the probability of one class increases, the probabilities of the other classes decrease (because they must sum to 1). With sigmoid, however, each class probability is computed independently of the others. So with sigmoid the network effectively behaves like N binary classifiers, one per class, that share the same earlier layers, where N is the number of classes. Cool huh?!
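The difference between the two activations can be checked numerically. The sketch below uses only the Python standard library and made-up logit values (an assumption for illustration, not output from the actual model): it raises one logit and shows that softmax changes the other probabilities, while sigmoid leaves them untouched.

```python
import math


def softmax(logits):
    """Softmax over a list of logits; the outputs always sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(logits):
    """Element-wise sigmoid; each output depends only on its own logit."""
    return [1.0 / (1.0 + math.exp(-z)) for z in logits]


# Made-up logits for three classes, for illustration only.
logits = [2.0, 1.0, -1.0]
# Same logits, but with the second class made much more confident.
raised = [2.0, 4.0, -1.0]

soft_before, soft_after = softmax(logits), softmax(raised)
sig_before, sig_after = sigmoid(logits), sigmoid(raised)

# Softmax: the first class loses probability although its logit is unchanged.
print(soft_before[0], soft_after[0])
# Sigmoid: the first class keeps exactly the same probability.
print(sig_before[0], sig_after[0])
# Sigmoid outputs are not forced to sum to 1, so several classes can be likely at once.
print(sum(sig_before))
```

With independent per-class probabilities, predicting several genres for one poster typically comes down to comparing each output against a cut-off; the exact threshold is a design choice that this sketch does not fix.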
Setup
For details of the model and data, see the repo.
Note: No attempt has been made to fine-tune the architecture or reduce the overfitting in order to get a better training/validation loss.