The data consisted of product images like t-shirts, bags, keychains, mobile covers, etc. with characters graphics.
Training Set: 6694 images across 42 categories
Test Set: 3727 images
The data had label inconsistency of 2-3%. Had to manually resolve the label inconsitency.
Additional Data
As the data provided by CrowdAnalytix was not equally distributed
across the 42. Some categories had fewer data comparatively to the other
categories. To resolve this data insufficiency among the categories we
downloaded the additional data with the help of gi2ds and this tutorial created by Andrian Rosebrook .
You can download the data from here
(Comprises the data provided by CrowdAnalyticsX and the above mentioned
additional data). The filenames of images from Crowdanalytix starts
with Cax_train and the other images filenames start with number.
Categories
The following were the 42 categories for classification.
Angry Birds
Baloo
Bart simpson
Ben
Bulbasaur
Charizard
Charlie brown
Charmender
Chicken_little
Cinderella
Darth_vader
Disney_princes
Donald_duck
Godzilla
Goku
Goofy
Han-solo
Harry_potter
Hellokitty
Itachi
John_Cena
Jojosiwa
Kakashi
Marilyn_monroe
Mickey_mouse
Minions
Naruto
Pikachu
Pokemon
Popeye
Power_rangers
R2-D2
Roman_reigns
Scoopy Doo
SpongeBob SquarePants
Squirtle
Teenage_mutant_ninja_turtles
Tom and Jerry
Toy_story_characters
Vampirina
Vegeta
Winnie the poo
Confusion Matrix
Pair of confused categories with minimum value of 2