Grocery Detection Dataset - Model Overview

Download Dataset

You can find the dataset here: Download Dataset

1. Dataset Overview

The Grocery Detection Dataset provides comprehensive insights into grocery item recognition for machine learning applications.

Total Dataset Size

24,938 unique images

Annotation Format

Annotated in YOLOv11 format

Pre-processing Applied

  • Auto-orientation of pixel data (with EXIF-orientation stripping)
  • Resize to 640x640 (Stretch)

Data Augmentation

  • 50% probability of horizontal flip
  • 50% probability of vertical flip
  • Equal probability of one of the following 90-degree rotations: none, clockwise, counter-clockwise, upside-down
  • Random rotation of between -45 and +45 degrees
  • Random shear of between -45° to +45° horizontally and -45° to +45° vertically
  • Random brightness adjustment of between -48% and +48%
  • Random exposure adjustment of between -12% and +12%

2. Dataset Distribution Visualization

Dataset Distribution by Set Type

Sample Count Across Dataset Splits

3. Data Split Analysis

Comprehensive breakdown of dataset distribution for model training:

Set Type Sample Count Percentage Metadata Representation
Training Set 17,400 70% Primary model training images
Validation Set 4,987 20% Model performance validation images
Test Set 2,551 10% Final model evaluation images

4. Dataset Collection Challenges

Key challenges encountered during dataset collection:

  • Varied Product Formats: Diverse grocery packaging
  • Labeling Variability: Inconsistent product descriptions and categorizations
  • Complex Backgrounds: Images captured in varied lighting and environments
  • Data Augmentation Necessity: Enhancing model robustness through augmentation

5. Conclusion

The Grocery Detection Dataset offers a robust resource for training models to identify grocery items in various conditions, facilitating advancements in automated product recognition.