Project: Food Packaging Image Analysis

This project aims to analyze images of food packaging (e.g., snack bags, breakfast items, spaghetti boxes, salad bags, canned goods) and identify the following elements on these packages:

Text areas (e.g., ingredient lists)
Specific marking areas (barcodes, QR codes)
Graphic element areas (logos, characters, etc.)

The project uses a combination of image processing and machine learning techniques to achieve this.

Code Overview

The code is written in Python and uses several libraries including cv2, matplotlib, pytesseract, pandas, numpy, PIL, ultralytics and transformers.

The main components of the code are:

Import of required libraries: The necessary Python libraries are imported. Some libraries are installed using pip and apt-get.
Library: Two classes TextDetector and YoloDetector are defined. TextDetector is used to extract and highlight text from images. YoloDetector is used to train a YOLO model and make predictions.
Intersection over Union (IoU): A function bb_intersection_over_union is defined to calculate the IoU of two boxes. Another function eliminate_overlapping_boxes is used to eliminate boxes that overlap with boxes in another array.
Plotting: Functions plot_boxes and plot_img are defined to plot bounding boxes on images and display images.
Inference: A function inference is defined to perform inference on a list of images.
Training: A SAM model is loaded and a YOLO model is trained.
Performances on fine-tuning: The performance of the model is evaluated.
Prediction: Predictions are made using the trained YOLO model and the SAM model.
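
As an illustration of the Intersection over Union step listed above, here is a minimal sketch of what bb_intersection_over_union and eliminate_overlapping_boxes might look like. It is not copied from the notebook: the box format (x_min, y_min, x_max, y_max), the parameter names, and the iou_threshold default of 0.5 are assumptions made for this example.

```python
from typing import List, Sequence

# Assumed box format (not confirmed by the notebook): (x_min, y_min, x_max, y_max)
Box = Sequence[float]


def bb_intersection_over_union(box_a: Box, box_b: Box) -> float:
    """Return the IoU of two axis-aligned boxes, a value in [0, 1]."""
    # Corners of the intersection rectangle.
    x_left = max(box_a[0], box_b[0])
    y_top = max(box_a[1], box_b[1])
    x_right = min(box_a[2], box_b[2])
    y_bottom = min(box_a[3], box_b[3])

    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0.0, x_right - x_left) * max(0.0, y_bottom - y_top)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def eliminate_overlapping_boxes(
    boxes: List[Box],
    reference_boxes: List[Box],
    iou_threshold: float = 0.5,  # hypothetical default, chosen for this sketch
) -> List[Box]:
    """Keep only the boxes whose IoU with every reference box is below the threshold."""
    return [
        box
        for box in boxes
        if all(
            bb_intersection_over_union(box, ref) < iou_threshold
            for ref in reference_boxes
        )
    ]
```

For example, eliminate_overlapping_boxes(graphic_boxes, text_boxes) would drop any graphic box that overlaps a text box by at least the threshold, where graphic_boxes and text_boxes are hypothetical lists of boxes produced by the two detectors.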

Current State

The YOLO model achieves a mean average precision (mAP) of 0.6485 on the test set. I have also integrated a DETR model and trained it on the dataset.

Usage

To use this project, install the required Python libraries, then run the code in the vn.ipynb notebook, which includes sections for training the model and making predictions. The yolo_vs_detr.ipynb notebook is used to fine-tune the model and to compare the performance of the YOLO and DETR models.

Note

This project is a work in progress. Future updates may include improvements to the model and additional features.

The dataset used for training the model is not included in this repository. You can have a look at the dataset here.

Big thanks to LEMELIN William for providing the dataset.

Repository: amenalahassa/graphic-element-detection
