Rüegg, Thomas and Wissjak, Patrick (2024) Development of a Deep Learning Model for Object Detection. Other thesis, OST Ostschweizer Fachhochschule.
Full text not available from this repository.Abstract
Autonomous cleaning robots are increasingly vital in keeping large-scale environments like warehouses and factories clean. This project presents the development and evaluation of a machine learning-based object detection system for KEMARO’s industrial cleaning robots. The primary objective was to address the robot’s current limitations in detecting small, valuable, or dangerous objects, such as keys or cables, which are overlooked by the current computer vision algorithms. The project explored state-of-the-art sensor technologies, including depth cameras, alongside advanced deep learning models such as YOLO and FastSAM.
High-quality real-world data was recorded using the Intel RealSense D435 with various configurations, including color recording and different resolutions, while maintaining the same approximate height and tilt to the camera placement of KEMARO’s K900 cleaning robot.
A key innovation of the study was the adaptation of YOLO for RGB-D input, allowing depth information to complement color image data. Data scarcity was addressed through a combination of real-world data collection and synthetic dataset generation using Blender. Various training configurations were tested to evaluate the generalization of the object detection models in real-world scenarios.
The results demonstrate that synthetic data can improve training outcomes when combined with real-world data, but challenges remain in model stability and the quality of depth information. Feeding RGB instead of grayscale images to the model achieved better results in object detection. While incorporating depth (RGB-D) increased model object detection performance when using perfect depth-maps, it proofed to be disadvantageous with imperfect and noisy real-world depth maps.
A roadmap for deep learning based object detection was also developed for KEMARO, providing recommendations for data governance, sensor improvements, and iterative hardware upgrades.
Item Type: | Thesis (Other) |
---|---|
Subjects: | Area of Application > Business oriented Area of Application > Image/Video Processing Technologies > Programming Languages > Python |
Divisions: | Bachelor of Science FHO in Informatik > Student Research Project |
Depositing User: | OST Deposit User |
Contributors: | Contribution Name Email Thesis advisor Lehmann, Marco UNSPECIFIED |
Date Deposited: | 18 Feb 2025 12:29 |
Last Modified: | 18 Feb 2025 12:29 |
URI: | https://eprints.ost.ch/id/eprint/1253 |