Joos, Andri (2024) Training a simulated drone with deep reinforcement learning. Other thesis, OST - Ostschweizer Fachhochschule.
HS 2023 2024-SA-EP-Joos-Reinforcement Learning training a fpv drone in a simulated.pdf - Supplemental Material
Download (1MB)
Abstract
This report details the comprehensive planning, execution, and evaluation of a solodeveloped project in the field of artificial intelligence (AI) and software engineering.
The project, focused on implementing a drone capable of autonomous navigation and delivery, follows the Agile Scrum methodology for project management. The report covers various aspects, including risk management, iterative development processes, and the application of the Soft Actor-Critic (SAC) algorithm for AI training.
A key aspect of the report is the detailed description of the SAC algorithm, which outlines the mathematical principles of this algorithm in an understandable way.
The development lifecycle is outlined through a series of sprints, each encompassing planning, review, and retrospective meetings. The report underscores the importance of risk management, categorizing and addressing potential challenges throughout the project.
Detailed insights into the project’s structure, from backlogs to sprints, provide a transparent view of the iterative development process. The challenges faced, such as lack of experience with AI and Unreal Engine, are mitigated through proactive measures, including research and adaptation.
The long-term plan, presented in a Gantt chart, highlights key milestones, phases, and features. Primary and secondary features, along with epics and milestones, are meticulously defined, allowing for a clear understanding of project progress.
The report concludes with a thorough retrospective, highlighting successes, areas for improvement, and personal reflections. Key achievements include overcoming the hyperparameter problem and implementing the SAC algorithm to control the drone.
Item Type: | Thesis (Other) |
---|---|
Subjects: | Topics > Software > Testing and Simulation Area of Application > Academic and Education Technologies > Programming Languages > Python |
Divisions: | Bachelor of Science FHO in Informatik > Student Research Project |
Depositing User: | OST Deposit User |
Contributors: | Contribution Name Email Thesis advisor Lehmann, Marco UNSPECIFIED |
Date Deposited: | 04 Mar 2024 08:56 |
Last Modified: | 04 Mar 2024 08:56 |
URI: | https://eprints.ost.ch/id/eprint/1177 |