Ítem
|
Horváth, Dániel
García, Fernando |
|
| Yavuz, Selin | |
| maig 2025 | |
|
Urban object detection remains an important challenge for autonomous naviga tion, particularly in accurately identifying and locating thin and small structures
such as poles. These structures are often difficult to detect due to their size, shape,
and integration into complex urban scenes.
In response to these challenges, this thesis proposes a dual-stage approach to
urban object detection by integrating 2D deep learning-based object detection into
3D spatial representation. The methodology consists of two main stages: (1) 2D
object detection, which involves fine-tuning a pretrained model to identify pole-like
objects in urban environments, and (2) 3D object detection, where the 2D detections
are projected into 3D space and further processed through point cloud clustering
and outlier rejection to generate accurate 3D bounding boxes.
The approach shows how combining segmentation-derived labels with geometric
modeling can serve as an alternative to fully supervised 3D object inference. The
model demonstrates promising performance on a custom dataset, benefiting from
the SAHI framework, which enhances the detection of smaller or more challenging to-see objects. These results indicate that integrating 2D and 3D detection methods
can provide valuable spatial information, even when detailed 3D annotations are
limited or unavailable, supporting practical urban navigation applications.
This thesis presents a foundational pipeline for pole detection in urban naviga tion, laying the groundwork for more adaptable and robust autonomous systems in
complex real-world environments. Further development, including dataset growth
and model refinement, can enhance detection accuracy and robustness, thereby ad vancing the effectiveness of autonomous systems in urban environments. 9 |
|
| application/pdf | |
| http://hdl.handle.net/10256/28372 | |
| eng | |
| Universitat de Girona. Institut de Recerca en Visió per Computador i Robòtica | |
| Attribution-NonCommercial-NoDerivatives 4.0 International | |
| http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
|
Aprenentatge profund (Aprenentatge automàtic)
Deep learning (Machine learning) Sensors òptics tridimensionals Sensors Urban object detection Vehicles autònoms -- Sistemes de navegació Autonomous Vehicles -- Navigation systems SAHI Computer vision Visió per ordinador |
|
| Urban object detection using sensor fusion for autonomous navigation | |
| info:eu-repo/semantics/masterThesis | |
| DUGiDocs |
