Towards Automated Construction: Visual-based Pose Reconstruction for Tower Crane Operations using Differentiable Rendering and Network-based Image Segmentation
This study focuses on visual-based pose reconstruction, aimed at automating construction sites. It addresses the pivotal challenge of camera-based pose reconstruction, crucial for robotic operations, as discussed in this work for tracking objects maneuvered by tower cranes. Central to this research is the formulation of a gradient-based optimization problem, with the objective of enhancing the alignment between synthetic model renderings and actual image captures through the use of differentiable rendering. Additionally, the study presents the design of a neural network tailored for image segmentation, intending to simplify the network architecture to reduce latency times and meet real-time operational demands. Although the entire procedure is motivated and discussed for the application of object tracking of a load by a tower crane, the presented framework is extendable, without loss of generality, to all camera-based object pose reconstructions, where a predefined geometric model of the target object is available. The adaptability and innovative applications of the framework highlight its significant contributions to advancing robotic vision and automation within the construction industry and beyond.
%0 Conference Paper
%1 10595817
%A Schüle, Johannes
%A Burkhardt, Mark
%A Gienger, Andreas
%A Sawodny, Oliver
%B 2024 IEEE 33rd International Symposium on Industrial Electronics (ISIE)
%D 2024
%K peer rp8
%P 1-7
%R 10.1109/ISIE54533.2024.10595817
%T Towards Automated Construction: Visual-based Pose Reconstruction for Tower Crane Operations using Differentiable Rendering and Network-based Image Segmentation
%X This study focuses on visual-based pose reconstruction, aimed at automating construction sites. It addresses the pivotal challenge of camera-based pose reconstruction, crucial for robotic operations, as discussed in this work for tracking objects maneuvered by tower cranes. Central to this research is the formulation of a gradient-based optimization problem, with the objective of enhancing the alignment between synthetic model renderings and actual image captures through the use of differentiable rendering. Additionally, the study presents the design of a neural network tailored for image segmentation, intending to simplify the network architecture to reduce latency times and meet real-time operational demands. Although the entire procedure is motivated and discussed for the application of object tracking of a load by a tower crane, the presented framework is extendable, without loss of generality, to all camera-based object pose reconstructions, where a predefined geometric model of the target object is available. The adaptability and innovative applications of the framework highlight its significant contributions to advancing robotic vision and automation within the construction industry and beyond.
@inproceedings{10595817,
abstract = {This study focuses on visual-based pose reconstruction, aimed at automating construction sites. It addresses the pivotal challenge of camera-based pose reconstruction, crucial for robotic operations, as discussed in this work for tracking objects maneuvered by tower cranes. Central to this research is the formulation of a gradient-based optimization problem, with the objective of enhancing the alignment between synthetic model renderings and actual image captures through the use of differentiable rendering. Additionally, the study presents the design of a neural network tailored for image segmentation, intending to simplify the network architecture to reduce latency times and meet real-time operational demands. Although the entire procedure is motivated and discussed for the application of object tracking of a load by a tower crane, the presented framework is extendable, without loss of generality, to all camera-based object pose reconstructions, where a predefined geometric model of the target object is available. The adaptability and innovative applications of the framework highlight its significant contributions to advancing robotic vision and automation within the construction industry and beyond.},
added-at = {2024-10-30T12:04:16.000+0100},
author = {Schüle, Johannes and Burkhardt, Mark and Gienger, Andreas and Sawodny, Oliver},
biburl = {https://puma.ub.uni-stuttgart.de/bibtex/2872dd59a4d7ef182776290e07b10d8f0/intcdc},
booktitle = {2024 IEEE 33rd International Symposium on Industrial Electronics (ISIE)},
doi = {10.1109/ISIE54533.2024.10595817},
interhash = {e2fc435823ae0d9da33982bd655abbf8},
intrahash = {872dd59a4d7ef182776290e07b10d8f0},
issn = {2163-5145},
keywords = {peer rp8},
month = {June},
pages = {1-7},
timestamp = {2024-10-30T12:04:16.000+0100},
title = {Towards Automated Construction: Visual-based Pose Reconstruction for Tower Crane Operations using Differentiable Rendering and Network-based Image Segmentation},
year = 2024
}