Abstract
We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image, under the assumption that roof geometry follows a set of regular patterns. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions. The approach employs the PolyGen, a deep generative transformer architecture for 3D meshes. We apply this model in a new domain and investigate the sensitivity of the image resolution. We propose a novel metric to evaluate the performance of the inferred meshes, and our results show that the model is robust even at lower resolutions, while qualitatively producing realistic representations for out-of-distribution samples.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 2023 |
ISBN (Print) | 978-1-7281-6328-4 |
ISBN (Electronic) | 978-1-7281-6327-7 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing - Rhodes Island, Greece Duration: 4 Jun 2023 → 10 Jun 2023 |
Conference
Conference | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing |
---|---|
Country/Territory | Greece |
City | Rhodes Island |
Period | 04/06/2023 → 10/06/2023 |
Keywords
- 3D Reconstruction
- Aerial imagery
- Generative models
- Remote sensing
- Buildings