Learning To Generate 3d Representations of Building Roofs Using Single-View Aerial Imagery

Maxim Khomiakov, Alejandro Valverde Mahou, Alba Reinders Sánchez, Jes Frellsen, Michael Riis Andersen

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

62 Downloads (Pure)

Abstract

We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image, under the assumption that roof geometry follows a set of regular patterns. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions. The approach employs the PolyGen, a deep generative transformer architecture for 3D meshes. We apply this model in a new domain and investigate the sensitivity of the image resolution. We propose a novel metric to evaluate the performance of the inferred meshes, and our results show that the model is robust even at lower resolutions, while qualitatively producing realistic representations for out-of-distribution samples.
Original languageEnglish
Title of host publicationProceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
Number of pages5
PublisherIEEE
Publication date2023
ISBN (Print)978-1-7281-6328-4
ISBN (Electronic)978-1-7281-6327-7
DOIs
Publication statusPublished - 2023
Event2023 IEEE International Conference on Acoustics, Speech and Signal Processing - Rhodes Island, Greece
Duration: 4 Jun 202310 Jun 2023

Conference

Conference2023 IEEE International Conference on Acoustics, Speech and Signal Processing
Country/TerritoryGreece
CityRhodes Island
Period04/06/202310/06/2023

Keywords

  • 3D Reconstruction
  • Aerial imagery
  • Generative models
  • Remote sensing
  • Buildings

Fingerprint

Dive into the research topics of 'Learning To Generate 3d Representations of Building Roofs Using Single-View Aerial Imagery'. Together they form a unique fingerprint.

Cite this