Skip to main navigation Skip to search Skip to main content

Hyper-Transforming Latent Diffusion Models

  • Zuse School ELIZA
  • Saarland University

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

We introduce a novel generative framework for functions by integrating Implicit Neural Representations (INRs) and Transformer-based hypernetworks into latent variable models. Unlike prior approaches that rely on MLP-based hypernetworks with scalability limitations, our method employs a Transformer-based decoder to generate INR parameters from latent variables, addressing both representation capacity and computational efficiency. Our framework extends latent diffusion models (LDMs) to INR generation by replacing standard decoders with a Transformer-based hypernetwork, which can be trained either from scratch or via hyper-transforming—a strategy that fine-tunes only the decoder while freezing the pre-trained latent space. This enables efficient adaptation of existing generative models to INR-based representations without requiring full retraining. We validate our approach across multiple modalities, demonstrating improved scalability, expressiveness, and generalization over existing INR-based generative models. Our findings establish a unified and flexible framework for learning structured function representations.
Original languageEnglish
Title of host publicationProceedings of the 42nd International Conference on Machine Learning
Volume267
PublisherProceedings of Machine Learning Research
Publication date2025
Pages48714-48733
Publication statusPublished - 2025
Event42nd International Conference on Machine Learning - Vancouver, Canada
Duration: 13 Jul 202519 Jul 2025

Conference

Conference42nd International Conference on Machine Learning
Country/TerritoryCanada
CityVancouver
Period13/07/202519/07/2025
SeriesProceedings of Machine Learning Research

Fingerprint

Dive into the research topics of 'Hyper-Transforming Latent Diffusion Models'. Together they form a unique fingerprint.

Cite this