Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessBoston Becomes First Major District to Bring AI Literacy Into Classrooms - GoverningGoogle News: AIHow payment fraud evolved from ancient Roman coins to AI-deepfakes — and what's next - The Business JournalsGNews AI deepfakeOracle Lays Off Thousands to Offset AI SpendingGizmodoFranklin Templeton agrees to acquire CoinFund spinoff 250 Digital to form Franklin Crypto, which will offer strategies designed for institutional investors (Vicky Ge Huang/Wall Street Journal)TechmemeDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogUMW Inaugural AI Expert-in-Residence Shares Insight on Technology’s ‘Tremendous’ Impact - University of Mary WashingtonGoogle News: AISpaceX Said to File Confidentially for IPO Before AI RivalsBloomberg TechnologyCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIWhen machines judge without knowing: AI, augmentation and the limits of automated cybersecurity decisions - IAPPGNews AI cybersecurityMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI SafetyBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessBoston Becomes First Major District to Bring AI Literacy Into Classrooms - GoverningGoogle News: AIHow payment fraud evolved from ancient Roman coins to AI-deepfakes — and what's next - The Business JournalsGNews AI deepfakeOracle Lays Off Thousands to Offset AI SpendingGizmodoFranklin Templeton agrees to acquire CoinFund spinoff 250 Digital to form Franklin Crypto, which will offer strategies designed for institutional investors (Vicky Ge Huang/Wall Street Journal)TechmemeDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogUMW Inaugural AI Expert-in-Residence Shares Insight on Technology’s ‘Tremendous’ Impact - University of Mary WashingtonGoogle News: AISpaceX Said to File Confidentially for IPO Before AI RivalsBloomberg TechnologyCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIWhen machines judge without knowing: AI, augmentation and the limits of automated cybersecurity decisions - IAPPGNews AI cybersecurityMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI Safety

GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2603.26661v1 Announce Type: new Abstract: Most recent advances in 3D generative modeling rely on diffusion or flow-matching formulations. We instead explore a fully autoregressive alternative and introduce GaussianGPT, a transformer-based model that directly generates 3D Gaussians via next-token prediction, thus facilitating full 3D scene generation. We first compress Gaussian primitives into a discrete latent grid using a sparse 3D convolutional autoencoder with vector quantization. The resulting tokens are serialized and modeled using a causal transformer with 3D rotary positional embe — Nicolas von L\"utzow, Barbara R\"ossle, Katharina Schmid, Matthias Nie{\ss}ner

View PDF HTML (experimental)

Abstract:Most recent advances in 3D generative modeling rely on diffusion or flow-matching formulations. We instead explore a fully autoregressive alternative and introduce GaussianGPT, a transformer-based model that directly generates 3D Gaussians via next-token prediction, thus facilitating full 3D scene generation. We first compress Gaussian primitives into a discrete latent grid using a sparse 3D convolutional autoencoder with vector quantization. The resulting tokens are serialized and modeled using a causal transformer with 3D rotary positional embedding, enabling sequential generation of spatial structure and appearance. Unlike diffusion-based methods that refine scenes holistically, our formulation constructs scenes step-by-step, naturally supporting completion, outpainting, controllable sampling via temperature, and flexible generation horizons. This formulation leverages the compositional inductive biases and scalability of autoregressive modeling while operating on explicit representations compatible with modern neural rendering pipelines, positioning autoregressive transformers as a complementary paradigm for controllable and context-aware 3D generation.

Comments: Project page: this https URL - Project video: this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26661 [cs.CV]

(or arXiv:2603.26661v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26661

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Nicolas Von Lützow [view email] [v1] Fri, 27 Mar 2026 17:58:05 UTC (15,893 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
GaussianGPT…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 89 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers