Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessApple turns 50: 8 of the company’s biggest tech milestonesSilicon RepublicI Built an AI Agent That Can Write Its Own Tools When It Gets StuckDEV CommunityBuilding a "Soft Sensor" for Cement Kilns: Predicting Control Levers with PythonDEV CommunityWe Traced One Query Through Perplexity’s Entire Stack in Cohort – Here’s What Actually Happens in 3 SecondsDEV CommunityAgent Self-Discovery: How AI Agents Find Their Own WalletsDEV CommunityYour content pipeline is lying to you, and in regulated software, that's a serious problemDEV CommunityDiffusion-based AI model successfully trained in electroplatingPhys.org AIClaude Code hooks: how to intercept every tool call before it runsDEV CommunityHow I built a browser-based video editor with FFmpeg.wasm (no backend, no server costs)DEV CommunityWhy We Built an API for Spanish Fiscal ID Validation Instead of Just Implementing ItDEV CommunityA technical deep-dive into building APEX: an autonomous AI operations system on OpenClawDEV CommunityBest Amazon Spring Sale laptop deals 2026ZDNet AIBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessApple turns 50: 8 of the company’s biggest tech milestonesSilicon RepublicI Built an AI Agent That Can Write Its Own Tools When It Gets StuckDEV CommunityBuilding a "Soft Sensor" for Cement Kilns: Predicting Control Levers with PythonDEV CommunityWe Traced One Query Through Perplexity’s Entire Stack in Cohort – Here’s What Actually Happens in 3 SecondsDEV CommunityAgent Self-Discovery: How AI Agents Find Their Own WalletsDEV CommunityYour content pipeline is lying to you, and in regulated software, that's a serious problemDEV CommunityDiffusion-based AI model successfully trained in electroplatingPhys.org AIClaude Code hooks: how to intercept every tool call before it runsDEV CommunityHow I built a browser-based video editor with FFmpeg.wasm (no backend, no server costs)DEV CommunityWhy We Built an API for Spanish Fiscal ID Validation Instead of Just Implementing ItDEV CommunityA technical deep-dive into building APEX: an autonomous AI operations system on OpenClawDEV CommunityBest Amazon Spring Sale laptop deals 2026ZDNet AI

Beyond Descriptions: A Generative Scene2Audio Framework for Blind and Low-Vision Users to Experience Vista Landscapes

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27295v1 Announce Type: cross Abstract: Current scene perception tools for Blind and Low Vision (BLV) individuals rely on spoken descriptions but lack engaging representations of visually pleasing distant environmental landscapes (Vista spaces). Our proposed Scene2Audio framework generates comprehensible and enjoyable nonverbal audio using generative models informed by psychoacoustics, and principles of scene audio composition. Through a user study with 11 BLV participants, we found that combining the Scene2Audio sounds with speech creates a better experience than speech alone, as th — Chitralekha Gupta, Jing Peng, Ashwin Ram, Shreyas Sridhar, Christophe Jouffrais, Suranga Nanayakkara

View PDF HTML (experimental)

Abstract:Current scene perception tools for Blind and Low Vision (BLV) individuals rely on spoken descriptions but lack engaging representations of visually pleasing distant environmental landscapes (Vista spaces). Our proposed Scene2Audio framework generates comprehensible and enjoyable nonverbal audio using generative models informed by psychoacoustics, and principles of scene audio composition. Through a user study with 11 BLV participants, we found that combining the Scene2Audio sounds with speech creates a better experience than speech alone, as the sound effects complement the speech making the scene easier to imagine. A mobile app "in-the-wild" study with 7 BLV users for more than a week further showed the potential of Scene2Audio in enhancing outdoor scene experiences. Our work bridges the gap between visual and auditory scene perception by moving beyond purely descriptive aids, addressing the aesthetic needs of BLV users.

Comments: Accepted in CHI 2026

Subjects:

Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27295 [cs.HC]

(or arXiv:2603.27295v1 [cs.HC] for this version)

https://doi.org/10.48550/arXiv.2603.27295

arXiv-issued DOI via DataCite (pending registration)

Related DOI:

https://doi.org/10.1145/3772318.3791655

DOI(s) linking to related resources

Submission history

From: Chitralekha Gupta [view email] [v1] Sat, 28 Mar 2026 14:57:44 UTC (18,539 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Beyond Desc…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 154 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers