Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AIHow to Make Money with AI in 2026 (Even If You’re Starting from Zero)Medium AIYour Company Is Spending on AI. The Numbers Are Not Adding Up. Here Is What Is Actually Happening.Medium AIIn the AI Era, Just Get FitMedium AIMy Salary Doubled After I Added These 4 Skills to My Resume — All Free to LearnMedium AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AIHow to Make Money with AI in 2026 (Even If You’re Starting from Zero)Medium AIYour Company Is Spending on AI. The Numbers Are Not Adding Up. Here Is What Is Actually Happening.Medium AIIn the AI Era, Just Get FitMedium AIMy Salary Doubled After I Added These 4 Skills to My Resume — All Free to LearnMedium AI
AI NEWS HUBbyEIGENVECTOREigenvector

Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2411.15702v5 Announce Type: replace-cross Abstract: Interactive computer vision (CV) plays a crucial role in various real-world applications, whose performance is highly dependent on communication networks. Nonetheless, the data-oriented characteristics of conventional communications often do not align with the special needs of interactive CV tasks. To alleviate this issue, the recently emerged semantic communications only transmit task-related semantic information and exhibit a promising landscape to address this problem. However, the communication challenges associated with Semantic Fa — Bin Chen, Wenbo Yu, Qinshan Zhang, Tianqu Zhuang, Hao Wu, Yong Jiang, Shu-Tao Xia

View PDF HTML (experimental)

Abstract:Interactive computer vision (CV) plays a crucial role in various real-world applications, whose performance is highly dependent on communication networks. Nonetheless, the data-oriented characteristics of conventional communications often do not align with the special needs of interactive CV tasks. To alleviate this issue, the recently emerged semantic communications only transmit task-related semantic information and exhibit a promising landscape to address this problem. However, the communication challenges associated with Semantic Facial Editing, one of the most important interactive CV applications on social media, still remain largely unexplored. In this paper, we fill this gap by proposing Editable-DeepSC, a novel cross-modal semantic communication approach for facial editing. Firstly, we theoretically discuss different transmission schemes that separately handle communications and editings, and emphasize the necessity of Joint Editing-Channel Coding (JECC) via iterative attributes matching, which integrates editings into the communication chain to preserve more semantic mutual information. To compactly represent the high-dimensional data, we leverage inversion methods via pre-trained StyleGAN priors for semantic coding. To tackle the dynamic channel noise conditions, we propose SNR-aware channel coding via model fine-tuning. Extensive experiments indicate that Editable-DeepSC can achieve superior editings while significantly saving the transmission bandwidth, even under high-resolution and out-of-distribution (OOD) settings.

Subjects:

Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)

Cite as: arXiv:2411.15702 [cs.IT]

(or arXiv:2411.15702v5 [cs.IT] for this version)

https://doi.org/10.48550/arXiv.2411.15702

arXiv-issued DOI via DataCite

Submission history

From: Wenbo Yu [view email] [v1] Sun, 24 Nov 2024 04:07:33 UTC (5,860 KB) [v2] Tue, 6 May 2025 16:30:58 UTC (5,968 KB) [v3] Mon, 13 Oct 2025 13:17:15 UTC (6,062 KB) [v4] Sat, 21 Feb 2026 15:49:46 UTC (6,113 KB) [v5] Fri, 27 Mar 2026 17:19:35 UTC (6,114 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Editable-De…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 186 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!