Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI KoreaI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityStudent hui speaks on AI in education — and how to handle it - Hawaii Public RadioGNews AI educationBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI KoreaI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityStudent hui speaks on AI in education — and how to handle it - Hawaii Public RadioGNews AI educationBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV Community

iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2407.07603v3 Announce Type: replace Abstract: The recent emergence of hybrid models has introduced a transformative approach to computer vision, gradually moving beyond conventional convolutional neural networks and vision transformers. However, efficiently combining these two approaches to better capture long-range dependencies in complex images remains a challenge. In this paper, we present iiANET (Inception Inspired Attention Network), an efficient hybrid visual backbone designed to improve the modeling of long-range dependencies in complex visual recognition tasks. The core innovatio — Haruna Yunusa, Adamu Lawan, Abdulganiyu Abdu Yusuf

View PDF HTML (experimental)

Abstract:The recent emergence of hybrid models has introduced a transformative approach to computer vision, gradually moving beyond conventional convolutional neural networks and vision transformers. However, efficiently combining these two approaches to better capture long-range dependencies in complex images remains a challenge. In this paper, we present iiANET (Inception Inspired Attention Network), an efficient hybrid visual backbone designed to improve the modeling of long-range dependencies in complex visual recognition tasks. The core innovation of iiANET is the iiABlock, a unified building block that integrates a modified global r-MHSA (Multi-Head Self-Attention) and convolutional layers in parallel. This design enables iiABlock to simultaneously capture global context and local details, making it effective for extracting rich and diverse features. By efficiently fusing these complementary representations, iiABlock allows iiANET to achieve strong feature interaction while maintaining computational efficiency. Extensive qualitative and quantitative evaluations on some SOTA benchmarks demonstrate improved performance.

Comments: 17 pages, 7 figures. Published in Transactions on Machine Learning Research (TMLR). Available at this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2407.07603 [cs.CV]

(or arXiv:2407.07603v3 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2407.07603

arXiv-issued DOI via DataCite

Journal reference: Transactions on Machine Learning Research (12/2025)

Submission history

From: Yunusa Haruna [view email] [v1] Wed, 10 Jul 2024 12:39:02 UTC (11,616 KB) [v2] Sat, 12 Apr 2025 11:32:38 UTC (1,954 KB) [v3] Sun, 29 Mar 2026 17:57:18 UTC (30,560 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
iiANET: Inc…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 72 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers