Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessAI Agents Are Coming for Your Waiting Room. That’s Just the Start. - CDOTrendsGNews AI agenticAI Has Already Decided: First-Party Data Will Define Advertising’s Agentic Era - AdExchangerGNews AI agenticDefending Habit Streakslesswrong.comChinese-made drone shot in Iran, Australia’s diplomacy pivot: 5 weekend reads you missedSCMP Tech (Asia AI)How NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIEstimates of the expected utility gain of AI Safety Researchlesswrong.comMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessAI Agents Are Coming for Your Waiting Room. That’s Just the Start. - CDOTrendsGNews AI agenticAI Has Already Decided: First-Party Data Will Define Advertising’s Agentic Era - AdExchangerGNews AI agenticDefending Habit Streakslesswrong.comChinese-made drone shot in Iran, Australia’s diplomacy pivot: 5 weekend reads you missedSCMP Tech (Asia AI)How NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIEstimates of the expected utility gain of AI Safety Researchlesswrong.comMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AI
AI NEWS HUBbyEIGENVECTOREigenvector

Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing

arXiv eess.IVby [Submitted on 29 Jan 2024]April 3, 20261 min read1 views
Source Quiz

arXiv:2401.15855v1 Announce Type: cross Abstract: Remote sensing images present unique challenges to image analysis due to the extensive geographic coverage, hardware limitations, and misaligned multi-scale images. This paper revisits the classical multi-scale representation learning problem but under the general framework of self-supervised learning for remote sensing image understanding. We present Cross-Scale MAE, a self-supervised model built upon the Masked Auto-Encoder (MAE).During pre-training, Cross-Scale MAE employs scale augmentation techniques and enforces cross-scale consistency constraints through both contrastive and generative losses to ensure consistent and meaningful representations well-suited for a wide range of downstream tasks. Further, our implementation leverages the

View PDF

Abstract:Remote sensing images present unique challenges to image analysis due to the extensive geographic coverage, hardware limitations, and misaligned multi-scale images. This paper revisits the classical multi-scale representation learning problem but under the general framework of self-supervised learning for remote sensing image understanding. We present Cross-Scale MAE, a self-supervised model built upon the Masked Auto-Encoder (MAE).During pre-training, Cross-Scale MAE employs scale augmentation techniques and enforces cross-scale consistency constraints through both contrastive and generative losses to ensure consistent and meaningful representations well-suited for a wide range of downstream tasks. Further, our implementation leverages the xFormers library to accelerate network pre-training on a single GPU while maintaining the quality of learned representations. Experimental evaluations demonstrate that Cross-Scale MAE exhibits superior performance compared to standard MAE and other state-of-the-art remote sensing MAE methods.

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Cite as: arXiv:2401.15855 [cs.CV]

(or arXiv:2401.15855v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2401.15855

arXiv-issued DOI via DataCite

Submission history

From: Maofeng Tang [view email] [v1] Mon, 29 Jan 2024 03:06:19 UTC (950 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modeltrainingannounce

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Cross-Scale…modeltrainingannouncevaluationanalysispaperarXiv eess.…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 251 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models