Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityAIによる雇用破壊はまだ限定的——だが、従来の指標では本当の影響は見えないCIO MagazineWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityBitcoin enters the public bond market as Moody’s gives a first-of-its-kind crypto deal a ratingCoinDesk AIOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityAIによる雇用破壊はまだ限定的——だが、従来の指標では本当の影響は見えないCIO MagazineWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityBitcoin enters the public bond market as Moody’s gives a first-of-its-kind crypto deal a ratingCoinDesk AIOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AI

TimeSenCLIP: A Time Series Vision-Language Model for Remote Sensing

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2508.11919v3 Announce Type: replace Abstract: Vision-language models (VLMs) have shown significant promise in remote sensing applications, particularly for land-use and land-cover (LULC) mapping via zero-shot classification and retrieval. However, current approaches face several key challenges, such as the dependence on caption-based supervision, which is often not available or very limited in terms of the covered semantics, and the fact of being adapted from generic VLM architectures that are suitable for very high resolution images. Consequently, these models tend to prioritize spatial — Pallavi Jain, Diego Marcos, Dino Ienco, Roberto Interdonato, Tristan Berchoux

View PDF HTML (experimental)

Abstract:Vision-language models (VLMs) have shown significant promise in remote sensing applications, particularly for land-use and land-cover (LULC) mapping via zero-shot classification and retrieval. However, current approaches face several key challenges, such as the dependence on caption-based supervision, which is often not available or very limited in terms of the covered semantics, and the fact of being adapted from generic VLM architectures that are suitable for very high resolution images. Consequently, these models tend to prioritize spatial context over spectral and temporal information, limiting their effectiveness for medium-resolution remote sensing imagery. In this work, we present TimeSenCLIP, a lightweight VLM for remote sensing time series, using a cross-view temporal contrastive framework to align multispectral Sentinel-2 time series with geo-tagged ground-level imagery, without requiring textual annotations. Unlike prior VLMs, TimeSenCLIP emphasizes temporal and spectral signals over spatial context, investigating whether single-pixel time series contain sufficient information for solving a variety of tasks.

Comments: Accepted (ISPRS Journal of Photogrammetry and Remote Sensing)

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2508.11919 [cs.CV]

(or arXiv:2508.11919v3 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2508.11919

arXiv-issued DOI via DataCite

Submission history

From: Pallavi Jain [view email] [v1] Sat, 16 Aug 2025 05:44:33 UTC (5,353 KB) [v2] Thu, 18 Dec 2025 21:07:31 UTC (28,611 KB) [v3] Fri, 27 Mar 2026 10:59:31 UTC (26,776 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
TimeSenCLIP…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 125 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers