Research Papers research paper arxiv computer-vision image-recognition

TimeSenCLIP: A Time Series Vision-Language Model for Remote Sensing

arXivMarch 30, 202610 min read0 views

arXiv:2508.11919v3 Announce Type: replace Abstract: Vision-language models (VLMs) have shown significant promise in remote sensing applications, particularly for land-use and land-cover (LULC) mapping via zero-shot classification and retrieval. However, current approaches face several key challenges, such as the dependence on caption-based supervision, which is often not available or very limited in terms of the covered semantics, and the fact of being adapted from generic VLM architectures that are suitable for very high resolution images. Consequently, these models tend to prioritize spatial — Pallavi Jain, Diego Marcos, Dino Ienco, Roberto Interdonato, Tristan Berchoux

View PDF HTML (experimental)

Abstract:Vision-language models (VLMs) have shown significant promise in remote sensing applications, particularly for land-use and land-cover (LULC) mapping via zero-shot classification and retrieval. However, current approaches face several key challenges, such as the dependence on caption-based supervision, which is often not available or very limited in terms of the covered semantics, and the fact of being adapted from generic VLM architectures that are suitable for very high resolution images. Consequently, these models tend to prioritize spatial context over spectral and temporal information, limiting their effectiveness for medium-resolution remote sensing imagery. In this work, we present TimeSenCLIP, a lightweight VLM for remote sensing time series, using a cross-view temporal contrastive framework to align multispectral Sentinel-2 time series with geo-tagged ground-level imagery, without requiring textual annotations. Unlike prior VLMs, TimeSenCLIP emphasizes temporal and spectral signals over spatial context, investigating whether single-pixel time series contain sufficient information for solving a variety of tasks.

Comments: Accepted (ISPRS Journal of Photogrammetry and Remote Sensing)

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2508.11919 [cs.CV]

(or arXiv:2508.11919v3 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2508.11919

arXiv-issued DOI via DataCite

Submission history

From: Pallavi Jain [view email] [v1] Sat, 16 Aug 2025 05:44:33 UTC (5,353 KB) [v2] Thu, 18 Dec 2025 21:07:31 UTC (28,611 KB) [v3] Fri, 27 Mar 2026 10:59:31 UTC (26,776 KB)

Original source

arXiv

https://arxiv.org/abs/2508.11919

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsLive

What Karpathy's Autoresearch Unlocked for Me

I'm not a data scientist. I've trained a few models before — simple classification problems, with AI writing the Python and me running the iterations. It worked. I got confident. Then a friend asked for help with something harder. <h2> Three Weeks at 0.58 </h2> The problem involved predicting an outcome from a mix of CRM data and call recordings. Not trivial, but not exotic either. Quick primer on AUC — the metric I'll use throughout. Imagine your model looks at two random people: one where the answer is yes, one where it's no. AUC measures how often the model correctly ranks the yes above the no. Score of 0.5 means random guessing. Score of 1.0 means always right. I tried everything I knew: XGBoost, feature engineering, extracting features from transcripts u

DEV Community

3m31 minutes ago

Research PapersFresh

Is AI's visual understanding mostly a 'mirage'? New research suggests so - inkl

<a href="https://news.google.com/rss/articles/CBMimwFBVV95cUxNUjItTURkaldjcXhpNjA0b2dSVUhWNGRwaEkzclBMM0FFMk8wNTNsMWFDc1hmVU9jYU5jbzBKSG42WGZpbTI5TUs0R0w5Q2QxX0NCdjgtT2JtUUNsWHkxUFE1MzJjN1RxMmdmQjc5YlBsVTdfVU02Z1FZSlFrLU1hdmxibjBGcUV3STlvc0JnaG9PcXRDZVhQamdmYw?oc=5" target="_blank">Is AI's visual understanding mostly a 'mirage'? New research suggests so</a> inkl

GNews AI multimodal

1mabout 5 hours ago

Self-Evolving AI

Google DeepMind Introduces Aletheia: The AI Agent Moving from Math Competitions to Fully Autonomous Professional Research Discoveries - marktechpost.com

<a href="https://news.google.com/rss/articles/CBMigwJBVV95cUxPOWhkeUl4RkMxb1IzY1NKcUlVeFpYQ3NWc0ZLWVo2OWRESkdsTkZYYlpaamJ6WjE0Z3RaUkFJVEhIQnBFSHdjV2tEd3R1dFVGS28tYXlFNlZwZnVZSnV2TFlNNDFrNDFIdGJ6VzlwbENMZ2x3dHdFNXFWdzlWLWt2OEZQcW1WbTExSUdOVnNjbktiQURwQXRLUFZVeXp5WjZhbTY4dXhpdlphNWl2THNRNGxqbVcxcDlDbW5US3VBLUNvR1owSHNIUE5xMktmcVVDTjl4dXpOMmVPTUdQWkx0aF9yRXowU0NxQ2lHc0VMRzlaNDEyU0lLY0lSdWpLUndFUll30gGIAkFVX3lxTE0tR0JPRll6R09pU1d3NzVSQ0YwSVRJS1Q5YVF6THpfRUhEZ2EyVGJBNi1XX1ZOT19zZkU1WDlqelRzMm5NWjU3VTR2WC1LZ0drMTUyaHZVWFNLa1MwbEJ1OUZYM2R5cWFza0hJbllFSHhPc0tWYTNMbU1PMmw4T0RtLVpZLXBfbERKRUR0LTF6bl94S1FJRDBweVpxRGpTU242a25lMTVHdG5pTXBxUVgtaHp3MG9yX1NEd3p6Z0lKaWxLTVJGNWQxVkxVRFdZbERzSFBJUjUwQkRPNENYWVVrdlM4dTljODRxeWhDMDdLN1czT0tadUM1YmtCcENBcWU4NngwcUI3ZA?oc=5" target="_blank">Google DeepMind Intro

Google News: DeepMind

1m19 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 125 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

Is AI's visual understanding mostly a 'mirage'? New research suggests so - inkl

GNews AI multimodal

1mabout 5 hours ago

Research PapersLive

Google backs UH Mānoa AI, robotics research - University of Hawaii System

<a href="https://news.google.com/rss/articles/CBMickFVX3lxTE04TGNzcGpVeFNibkdwMzJIOFdrMHYtSS1OUDdFZkR6RFRtUU4yYWN1MlYtRGZQazl5Y1k3SklTYURwYVBycG5NZm5zbzNfRjY0SF9WNVJfM2tTU1BOX0xfb2ZtaWFVVFp3cFg3WXJmNnQwQQ?oc=5" target="_blank">Google backs UH Mānoa AI, robotics research</a> University of Hawaii System

GNews AI Google

1mabout 1 hour ago

Research PapersFresh

A Retrospective on the ICLR 2026 Review Process

The selection of papers for ICLR 2026 has fully concluded. We extend our congratulations to the authors whose work will appear at the conference. Creating ICLR’s technical program requires immense effort from the authors, reviewers, and area chairs, and we thank you for your contributions and service. For researchers whose work was rejected, we hope […]

blog.iclr.cc

1mabout 2 hours ago

Research Papers

Vector Researchers present papers at ACL 2024

Vector researchers will be well represented at the 62nd Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand this year. 14 papers co-authored by Vector-affiliated researchers are being […] The post Vector Researchers present papers at ACL 2024 appeared first on Vector Institute for Artificial Intelligence .

Vector Institute

1mover 1 year ago