Research Papers research paper arxiv machine-learning deep-learning

Towards Knowledge Guided Pretraining Approaches for Multimodal Foundation Models: Applications in Remote Sensing

arXivMarch 30, 202610 min read0 views

arXiv:2407.19660v5 Announce Type: replace-cross Abstract: Self-supervised learning has emerged as a powerful paradigm for pretraining foundation models using large-scale data. Existing pretraining approaches predominantly rely on masked reconstruction or next-token prediction strategies, demonstrating strong performance across various downstream tasks, including geoscience applications. However, these approaches do not fully capture the knowledge of causal interplay between different geospatial and environmental variables. To address this limitation, we propose Knowledge Guided Variable-Step F — Praveen Ravirathinam, Ajitesh Parthasarathy, Ankush Khandelwal, Rahul Ghosh, Vipin Kumar

View PDF HTML (experimental)

Abstract:Self-supervised learning has emerged as a powerful paradigm for pretraining foundation models using large-scale data. Existing pretraining approaches predominantly rely on masked reconstruction or next-token prediction strategies, demonstrating strong performance across various downstream tasks, including geoscience applications. However, these approaches do not fully capture the knowledge of causal interplay between different geospatial and environmental variables. To address this limitation, we propose Knowledge Guided Variable-Step Forecasting (KG-VSF), a novel pretraining task that models forecasting as a conditional generation task, where driver variables (e.g., weather) inform the prediction of response variables (e.g., satellite imagery). We demonstrate that pretraining in such a fashion leads to strong embeddings which give enhanced performance when finetuned on downstream tasks where capturing this causality matters such as pixel wise crop type mapping, soil moisture estimation and forecasting, missing image prediction, and future image forecasting when compared to finetuning embeddings from other standard pretraining approaches.

Comments: 33 pages with appendix

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Cite as: arXiv:2407.19660 [cs.CV]

(or arXiv:2407.19660v5 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2407.19660

arXiv-issued DOI via DataCite

Submission history

From: Praveen Ravirathinam [view email] [v1] Mon, 29 Jul 2024 02:49:55 UTC (1,399 KB) [v2] Wed, 16 Oct 2024 21:18:10 UTC (3,245 KB) [v3] Tue, 18 Feb 2025 03:39:37 UTC (15,914 KB) [v4] Fri, 2 Jan 2026 09:15:07 UTC (4,709 KB) [v5] Fri, 27 Mar 2026 01:24:51 UTC (4,735 KB)

Original source

arXiv

https://arxiv.org/abs/2407.19660

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research PapersFresh

UVA researchers use AI to speed up drug development - WVIR

<a href="https://news.google.com/rss/articles/CBMiiwFBVV95cUxOaHF0M0pSdUx0OUp4UHd4a0VnVllZVWtFZ0F6U0I2azlPejJLLTduTmdtZWFCYWhLRWRQSjRXTkxaWlJiV1ozc1JERnFqemtLczJmOEh3d0luZTlNdFNNcDlRdjdobU50RDd0Tk5NRkdqSU5HbGo0RVEzSTdoVThFeWxhRHFzUWpaX3FF0gGfAUFVX3lxTE1jbWpYOWZEWGtJd25vRGg0Nll3VFRzNGdoT01YYmt4YVZ1RHV5dVB3TVN0UVdGVDNHbDFKZnBlODlyQkZSWFFjZ2NDRWVvS05kXzJPOVNpT0xtZ3g5UjM2MF8wWmhPdGkwU1hGYTJzOTlreTJjNzVlaFdHVm9mNUxjOXdQVmR3cVE1ZlhrRmpMbWZpU1FFUEx0UVZXVlBBTQ?oc=5" target="_blank">UVA researchers use AI to speed up drug development</a> WVIR

Google News: AI

1mabout 4 hours ago

ProductsLive

UCL appoints Google DeepMind fellow to advance multilingual AI research - EdTech Innovation Hub

<a href="https://news.google.com/rss/articles/CBMisgFBVV95cUxQR3RqV1doQ2lCUFBMLTdSMjU1NEhDdHQ2dEhsbElyd1BLc0J6cE80VTBMYWxHdmk1a2h0NEJzckF6ZU5wN1dEUDR5aGJra1dGZUNEdExRMnFmWm1mUzFkU0tCZkpkdmNTME1JS0ZxSzlsVVNLQjFacEp1NXdJMlJfM3BQSTRlZENOWDlzQnJ1aVJ0amdZRndGYXpvN3pjaDdPMDJjcV9hdmhPTHJ5MkpEenBn?oc=5" target="_blank">UCL appoints Google DeepMind fellow to advance multilingual AI research</a> EdTech Innovation Hub

Google News: DeepMind

1mabout 1 hour ago

Models

Paper Finds That Leading AI Chatbots Like ChatGPT and Claude Remain Incredibly Sycophantic, Resulting in Twisted Effects on Users - Futurism

<a href="https://news.google.com/rss/articles/CBMikwFBVV95cUxQWnR0SXhyVm01QXZhUTNsWDNYSFNoNDZnRWpuN3M0Skw5LXJVNFVOSWg4TWRXSEFqY2Zab0M2LWhKV1hZa0xKcDJId19RSW1WRndVREU1TFVZSl8tZ3U1MGk3U2kzWWtDbm9ZWmNMM3R5VFpMdXJ3ZzlHaXZGR2FQbHBqeWFZekppZHdhVTYyU3BnWDA?oc=5" target="_blank">Paper Finds That Leading AI Chatbots Like ChatGPT and Claude Remain Incredibly Sycophantic, Resulting in Twisted Effects on Users</a> Futurism

Google News: ChatGPT

1m2 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 189 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

UVA researchers use AI to speed up drug development - WVIR

Google News: AI

1mabout 4 hours ago

Research PapersFresh

Illinois Tech computer science researcher honored by IEEE Chicago Section - EurekAlert!

<a href="https://news.google.com/rss/articles/CBMiXEFVX3lxTE13OVpWMEk1Z3hlMkR2bHNBQ2dkazFwb3VqN3hCa29GWGJvSVlPa00zd2xUakRmYXFqQmc5OWU0eGl4a21FMDAwWUN2Q3p0M3FrbXBkNV8zN0cxaG1s?oc=5" target="_blank">Illinois Tech computer science researcher honored by IEEE Chicago Section</a> EurekAlert!

Google News: Machine Learning

1mabout 5 hours ago

Research PapersFresh

Research roundup: 7 cool science stories we almost missed

Ars Technica

1mabout 7 hours ago

Research PapersFresh

AI maps science papers to predict research trends two to three years ahead - Tech Xplore

<a href="https://news.google.com/rss/articles/CBMie0FVX3lxTE5aTkZYTWdaRDZwTXNRMldpMG1WZ1YzWDZTOHN5M183Z3A1ZTFYbnhEWTdPRmpvZnZFU0xodlRsNWxFaGxTcEpwalhJNmJpQWE5VjhaRS1tOXJIeTc5Z0JNblJ3dFd4WjRYZGJOX0NrWGt6ZmZJVTBpRm5wWQ?oc=5" target="_blank">AI maps science papers to predict research trends two to three years ahead</a> Tech Xplore

Google News: Machine Learning

1mabout 7 hours ago