
On the limited utility of parallel data for learning shared multilingual representations

arXiv cs.CL · by Julius Leino, Jörg Tiedemann · April 1, 2026


Abstract: Shared multilingual representations are essential for cross-lingual tasks and knowledge transfer across languages. This study looks at the impact of parallel data, i.e. translated sentences, in pretraining as a signal to trigger representations that are aligned across languages. We train reference models with different proportions of parallel data and show that parallel data seem to have only a minimal effect on cross-lingual alignment. Based on multiple evaluation methods, we find that the effect is limited to potentially accelerating representation sharing in the early phases of pretraining, and to decreasing the number of language-specific neurons in the model. Cross-lingual alignment seems to emerge at similar levels even without the explicit signal from parallel data.
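The abstract does not say which evaluation methods were used. As a rough illustration of what "cross-lingual alignment" can mean in practice, the sketch below computes translation retrieval accuracy: for each source sentence embedding, it checks whether the nearest target embedding by cosine similarity is the embedding of its own translation. This is one common proxy and an assumption here, not the authors' method; the function names and the toy vectors are hypothetical.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def retrieval_accuracy(src: List[Vector], tgt: List[Vector]) -> float:
    """Fraction of source embeddings whose nearest target is their own translation.

    Assumes src[i] and tgt[i] embed the two sides of one translation pair
    (an illustrative setup, not taken from the paper).
    """
    if len(src) != len(tgt):
        raise ValueError("src and tgt must contain the same number of vectors")
    if not src:
        return 0.0
    hits = 0
    for i, s in enumerate(src):
        sims = [cosine(s, t) for t in tgt]
        # Index of the most similar target embedding (first one on ties).
        best = max(range(len(tgt)), key=sims.__getitem__)
        if best == i:
            hits += 1
    return hits / len(src)


# Toy 2-D "sentence embeddings" (hypothetical values for illustration only).
source_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
aligned_targets = [[0.9, 0.1], [0.1, 0.9], [0.7, 0.8]]
# Same targets in rotated order, so no pair lines up with its translation.
misaligned_targets = [aligned_targets[1], aligned_targets[2], aligned_targets[0]]

aligned_score = retrieval_accuracy(source_vectors, aligned_targets)
misaligned_score = retrieval_accuracy(source_vectors, misaligned_targets)
```

With real models, the vectors would typically come from pooled hidden states of translated sentence pairs, and the score could be tracked across checkpoints to see when alignment emerges during pretraining.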

Subjects: Computation and Language (cs.CL)

Cite as: arXiv:2603.29026 [cs.CL]

(or arXiv:2603.29026v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.29026

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Julius Leino
[v1] Mon, 30 Mar 2026 21:37:34 UTC (1,252 KB)

Original source

arXiv cs.CL

https://arxiv.org/abs/2603.29026
