
EleutherAI's Thoughts on the EU AI Act

EleutherAI Blog · July 26, 2023

How we are supporting open source and open science in the EU AI Act.

In June, the European Parliament adopted its negotiating position on the EU AI Act, a comprehensive piece of legislation aimed at regulating a wide variety of artificial intelligence (AI) research, products, and services. It is expected to be finalized and adopted by the end of the year, bringing widespread changes to the way AI organizations operate in the European Union. There is a lot in the current draft's regulations on large-scale AI systems that we agree with, such as an emphasis on transparency and documentation and an explicit requirement to assess the suitability of training data. Unfortunately, the current text places a substantial burden on non-profit, open source, and community-driven research, drawing no distinction between tech giants like OpenAI and Google, non-profit research groups like EleutherAI and the Allen Institute for Artificial Intelligence, and independent hobbyists who train or finetune models governed by this law.

[Read the full position paper here]

In April we released the Pythia model suite, a set of eight models trained on two different datasets, ranging from 70 million to 12 billion parameters. To empower researchers to study how the capabilities of large language models evolve over the course of training, we saved and released 154 checkpoints per model, providing an unprecedented level of detail about how large language models train. The 154 Pythia-12B checkpoints represent more partially trained checkpoints for a single model than the rest of the world has ever released across all other language models of 12 billion parameters or larger. Pythia has received widespread acclaim, earning over sixty citations in just four months and an oral presentation at the International Conference on Machine Learning (ICML) occurring later today. Under the current parliamentary text we would not be able to do a project like this again, as the over 5,000 variations and partially trained model checkpoints each count as their own model and would require the same individualized documentation, testing, and reporting as if we had developed over 5,000 distinct commercially deployed models.
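To make the scale of that checkpoint release concrete, here is a minimal sketch enumerating the 154 checkpoints per model, assuming the schedule described in the Pythia paper (initialization at step 0, log-spaced early steps 1 through 512, then every 1,000 steps up to 143,000). The resulting names match the Hugging Face Hub revision tags under which the checkpoints were published.

```python
# Enumerate the 154 Pythia checkpoint revisions for a single model,
# assuming the schedule from the Pythia paper.
log_steps = [0] + [2**i for i in range(10)]     # 0, 1, 2, 4, ..., 512
linear_steps = list(range(1000, 144000, 1000))  # 1000, 2000, ..., 143000
revisions = [f"step{s}" for s in log_steps + linear_steps]

print(len(revisions))  # 154 checkpoints per model

# A partially trained checkpoint can then be loaded by revision tag, e.g.:
# AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-70m", revision="step3000")
```

Under the parliamentary text, each of these revisions, multiplied across every model in the suite, would count as a separately regulated model.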

The Parliamentary text also includes requirements that are currently impossible for EleutherAI to comply with. For example, it requires reporting energy usage and environmental data about the computing cluster used to train the model, information we do not necessarily have access to since we, like almost everyone who does large-scale AI research, do not own the actual GPUs we use to train our models. While we work with our cloud providers to disclose as much as possible about energy usage and environmental impact, some of the information the EU Parliament text requires for disclosure is viewed as proprietary by the cloud providers and is not something we have access to.

To address these shortcomings, EleutherAI has partnered with Creative Commons, Hugging Face, GitHub, LAION, and Open Future to draft a position paper detailing our perspectives on the parliamentary text and recommending how the EU can better achieve its goals by embracing what the open source community has to offer. Our primary recommendations are:

  • Define AI components clearly,

  • Clarify that collaborative development of open source AI components and making them available in public repositories does not subject developers to the requirements in the AI Act, building on and improving the Parliament text’s Recitals 12a-c and Article 2(5e),

  • Support the AI Office’s coordination and inclusive governance with the open source ecosystem, building on the Parliament’s text,

  • Ensure the R&D exception is practical and effective, by permitting limited testing in real-world conditions, combining aspects of the Council’s approach and an amended version of the Parliament’s Article 2(5d),

  • Set proportional requirements for “foundation models,” recognizing and distinctly treating different uses and development modalities, including open source approaches, tailoring the Parliament’s Article 28b.

EleutherAI is an unprecedented experiment in doing open, transparent, and public scientific research in artificial intelligence. While we do not believe that all organizations must necessarily follow in our footsteps, we believe it is important that somebody reveals what goes on behind the curtain during the development of these increasingly influential technologies. As such, we are committing today not only to comply with the final text to the best of our ability, but also to document and publicly disclose all costs we incur and additional steps we need to take to achieve compliance. As countries around the world look to the EU AI Act when drafting their own regulation, we hope that an honest and open accounting of our ability to comply with it will provide lawmakers essential information about how to design regulatory frameworks that do not put an undue burden on non-profit, open source, and independent researchers.

Original source

EleutherAI Blog

https://blog.eleuther.ai/eu-aia/