Machine Learning in the Wild: Early Evidence of Non-Compliant ML-Automation in Open-Source Software

arXiv cs.SE · by Zohaib Arshid, Daniele Bifolco, Fiorella Zampetti, Massimiliano Di Penta · April 1, 2026

Abstract: The increasing availability of Machine Learning (ML) models, particularly foundation models, enables their use across a range of downstream applications, from scenarios with missing data to safety-critical contexts. This, in principle, may contravene not only the models' terms of use, but also governmental principles and regulations. This paper presents a preliminary investigation into the use of ML models by 173 open-source projects on GitHub, spanning 16 application domains. We evaluate whether models are used to make decisions, the scope of these decisions, and whether any post-processing measures are taken to reduce the risks inherent in fully autonomous systems. Lastly, we investigate the models' compliance with established terms of use. This study lays the groundwork for defining guidelines for developers and creating analysis tools that automatically identify potential regulatory violations in the use of ML models in software systems.

Subjects: Software Engineering (cs.SE)

Cite as: arXiv:2603.29698 [cs.SE]

(or arXiv:2603.29698v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.29698

arXiv-issued DOI via DataCite (pending registration)

Journal reference: 34th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, July 5-9, 2026, Montreal, QC, Canada

Related DOI: https://doi.org/10.1145/3803437.3805572

Submission history

From: Daniele Bifolco

[v1] Tue, 31 Mar 2026 12:53:01 UTC (91 KB)

Original source

arXiv cs.SE

https://arxiv.org/abs/2603.29698
