Research Papers research paper arxiv ai artificial-intelligence

Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices

arXivby [Submitted on 8 Oct 2025 (v1), last revised 27 Mar 2026 (this version, v2)]March 30, 20262 min read1 views

arXiv:2510.06882v2 Announce Type: replace-cross Abstract: Edge devices have limited resources, which inevitably leads to situations where stream processing services cannot satisfy their needs. While existing autoscaling mechanisms focus entirely on resource scaling, Edge devices require alternative ways to sustain the Service Level Objectives (SLOs) of competing services. To address these issues, we introduce a Multi-dimensional Autoscaling Platform (MUDAP) that supports fine-grained vertical scaling across both service- and resource-level dimensions. MUDAP supports service-specific scaling ta — Boris Sedlak, Philipp Raith, Andrea Morichetta, V\'ictor Casamayor Pujol, Schahram Dustdar

View PDF HTML (experimental)

Abstract:Edge devices have limited resources, which inevitably leads to situations where stream processing services cannot satisfy their needs. While existing autoscaling mechanisms focus entirely on resource scaling, Edge devices require alternative ways to sustain the Service Level Objectives (SLOs) of competing services. To address these issues, we introduce a Multi-dimensional Autoscaling Platform (MUDAP) that supports fine-grained vertical scaling across both service- and resource-level dimensions. MUDAP supports service-specific scaling tailored to available parameters, e.g., scale data quality or model size for a particular service. To optimize the execution across services, we present a scaling agent based on Regression Analysis of Structural Knowledge (RASK). The RASK agent efficiently explores the solution space and learns a continuous regression model of the processing environment for inferring optimal scaling actions. We compared our approach with two autoscalers, the Kubernetes VPA and a reinforcement learning agent, for scaling up to 9 services on a single Edge device. Our results showed that RASK can infer an accurate regression model in merely 20 iterations (i.e., observe 200s of processing). By increasingly adding elasticity dimensions, RASK sustained the highest request load with 28% less SLO violations, compared to baselines.

Subjects:

Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)

Cite as: arXiv:2510.06882 [cs.DC]

(or arXiv:2510.06882v2 [cs.DC] for this version)

https://doi.org/10.48550/arXiv.2510.06882

arXiv-issued DOI via DataCite

Submission history

From: Boris Sedlak [view email] [v1] Wed, 8 Oct 2025 10:51:50 UTC (6,755 KB) [v2] Fri, 27 Mar 2026 15:35:04 UTC (6,732 KB)

Original source

arXiv

https://arxiv.org/abs/2510.06882

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsLive

Positional Restructuring of System Prompts: Mitigating Transformer Attention Bias in Sub-Frontier Models

I built a sovereign AI system on a Mac Mini that kept forgetting facts written in its own system prompt. Instead of upgrading hardware, I figured out why — and found some things I was not expecting. The obvious part: moving critical facts from the middle to the beginning and end of the system prompt fixes recall (2.0 to 7.0 on a verification battery). This builds on Liu et al.'s lost-in-the-middle work. The less obvious part: a model with 83.4% IFBench scored 3.4/10 on fact recall while a model with 23.9% IFBench scored 7.5/10 after restructuring. Instruction-following and fact recall appear to be independent capabilities. I have not seen this documented elsewhere. The paper also covers a behavioral rule methodology that took a 32B model from 6.2 to 9.4 across seven dimensions with cold re

discuss.huggingface.co

1mabout 1 hour ago

ModelsRecent

Anthropic to all AI companies: Our research tells that all LLMs sometimes act like they have emotion, so - The Times of India

Anthropic to all AI companies: Our research tells that all LLMs sometimes act like they have emotion, so The Times of India

Google News: Claude

1mabout 22 hours ago

ProductsLive

"Be Anything You Want" — OK, Here's How (Technically)

This is a submission for the DEV April Fools Challenge What I Built "I Want To Be..." is a life advice generator that takes your dreams and fulfills them — literally. Want to be rich? Change your name to Richard. Want to be a ninja? Wear all black and move slightly too quietly. People will get the idea. Want to be a cat? Knock something off a table and maintain eye contact. Cat energy. It's a genie who passed the bar exam for loopholes. You asked, we delivered. Technically. 44 categories of deadpan, literally-correct life hacks — from "astronaut" to "wizard" to "left alone" — plus 24 universal fallback answers for the truly original dreamers. Every answer is technically true. None of them are helpful. All of them are stamped 100% LEGIT ADVICE . Demo Try it live on GitHub Pages Type in your

DEV Community

4mabout 2 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 123 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersLive

ARTIFICIAL INTELLIGENCE KEYNOTE SPEAKER FOR CORPORATE EVENTS & AI CONFERENCES - futuristsspeakers.com

ARTIFICIAL INTELLIGENCE KEYNOTE SPEAKER FOR CORPORATE EVENTS & AI CONFERENCES futuristsspeakers.com

Google News: AI

1mabout 2 hours ago

Research PapersRecent

This Wi-Fi receiver can work inside a nuclear reactor, keeping robots connected

The research, presented at the IEEE International Solid-State Circuits Conference in San Francisco earlier this year, shows the receiver can continue operating after exposure to 500 kilograys of radiation. That level of endurance far exceeds what even space-grade electronics are designed to handle. Read Entire Article

TechSpot

1mabout 18 hours ago

Research PapersRecent

AI Music & Creators Conference - Bennett College

AI Music & Creators Conference Bennett College

Google News: AI

1mabout 18 hours ago

Research PapersRecent

Can space solve AI's crisis? Oracle cuts 30,000 workers while half of Earth projects remain stuck - Cryptopolitan

Can space solve AI's crisis? Oracle cuts 30,000 workers while half of Earth projects remain stuck Cryptopolitan

GNews AI USA

1mabout 13 hours ago