
Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices

arXiv · March 30, 2026 · [Submitted on 8 Oct 2025 (v1), last revised 27 Mar 2026 (this version, v2)]

Authors: Boris Sedlak, Philipp Raith, Andrea Morichetta, Víctor Casamayor Pujol, Schahram Dustdar


Abstract: Edge devices have limited resources, which inevitably leads to situations where stream processing services cannot satisfy their needs. While existing autoscaling mechanisms focus entirely on resource scaling, Edge devices require alternative ways to sustain the Service Level Objectives (SLOs) of competing services. To address these issues, we introduce a Multi-dimensional Autoscaling Platform (MUDAP) that supports fine-grained vertical scaling across both service- and resource-level dimensions. MUDAP supports service-specific scaling tailored to available parameters, e.g., scaling data quality or model size for a particular service. To optimize the execution across services, we present a scaling agent based on Regression Analysis of Structural Knowledge (RASK). The RASK agent efficiently explores the solution space and learns a continuous regression model of the processing environment for inferring optimal scaling actions. We compared our approach with two autoscalers, the Kubernetes VPA and a reinforcement learning agent, for scaling up to 9 services on a single Edge device. Our results showed that RASK can infer an accurate regression model in merely 20 iterations (i.e., after observing 200 s of processing). By increasingly adding elasticity dimensions, RASK sustained the highest request load with 28% fewer SLO violations than the baselines.
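The abstract does not spell out how RASK works internally, so the following is only a minimal sketch of the general idea it describes: observe a service under a handful of configurations, fit a continuous regression model of its behavior, and use that model to pick a scaling action across a resource-level dimension (CPU cores) and a service-level dimension (data quality). Everything here is an assumption made for illustration, not the authors' implementation: the function names, the simulated environment, the bilinear feature map, the grid search, and the single-service setting. The 20 observations mirror the 20 iterations mentioned in the abstract, which at 200 s in total would correspond to 10 s per iteration.

```python
import numpy as np

def features(cpu, quality):
    """Bilinear feature map for a (CPU cores, data quality) configuration."""
    cpu = np.asarray(cpu, dtype=float)
    quality = np.asarray(quality, dtype=float)
    return np.stack([np.ones_like(cpu), cpu, quality, cpu * quality], axis=-1)

def simulated_throughput(cpu, quality):
    """Hypothetical environment invented for this sketch (items per second):
    more cores raise throughput, higher data quality lowers it."""
    return 10.0 + 30.0 * cpu - 25.0 * quality - 5.0 * cpu * quality

def fit_model(cpu, quality, throughput):
    """Fit the regression weights by ordinary least squares."""
    weights, *_ = np.linalg.lstsq(features(cpu, quality), throughput, rcond=None)
    return weights

def predict(weights, cpu, quality):
    """Predicted throughput for one or many configurations."""
    return features(cpu, quality) @ weights

def best_action(weights, demand, cpu_limit):
    """Search a grid of configurations and return the (cpu, quality) pair that
    is predicted to meet the demand with the highest quality, breaking ties
    toward fewer cores. If nothing is feasible, return the configuration with
    the highest predicted throughput."""
    cpus = np.linspace(0.5, cpu_limit, 8)
    qualities = np.linspace(0.2, 1.0, 9)
    grid_cpu, grid_q = np.meshgrid(cpus, qualities, indexing="ij")
    predicted = predict(weights, grid_cpu, grid_q)
    feasible = predicted >= demand
    if feasible.any():
        score = np.where(feasible, grid_q - 1e-3 * grid_cpu, -np.inf)
    else:
        score = predicted
    idx = np.unravel_index(np.argmax(score), score.shape)
    return float(grid_cpu[idx]), float(grid_q[idx])

# Explore: 20 random configurations with noisy throughput measurements.
rng = np.random.default_rng(0)
obs_cpu = rng.uniform(0.5, 4.0, size=20)
obs_q = rng.uniform(0.2, 1.0, size=20)
obs_tp = simulated_throughput(obs_cpu, obs_q) + rng.normal(0.0, 0.5, size=20)

# Learn the model, then infer a scaling action under a 2-core budget.
weights = fit_model(obs_cpu, obs_q, obs_tp)
cpu, quality = best_action(weights, demand=50.0, cpu_limit=2.0)
print(f"chosen action: cpu={cpu:.2f} cores, quality={quality:.2f}")
```

In this toy setting, the CPU budget alone cannot meet the demand at full quality, so the search trades data quality for throughput. That is the kind of service-level elasticity the abstract contrasts with resource-only autoscaling.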

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)

Cite as: arXiv:2510.06882 [cs.DC] (or arXiv:2510.06882v2 [cs.DC] for this version)

DOI: https://doi.org/10.48550/arXiv.2510.06882 (arXiv-issued DOI via DataCite)

Submission history

From: Boris Sedlak
[v1] Wed, 8 Oct 2025 10:51:50 UTC (6,755 KB)
[v2] Fri, 27 Mar 2026 15:35:04 UTC (6,732 KB)
