Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 17 von 226

Details

Autor(en) / Beteiligte
Titel
QoS-aware edge AI placement and scheduling with multiple implementations in FaaS-based edge computing
Ist Teil von
  • Future generation computer systems, 2024-08, Vol.157 (C), p.250-263
Ort / Verlag
Netherlands: Elsevier B.V
Erscheinungsjahr
2024
Link zum Volltext
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • Resource constraints on the computing continuum require that we make smart decisions for serving AI-based services at the network edge. AI-based services typically have multiple implementations (e.g., image classification implementations include SqueezeNet, DenseNet, and others) with varying trade-offs (e.g., latency and accuracy). The question then is how should AI-based services be placed across Function-as-a-Service (FaaS) based edge computing systems in order to maximize total Quality-of-Service (QoS). To address this question, we propose a problem that jointly aims to solve (i) edge AI service placement and (ii) request scheduling. These are done across two time-scales (one for placement and one for scheduling). We first cast the problem as an integer linear program. We then decompose the problem into separate placement and scheduling subproblems and prove that both are NP-hard. We then propose a novel placement algorithm that places services while considering device-to-device communication across edge clouds to offload requests to one another. Our results show that the proposed placement algorithm is able to outperform a state-of-the-art placement algorithm for AI-based services, and other baseline heuristics, with regard to maximizing total QoS. Additionally, we present a federated learning-based framework, FLIES, to predict the future incoming service requests and their QoS requirements. Our results also show that our FLIES algorithm is able to outperform a standard decentralized learning baseline for predicting incoming requests and show comparable predictive performance when compared to centralized training. •We define a QoS-aware placement/scheduling problem for edge AI as an ILP.•We consider two time scales for placement/scheduling while predicting requests.•We prove the problem is NP-hard for both placement and scheduling.•We present an FL-based technique for predicting incoming edge AI requests.•We propose novel EI placement algorithm that outperforms state-of-the-art baseline.•We validate efficiency of placement/scheduling algorithms with real-world trace data.•Results confirm efficacy of FL approach against decentralized/centralized learning.
Sprache
Englisch
Identifikatoren
ISSN: 0167-739X
eISSN: 1872-7115
DOI: 10.1016/j.future.2024.03.035
Titel-ID: cdi_osti_scitechconnect_2335417

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX