Adaptive SFC Management and Orchestration Based on DRL in Edge Intelligence for Computation Efficiency
Seyha Ros, Taikuong Iv, Intae Ryoo, Seokhoon KimNetwork functions virtualization (NFV) is an emerging technology that enables flexible service deployment for supporting the Beyond 5G/6G network. NFV transforms physical network devices into virtual network functions (VNF) over Edge Computing capabilities, thereby facilitating the agility of network services and reducing management costs. To effectively monitor Internet of Things (IoT) network resources, service function chaining (SFC) is used for its virtualizations to ensure the multi-service requirements are sufficiently in capability, scalability, and flexibility for computation workloads alignments. However, to satisfy the resource availability requirements and efficiency under several conditions, SFC reconfiguration methods face the challenges in meeting significant latency requirement of delay-sensitive applications while reaching the importance of energy saving on orchestration timespan. In this paper, we propose task management-aware SFC and orchestrating schemes, namely GNN-PPO. In this framework, we utilize the Graph Neural Network (GNN), which relies on the message-passing neural network (MPNN), to capture all the abstraction of physical resource nodes and link capabilities over MEC node states. In particularly, GNN is divided construction into two phrases: (1) GNN represents nodes for all the Mobile edge computing (MEC) nodes, which have a global view on resources of computation and communicational capabilities that could serve as carriers; (2) VNFs are transferred into graph networks by using feature-extraction MPNN to manage each VIM that seeks an optimal and reliable analysis of traffic fluctuations. Lastly, Deep Reinforcement Learning (DRL) is used to embrace the network determination in policy strategy, which utilizes a Proximal Policy Gradient (PPO). On the other hand, we propose a novel network architecture based on PPO to perform the design for the optimization of resource utilization and facilitate energy consumption on MEC servers under diverse setting scenarios, which enables continuous policy enforcement for our system. With the experimental results, we compare our proposed solution with reference schemes in terms of rewards with learning rate and batch size, average request acceptance, SFC success, packet delivery, throughput, and resource utilization ratio that confirm the scheme’s scalability and practical suitability for IoT network deployment.