Multi Mode Collaborative Internet of Vehicles Edge Computing Task Unloading Based On Deep Rein-forcement Learning
Affiliation:

Civil Aviation Flight University of China

Fund Project:

The Central University Basic Research Business Fee Fund Project (J2023-027); Open Fund of Key Laboratory of Flight Techniques and Flight Safety; CAAC(No. FZ2022KF06) and China postdoctoral science foundation( No. 2022M722248).

  • Article
  • | |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    With the rapid development of Internet of Vehicles technology, edge computing plays an increasingly important role in information processing applications of Internet of Vehicles. However, the computing resources and storage capa-bilities of vehicle equipment in the Internet of Vehicles are limited, and vehicles need to process large amounts of data and complex computing tasks. How to efficiently utilize edge computing resources is an important issue. A single offloading method may not be able to meet the needs of the scenario. Consider the collaboration of multiple offload-ing methods to improve the overall performance and flexibility of the system. In response to the above challenges, a multi-mode collaborative vehicle network edge computing task offloading solution based on deep reinforcement learning is proposed. By using Vehicle to Vehicle (V2V), Vehicle to Vehicle Alliance (V2A), Vehicle to Roadside Unit (V2R) and Roadside Unit to Base (Roadside Unit to Base) Station, R2B) and other communication methods realize collaborative processing of tasks with high computing density and high latency requirements. In order to cope with the complexity of the dynamic network environment, the Markov Decision Process (MDP) is used for problem modeling and optimization, and deep reinforcement learning is introduced to handle continuous actions and state space. In particular, the MCTO algorithm is proposed, which can efficiently adapt to the dynamic environment of the Internet of Vehicles and significantly reduce the delay of the entire Internet of Vehicles system. The simulation results show that the proposed MCTO algorithm has good convergence and is significantly improved in terms of system delay compared with other reinforcement learning algorithms, with the overall performance improved by 28.67%.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:May 23,2024
  • Revised:January 12,2025
  • Adopted:February 17,2025
Article QR Code

Address:No. 219, Ningliu Road, Nanjing, Jiangsu Province

Postcode:210044

Phone:025-58731025