Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 5 von 441
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, p.2645-2653
2019
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching
Ist Teil von
  • Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, p.2645-2653
Ort / Verlag
New York, NY, USA: ACM
Erscheinungsjahr
2019
Quelle
ACM Digital Library
Beschreibungen/Notizen
  • Improving the efficiency of dispatching orders to vehicles is a research hotspot in online ride-hailing systems. Most of the existing solutions for order-dispatching are centralized controlling, which require to consider all possible matches between available orders and vehicles. For large-scale ride-sharing platforms, there are thousands of vehicles and orders to be matched at every second which is of very high computational cost. In this paper, we propose a decentralized execution order-dispatching method based on multi-agent reinforcement learning to address the large-scale order-dispatching problem. Different from the previous cooperative multi-agent reinforcement learning algorithms, in our method, all agents work independently with the guidance from an evaluation of the joint policy since there is no need for communication or explicit cooperation between agents. Furthermore, we use KL-divergence optimization at each time step to speed up the learning process and to balance the vehicles (supply) and orders (demand). Experiments on both the explanatory environment and real-world simulator show that the proposed method outperforms the baselines in terms of accumulated driver income (ADI) and Order Response Rate (ORR) in various traffic environments. Besides, with the support of the online platform of Didi Chuxing, we designed a hybrid system to deploy our model.

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX