Start Over

Learning to Delay in Ride-Sourcing Systems: A Multi-Agent Deep Reinforcement Learning Framework.

Authors :: Ke, Jintao
Xiao, Feng
Yang, Hai
Ye, Jieping
Source :: IEEE Transactions on Knowledge & Data Engineering. May2022, Vol. 34 Issue 5, p2280-2292. 13p.
Publication Year :: 2022
Abstract: Ride-sourcing services are now reshaping the way people travel by effectively connecting drivers and passengers through mobile internets. Online matching between idle drivers and waiting passengers is one of the most key components in a ride-sourcing system. The average pickup distance or time is an important measurement of system efficiency since it affects both passengers’ waiting time and drivers’ utilization rate. It is naturally expected that a more effective bipartite matching (with smaller average pickup time) can be implemented if the platform accumulates more idle drivers and waiting passengers in the matching pool. A specific passenger request can also benefit from a delayed matching since he/she may be matched with closer idle drivers after waiting for a few seconds. Motivated by the potential benefits of delayed matching, this paper establishes a two-stage framework which incorporates a combinatorial optimization and multi-agent deep reinforcement learning methods. The multi-agent reinforcement learning methods are used to dynamically determine the delayed time for each passenger request (or the time at which each request enters the matching pool), while the combinatorial optimization conducts an optimal bipartite matching between idle drivers and waiting passengers in the matching pool. Four tailored reinforcement learning methods, delayed multi-agent deep Q learning (Delayed-M-DQN), delayed multi-agent actor-critic (Delayed-M-A2C), delayed multi-agent Proximal Policy Optimization (Delayed-M-PPO), and delayed multi-agent actor-critic with experience replay (Delayed-M-ACER), are developed. Through extensive empirical experiments with a well-designed simulator, we show that the proposed framework is able to remarkably improve system performances, by well balancing the trade-off among pick-up time, matching time, successful matching rate. [ABSTRACT FROM AUTHOR]

Subjects :: *REINFORCEMENT learning
*MULTIAGENT systems
*DEEP learning
*COMBINATORIAL optimization

Details

Language :: English
ISSN :: 10414347
Volume :: 34
Issue :: 5
Database :: Academic Search Index
Journal :: IEEE Transactions on Knowledge & Data Engineering
Publication Type :: Academic Journal
Accession number :: 156273253
Full Text :: https://doi.org/10.1109/TKDE.2020.3006084

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Learning to Delay in Ride-Sourcing Systems: A Multi-Agent Deep Reinforcement Learning Framework.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Learning to Delay in Ride-Sourcing Systems: A Multi-Agent Deep Reinforcement Learning Framework.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources