Modern on-demand meal delivery platforms pose significant management challenges due to complex interactions among strategically behaving decentralized stakeholders. Existing approaches struggle to manage these decentralized decisions effectively and l...
Modern on-demand meal delivery platforms pose significant management challenges due to complex interactions among strategically behaving decentralized stakeholders. Existing approaches struggle to manage these decentralized decisions effectively and lead to inefficient resource utilization and operational shortcomings. To overcome these limitations, this dissertation proposes an actionable framework based on multi-agent reinforcement learning (MARL) for adaptive demand and resource management in decentralized operational environments. The dissertation contains two distinct yet complementary studies. The first study focuses on a dynamic order assignment algorithm which anticipate the strategic behavior of heterogeneous couriers. It employs a MARL-inspired interaction model integrated with a reinforcement learning-driven bipartite matching approach to optimize real-time task allocation. Simulation experiments demonstrate substantial improvements in target dispatch time compliance, order backlogs, and workload distribution across couriers. The second study addresses dynamic service area sizing through a cooperative MARL approach using the QMIX algorithm. This method dynamically adapts Customer Service Areas (CSA) and Driver Dispatch Areas (DDA), explicitly considering spatial spillover effects and real-time fluctuations in demand and courier availability. A key methodological innovation is the integration of a flexible constraint-handling mechanism without explicit penalties, supported by a novel best-effort action selection strategy, ensuring robust operational adaptability. Empirical validation through simulation experiments exhibits significant enhancements in real-time responsiveness and overall system efficiency compared to independent optimization approach. This dissertation integrates strategic agent interactions into centralized adaptive management strategies and advances the field by proposing a multi-agent approach to managing complex systems involving stakeholder interactions, thereby enhancing the practical foundations of adaptive resource management in decentralized service environments.