Our project will simulate a city where many customers continuously order food delivery from a restaurant. Customers will make their orders with certain probability distribution during one day. Input is the number of delivery men, locations of all customers, and time-based distributions of their orders during one day. Our goal is to deliver foods to all customers within shortest cumulative waiting time in one day.
We will use reinforcement learning, Q-learning, and greedy-algorithm to complete our project.
Quantitative: The baseline of our project is to assign each food order to the nearest delivery man and complete all the food delivery orders. We are using several metrics to evaluate our performance:
Qualitative: It will be impressive if there is no another arrangement that would minimize the total waiting time and there is no customers waiting more than an hour.
Oct. 21 10:10AM, 2019