No more Cancellations? No more 'Jana Kahan Hai?'?
Uber Driver Ride Acceptance Model

No more Cancellations? No more 'Jana Kahan Hai?'

Good news for travelers! On a personal front majority of my ride-booking used to get canceled after the question of 'Jana Kahan Hai?'.

I am sure my fellow subscribers must have experienced the feeling when the driver calls and asks the dreaded question: 'Jana Kahan Hai' (where do you have to go) and then cancels the trip.

With the latest update now Uber is showing trip destinations to drivers before they decide to accept a ride to enable them to make informed choices. You will likely not have to pay Uber trip cancellation fee and the driver is unlikely to cancel the trip either.

The thing now from change from here, it has been always hard to predict if a driver would cancel a ride or not as the conversation of user-driver used to be on a phone call. Since the driver is now more informed about the pick and drop location of the user and is incentives separately for the distance traveled to pick up the user, the focus now shifts from cancellation to driver accepting a ride.

Let's understand from a Data Science (Machine Learning) perspective what happens behind the scene to match user requests to a driver whose probability to accept the ride is high and get your request fulfilled in the minimum time possible.

Matching

In summary, to find the best cab drivers for you — within a few seconds; these ride-hailing companies (Uber, Lyft, Ola, Rapido, etc.) run a matching algorithm and also check a driver’s ride acceptance probability before pushing a request to them.

In this Newsletter, we shall discuss how we can build a driver ride acceptance probabilistic model?

Objective: Predict if a driver will accept ride request or not and find the probability of acceptance?

This Blog is Sponsored by ProjectPro .

Stop going to multiple online forums to hack together solutions. ProjectPro has ready-made professional project templates for data extraction, data analysis, data visualization, model deployment, and?more.

If you are interested to learn demand forecasting at a ride-hailing platform check out my Data Science course at ProjectPro.

Characteristics / Features Required -

In order to figure out the features required to solve this problem in a ride-hailing business, a data scientist must be well-versed with domain knowledge. Product thinking is always important for a data scientist.

1. Trip

  • Drive to User (Client) Distance (Driver to Pickup Distance)
  • Time of the Day (Morning / Afternoon / Evening / Late Night)
  • Trip duration/distance (Pick up to Drop Distance)

Pickup to Drop Distance

  • Payment method (some drivers prefer Cash Mode more than online payments)
  • Destination of the client (drivers don’t like to go to a destination where they will have to struggle to get their next client)
  • Ride Type — Pool request or Normal request

2. Driver

a) Enroute or Available

  • Whether the driver is available (no rides)
  • If the driver is en route — the trip is about to end or in the middle of it

b) Historic Features

  • In the last one week/month avg. number of rides accepted rate
  • Total trips completed throughout the day - If the driver has achieved his incentive target of the day or not
  • Current days acceptance rate (acceptance rate = Number of requests accepted / Total requests received)

3. Vehicle Type

  • Cab / Auto Rickshaw / Bike (based on vehicle type some drivers don’t accept client pickups in lonely areas - safety concerns)
  • Many Bike / Auto Rickshaw drivers don't prefer ride requests of > 10kms

Auto Ride

4. Rider (Client)

  • Rider Rating (driver doesn’t like to deal with clients below 4 ratings)
  • Rider gender (some drivers don’t accept ride requests based on request time and client gender)
  • Rider Image (interesting experiment someone did by replacing his image with a zombie image and got less driver acceptance; proving the image of the client to the driver also matters)

Interesting read profile image matters on both (driver and rider) side ??: 'Zombie' drivers are scamming people out of cash with horrible profile pictures

5. Traffic

  • Based on request time — drivers sometimes don’t like to accept rides in heavy traffic zones (the duration of the trip to complete or to reach the client - depends on traffic in the region)

6. Special Events (occasional change)

  • Weather — Rainy Day / Sunny
  • Covid restrictions in that Area (Quarantine zone)
  • Festival Day based on region — Holi, Diwali, Christmas, etc.

Rainy Day

Modeling

We now have a rich feature set that can help us predict whether a driver will accept a client’s ride request or not. We use standard statistical machine learning supervised classification algorithms(with spot-checking):

  • Logistic Regression (Linear Model)
  • Decision Tree (Non-Linear Model)
  • Bagging Classifier — Random Forest Classifier (Ensemble Model)
  • Boosting Classifier — LightGBM, XGBoost, etc.

Model Metrics: AUC-ROC, F-beta score (beta = 2; if Recall is twice important as Precision)

Model Inference: Find out the closest <K> candidate drivers to the rider and send trip requests to drivers based on the (Highest to Lowest) probability of their acceptance from the model output.

Conclusion

Conclusion

I hope you understood the business problem and can relate to the features we picked for modeling out the patterns. While there is no silver bullet solution and these problems are way more complex, our aim was to improve the user experience and minimize the user-driver matching time as even a millisecond of change in the driver-user matching algorithm can help save millions of dollars.

According to a paper entitled The Cost of Latency in High-Frequency Trading, a 1-millisecond advantage in latency can be worth upwards of $100 million per year.

Reference: Hindustan Times Article on Uber's Update

I hope you learned something new from this post. If you liked it, hit ??, subscribe to my newsletter, and share this with others. Stay tuned for the next one!

Connect, Follow or Endorse me on?LinkedIn ?if you found this read useful. To learn more about me visit:?Here

The newsletter is now read by more than 4000 subscribers. If you are building an AI or a data product or service, you are invited to become a sponsor of one of the future newsletter issues. Feel free to reach out to?[email protected]?for more details on sponsorships.

Other Recommended Newsletters:

[1]?Experimentation when you can't A/B Test | Beyond A/B Testing - Switchbacks & Synthetic Control Group

[2]?Mastering A/B Testing by understanding Pitfalls

[3]?Data Science in Ride-Hailing at Ola, Uber, Rapido, etc.

Subscribe to get Email Notification:?HERE
Debarchan Basu

School of AI @ IIT Delhi | IISc Bangalore | BITS Pilani

2 年

Honestly, this would not solve the problem at all. Because in this case, the driver would invariably cancel trips to locations from where they won't presumably get more trips. Pehle wo call karke cancel karte the, ab khud hi dekhke cancel karenge.

Amit Sharma

Staff Software engineer, Data Engineer |Tech lead | Machine Learning

2 年

Very Intresting. I am on vacation in india from last week and noticed this happening. I see most cases drivers are transferring to other drivers and this keep going forever. For sure this will improve.

Parth Shetye

Operations - Restaurants // QSR // E-Commerce // Travel // Tourism

2 年

Driver knowing the final destination will impact the user experience and decline % is also bound to jump. One of the reasons I preferred Uber was I don't have to keep on asking 10's of Kali Peeli for a ride.

Shaurya Uppal

Data Scientist | MS CS, Georgia Tech | AI, Python, SQL, GenAI | Inventor of Ads Personalization RecSys Patent | Makro | InMobi (Glance) | 1mg | Fi

2 年

I have a course about demand forecasting in ride-hailing domain on ProjectPro: https://lnkd.in/dkCAktD2

Srikanth Soma

AI @ Databricks | Top Machine Learning Voice

2 年

This is a good read. One more feature to consider is if there is any surge fare. That acts as an incentive for the driver to accept.

要查看或添加评论,请登录

社区洞察

其他会员也浏览了