Real-time Driver-Rider Matching Service with Apache Samza

a real time driver rider matching service based n.w
1 / 5
Embed
Share

Explore a cutting-edge real-time driver-rider matching service built on Apache Samza, enhancing ride-sharing experiences. Discover a three-tier architecture, implementation details, and performance testing insights for optimal functionality.

  • Real-time Matching
  • Apache Samza
  • Ride-sharing
  • Architecture
  • Performance Testing

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. A Real-time Driver-Rider Matching service based on Apache Samza Group 22 Pan Kong, Zhangfan Dong, and Kaiyue Sun

  2. Background Ride-sharing is a trending topic both in research area and industry Research on characterizing the behavior of carpoolers -overlap in people s commute, upper bound to practical ride-sharing system, Prof. Markopoulou, UCI In pratical, ride-sharing system -web-based, carpooling.com, eRideShare.com -mobile-based, promisingUber, Lyft

  3. Three-tier Architecture Presentation Tier Presentation Tier Throughput for popular databases UI on Browsers UI on Browsers HBase: 2.5K op/sec MySQL: 25K op/sec MongoDB: 10K op/sec Cassandra: 25K op/sec Application Tier Application Tier Data Tier (a single node, read intensive workload) Web Server Web Server User Profiles Matching Engine Problems The driver location update and rider location are transitory data, no need to store in database Database -> bottleneck, on high workload (Exclusive access, lock) e.g 100,000 drivers, centralized database, driver update every 4sec, throughput -> 25K write/sec Matching engine is also a bottleneck Stream Processing Tier Apache Samza Matching Engine User Profiles

  4. Architecture & Implementation Architecture & Implementation SAMZA Push Kafka Msg Matching result Matching Engine Input Topics: - Driver Location - Ride Request Pull Kafka Msg Driver updates Ride requests Query Server HTTP Request Driver updates Ride requests Push Kafka Msg Driver updates Ride requests Output Topics: - Match kafka Client UI HTTPServer REST Proxy HTTP Response Pull Kafka Msg Matching result Matching result Live Demo https://ec2-52-40-203-24.us-west-2.compute.amazonaws.com/

  5. Performance Testing fig.1 Throughput fig.2 Latency

Related


More Related Content