Enhancing Message Throughput via Aggregation in Parallel Programming

improving throughput of fine grained messages n.w
1 / 16
Embed
Share

This research delves into improving the throughput of fine-grained messages by employing aggregation techniques. It focuses on the Topological Routing and Aggregation Module (TRAM) as a solution, illustrating enhancements and future prospects in Charm++ 6.10. Various benchmarks, examples, and features are discussed, shedding light on the optimization of point-to-point communication and message handling.

  • Parallel programming
  • Message throughput
  • Aggregation techniques
  • Charm++ 6.10
  • Communication optimization

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Improving Throughput of Fine-grained Messages with Aggregation Venkatasubrahmanian Narayanan Parallel Programming Laboratory University of Illinois Urbana-Champaign

  2. Agenda Motivation TRAM s interface What s new in Charm++ 6.10 Future work

  3. Whats the problem? Point-to-point messages are general-purpose Need to be independent of each other Unnecessary per-message cost

  4. Unoptimized

  5. The existing solution Topological Routing and Aggregation Module(TRAM) Combine messages headed in the same direction Easy drop-in solution

  6. Whats the difference? The default approach: vs TRAM:

  7. Benchmarks(old)

  8. Example Drop-in replacement *

  9. Aggregation policy illustrated Flush when buffer fills up Flush on timeout

  10. Routing Virtual grid topology

  11. Feature limitations(prior to release) Support only for 1D chare arrays and groups Messages must be fixed size

  12. New feature updates Charm++ 6.10 adds support for multidimensional chare arrays Future release: Variable sized messages Tunable parameters

  13. Future aggregation policy Flush on reaching threshold Flush on timeout (same as earlier) Send messages larger than cutoff directly

  14. Use cases? Generalized stencil computation Graph algorithms Any communication-heavy application

  15. Future work Improve current implementation Support for more computational topologies

  16. Questions?

More Related Content