Impact of Science DMZ and Commodity Internet Fusion

Impact of Science DMZ and Commodity Internet Fusion
Slide Note
Embed
Share

When the Science DMZ meets the Commodity Internet, a fascinating fusion occurs, creating a high-performance environment with fast data transfer rates. By combining unlimited storage, frictionless connectivity, and open-source tools, researchers experience significant spikes in data traffic, enhancing their capabilities. This integration leads to optimized data utilization and seamless transition between peering points, revolutionizing research and exploration domains.

  • Science DMZ
  • Commodity Internet
  • High-performance Computing
  • Data Transfer
  • Research

Uploaded on Feb 21, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. What happens when the Science DMZ meets the Commodity Internet? Presenter: Joe Breen Work done: Brian Haymore, Sam Liston University of Utah Center for High Performance Computing

  2. Boil and Bubble, Toil and Trouble, What do you get with a Science DMZ, That's mixed up with unlimited storage Fast and free? Image credit: http://www.zastavki.com/eng/Holidays/Halloween/wallpaper-24660.htm

  3. Start with a researcher

  4. Offer him candy (unlimited storage)

  5. Tell her its free

  6. Mix in a "frictionless" Science DMZ environment with 40G and 10G Data Transfer Nodes Image: https://fasterdata.es.net/science-dmz/science-dmz-architecture/

  7. Mix in a well built commodity cloud service that can consume lots of data quickly

  8. Throw in an open source parallel tool that knows how to efficiently utilize a cloud provider s API Google Drive Amazon S3 Openstack Swift / Rackspace cloud files / Memset Memstore Dropbox Google Cloud Storage Amazon Cloud Drive The local filesystem http://rclone.org/

  9. What do you get?

  10. 14+Gig spikes, 5-8Gb/s of sustained traffic

  11. One 10G commodity pipe fills completely, traffic rolls to next available close peering point, and fills it too

  12. R&E routes yanked temporarily by Cloud vendor NOC to allow service to other commodity users and to better understand nature of congestion

  13. A call from a cloud provider NOC early in the morning asking to stop. (at least for a bit)

  14. Almost 100TB of data moved in 2.5 days

  15. Very happy researchers who want more

  16. What's an HPC center to do? Start mixing more...

  17. Multiple vendors now offering Apps/unlimited storage targeted at EDUcation *individual* individual* users.

  18. Same vendors offer multi-tier Cloud storage for purchase for archival storage, mid-level storage, highly available storage, specific application storage

  19. Today's HPC researcher use cases for large personal cloud storage Using as another storage bin for keeping points in time source code, input files, etc. Using instead of a USB drive - a temporary location for: Saving Snippets of code Looking at single files Moving data from a national resource that requires cleaning Sharing data to a distributed international audience

  20. Potential other use cases for personal cloud storage or organization cloud storage Use as large archive for different data sets for individual researcher Use as archive for a research group or a collection of collaborating groups "service account" with Linux extended ACLs to tar up and backup multiple collaborating research groups using the same file systems allows a simple formula for groups to replicate data in another location -- flexibility for users at their level and convenience Back off info from scratch drives - shuffle back and forth from Cloud provider and from HPC scratch Accommodate over DMZ DTN box ... The list continues

  21. Some of these vendors have Research & Education peering directly, some of them only maintain Commodity peering

  22. Commodity peering not designed for bursty research or long term large data set file transfers Large research data set transfer Normal Commodity traffic Commodity peering points serve lots of companies and businesses

  23. Do we need additional tools in our ecosystem? How do we protect the community, the vendors, and the collaborative network environments, AND, encourage the innovation?

  24. Should we more aggressively leverage the emerging capabilities of the national Research and Education Software Defined Networking backbone? Match = Action Match flow (source IP, source port, destination IP, destination port ) => Action apply QoS or block per flow

  25. Data Transfer Nodes are generally multi-tenant, serving the transfer needs of several groups simultaneously. Image: https://fasterdata.es.net/science-dmz/science-dmz-architecture/ How might we work with one tenant AND continue to service other tenants well without null routing whole networks or even individual DTNs?

  26. Does the community need a more precision tool? OR Image: http://onsurg.com/instrument-handling-10-blade/

  27. Summary Science DMZs are frictionless, Commodity connections to Cloud vendors are not Cloud vendors are coming up with new business plans that researchers like to use creatively Do emerging technologies allow us to create new tools that might enable the innovation and protect the ecosystem?

Related


More Related Content