Efficient GPU-Based Text Analytics without Decompression

g tadoc enabling efficient gpu based text n.w
1 / 18
Embed
Share

Explore the innovative approach of enabling efficient GPU-based text analytics without decompression, addressing challenges and providing solutions. This research combines insights from academia and industry, offering a groundbreaking method for processing compressed data effectively.

  • GPU
  • Text Analytics
  • Decompression
  • Efficiency
  • Analytics

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression Feng Zhang , Jidong Zhai , Xipeng Shen #, Onur Mutlu , Xiaoyong Du Renmin University of China Tsinghua University #North Carolina State University ETH Zu rich 1

  2. Outline 1. Background 2. Motivation 3. Challenges 4. Our Solution 5. Evaluation 6. Conclusion 2/18

  3. 1. Background TADOC: Text Analytics Directly on Compression Input: Rules: file0 file1 file0: w1 w2 w3 w1 w2 w4 w1 w2 w3 w1 w2 w4 R0 R1 R1 SPT1 R2 w1 R1 R2 w3 R2 w4 R2 w1 w2 R0: R1 R1 SPT1 R2 w1 file1: w1 w2 w1 R1: R2 w3 R2 w4 (b) TADOC compressed data (a) Original data 4 5 5 7 6 0 5 6 2 6 3 6 0 1 w1: 0 w2: 1 w3: 2 w4: 3 R0: 4 R1: 5 R2: 6 SPT1: 7 R2: w1 w2 (c) DAG Representation (d) Numerical representation (e) Compressed data in numerical form [1] Zhang, F., Zhai, J., Shen, X., Mutlu, O., & Chen, W. (2018). Efficient document analytics on compressed data: Method, challenges, algorithms, insights. Proceedings of the VLDB Endowment, 11(11), 1522-1535. 3/18

  4. 1. Background CFG Relation Information Propagation Example: word count <w,i> Word table <w1,6>, <w2,5>, <w3,2>, <w4,2> SPT1 R0: R1 R1 R2 w1 w1: 2 w2: 2 w3: 1 w4: 1 2 + 1 +1 = 6 2 + 1 = 5 2 = 2 2 = 2 Step 3 <w1,2>, <w2,2> <w3,1>, <w4,1> w1: 1 w2: 1 w3: 1 w4: 1 R1: R2 w3 R2 w4 2 = 2 2 = 2 Step 2 Step 1 <w1,1>, <w2,1> w1 w2 R2: 4/18

  5. 2. Motivation GPU popular in data science limited GPU memory different from CPUs Previous GPU-based methods does not apply 5/18

  6. 3. Challenges Challenge 1: GPU parallelism for TADOC Example: R2 depends on R0 and R1, and R1 depends on R0 file0 file1 R0: R1 R1 SPT1 R2 w1 dependencies Edges R1: R2 w3 R2 w4 R2: w1 w2 6/18

  7. 3. Challenges Challenge 2: TADOC final result update conflict of massive GPU threads Example: writing to a global buffer file0 file1 global buffer thread w1 R0: R1 R1 SPT1 R2 w1 w2 thread R1: R2 w3 R2 w4 thread w_i R2: w1 w2 write conflicts 7/18

  8. 3. Challenges Challenge 3: sequence maintenance of TADOC com- pressed data on GPUs Example: cross-rule sequence file0 file1 R0: R1 R1 SPT1 R2 w1 sequence: w1-w2-w3 R1 and R2 R1: R2 w3 R2 w4 R2: w1 w2 8/18

  9. 4. Our Solution modules phases G-TADOC initialization TADOC compressed data data structures data structure preparation light-weight scanning parallel execution engine TADOC program graph traversal top-down traversal / bottom-up traversal sequence support result merging results 9/18

  10. 4. Our Solution Fine-Grained Thread-Level Execution Engine thread2 thread0 thread3 thread1 root R1 R2 w1 spt1 R2 w2 w4 spt2 R3 R1: R2: R3: thread6 w5 R4 R4 w5 w2 R5 thread4 thread5 R5: thread8 R4: thread7 w9 w6 w8 w6 w7 w8 Fine-grained thread-level partitioning design. Top-down / bottom-up traversal design 10/18

  11. 4. Our Solution G-TADOC Data Structures G-TADOC maintained memory pool Thread-safe data structures Head and tail structures for sequence support Locks 0 1 0 0 0 root Entries -1 0 -1 1 -1 w1 w2 w3 R1 w8 w9 w10 Keys 126 163 78 tail head Values 1 1 1 0 0 w4 w5 w6 w7 R1: Next 2 -1 -1 (d) Add key = 78 (suppose hash to 1), and value = 1. 11/18

  12. 4. Our Solution Sequence Support in G-TADOC Phase 1: initialization for head and tail buffers Phase 2: graph traversal with sequence support 12/18

  13. 5. Evaluation Six benchmarks Word Count, Inverted Index, Sequence Count, Ranked Inverted Index, Sort, Term Vector from [1] Five datasets 62 MB ~ 50 GB Four platforms three generations of Nvidia GPUs Pascal, Volta, and Turing micro-architectures Spark cluster (10 nodes on Amazon EC2) [1] Zhang, F., Zhai, J., Shen, X., Mutlu, O., & Chen, W. (2018). Efficient document analytics on compressed data: Method, challenges, algorithms, insights. Proceedings of the VLDB Endowment, 11(11), 1522-1535. 13/18

  14. 5. Evaluation On average, G-TADOC achieves 31.1 speedup over TADOC. wordCount termVector sort sequenceCount invertedIndex rankedInvertedIndex 10000 1000 speedup 100 10 1 A: 580MB B: 2.1GB C: 50GB D: 62MB E: 2.9GB Turing (GeForce RTX 2080 Ti) 14/18

  15. 5. Evaluation Speedups in different phases wordCount termVector sort sequenceCount invertedIndex rankedInvertedIndex 100 1000 speedup speedup 100 10 10 1 1 A B C D E A B C D E (a) Phase 1: initialization. (b) Phase 2: traversal. 15/18

  16. 5. Evaluation Analysis of performance metrics invertedIndex rankedInvertedIndex wordCount termVector sort sequenceCount achieved occupancy 100 80 DRAM throughput 80 60 60 (GB/s) (%) 40 40 20 20 0 0 A B C D E A B C D E (a) DRAM throughput. (b) Achieved occupancy. 16/18

  17. 6. Conclusion G-TADOC, the first framework enabling efficient GPU-based text analytics directly on compressed data Our work can help put much larger content directly in GPU memory. 17/18

  18. Thanks! Any questions? Feng Zhang , Jidong Zhai , Xipeng Shen #, Onur Mutlu , Xiaoyong Du Renmin University of China Tsinghua University #North Carolina State University ETH Zu rich 18/18

Related


More Related Content