Overview of New Experimental Coding Tools by Sarah Parker

Overview of New Experimental Coding Tools by Sarah Parker
Slide Note
Embed
Share

"Explore the latest coding tools in the realm of AV1 codec technology. From block partitions to transform schemes, this overview delves into the advancements shaping the next generation of coding techniques. Discover the flexible block partition schemes, extended transform partitioning, and data-driven transforms that are revolutionizing the industry."

  • Coding Tools
  • AV1 Codec
  • Transform Schemes
  • Experimental Technology
  • Block Partitions

Uploaded on Mar 16, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. An Overview of New Experimental Coding Tools Sarah Parker Google

  2. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  3. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  4. Background AV1 released in 2018 31% BDRATE reduction over VP9 Accounts for over 80% of the desktop market YouTube default on compatible browsers up to 720p AOMedia continues to explore new coding tools for the next generation codec

  5. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  6. Highly Flexible Block Partitions Current Scheme: 128x128 R R R R

  7. Highly Flexible Block Partitions Proposed Scheme: R R R R R 128x128 R R R R R

  8. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  9. New Transform Partitioning Current scheme: Rectangular Blocks Square Blocks

  10. New Transform Partitioning New extended partitioning scheme: Same partitions available for both square and rectangular blocks 0.6% coding gain, currently inter frames only

  11. Data Driven Transforms Add MDTX: Transforms learned from collected block data Different schemes for inter and intra blocks

  12. Data Driven Transforms for Inter Inter RDOTs: 2D combinations of a MDTX, flipped MDTX, and DCT 6 additional, 22 total for Inter DCT ADST FLIPADST IDTX MDTX FLIPMDTX Hor \ Ver DCT_DCT ADST_DCT FLIPADST_DCT H_DCT DCT MDTX_INTER_2 MDTX_INTER_5 DCT_ADST ADST_ADST FLIPADST_ADST H_ADST ADST DCT_FLIPADST ADST_FLIPADST FLIPADST_FLIPADST H_FLIPADST FLIPADST V_DCT V_ADST V_FLIPADST IDTX IDTX MDTX MDTX_INTER_3 MDTX_INTER_1 MDTX_INTER_7 FLIPMDTX MDTX_INTER_6 MDTX_INTER_8 MDTX_INTER_4

  13. Data Driven Mode Dependent Transforms for Intra Intra RDOTs: MDTX/MDTX, MDTX/DCT, DCT/MDTX for each mode DCT ADST IDTX MDTX (1 per mode) Hor \ Ver DCT_DCT ADST_DCT H_DCT DCT DCT_MDTX DCT_ADST ADST_ADST H_ADST ADST V_DCT V_ADST IDTX IDTX MDTX (1 per mode) MDTX_DCT MDTX_MDTX

  14. Non-separable Mode-dependent RDOTs for Intra Motivation: intra residual blocks exhibit patterns that are not horizontal/vertical Non-separable transforms tend to better capture these patterns Goal: introduce a 2D non-sep. transform for each intra mode D157_PRED (first 16 tx. functions) D67_PRED (first 16 tx. functions)

  15. Non-separable Mode-dependent RDOTs for Inra Motivation: intra residual blocks exhibit patterns that are not horizontal/vertical Non-separable transforms tend to better capture these patterns Goal: introduce a 2D non-sep. transform for each intra mode Constrained to 8x8, 4x8 and 8x4 for complexity reasons Exploring second order transforms for larger block sizes D157_PRED (first 16 tx. functions) D67_PRED (first 16 tx. functions)

  16. RD Results Separable transforms only Sep, & non-sep. transforms Lowres BD rate / instructions Midres BD rate / instructions Lowres BD rate / instructions Midres BD rate / instructions -0.699% / 308.5% -0.485% / 273.6% -0.791% / 337.8% -0.875% / 296.5% Inter + intra Inter + intra -0.447% / 284.1% -0.243% / 255.4% -0.447% / 284.1% -0.243% / 255.4% Inter only Inter only -0.346% / 242.8% -0.652% / 238.4% -0.273% / 219.6% -0.243% / 218.2% Intra only Intra only -1.086% / 435.9% -1.125% / 463.4% -0.637% / 348.5% -0.443% / 377.1% Key frames Key frames

  17. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  18. Adaptive Recursive Filtering Based Intra Prediction Fit the filter coefficients adaptively for each block being predicted: training region from already reconstructed pixels (available in both encoder and decoder) regularization coefficient filter coefficients used to predict the target block

  19. Adaptive Recursive Filtering Based Intra Prediction Available Filter Configurations , , 2-tap filters: full context , , 3-tap filters: left context , 4-tap filters: top context 54 modes 18 filter configurations 3 training regions =

  20. Adaptive Recursive Filtering Based Intra Prediction Current experiment results: Test Settings midres (keyframe- only) hdres (keyframe- only) -1.207% -1.094% 7 adaptive modes 5 adaptive modes -0.954% -1.125% 3 adaptive modes -0.721% -0.845%

  21. ML Based Entropy Coding for Intra Prior Art: Estimate the symbol probability distribution using lookup tables. Lookup tables indexed by contexts Shared initialization Lookup table Lookup table update update Entropy Coding Entropy Coding symbol symbol bitstream

  22. ML Based Entropy Coding for Intra Disadvantages: To keep the size of the tables small, we cannot afford too many contexts. The contexts have to be discrete / categorical. Proposed idea: Replace lookup tables with ML models such as logistic regression or neural network models Use the ML models as classifiers to estimate symbol probabilities

  23. ML Based Entropy Coding for Intra Shared initialization ML Model ML Model update update Entropy Coding Entropy Coding symbol symbol bitstream

  24. ML Based Entropy Coding for Intra Use logistic regression model to estimate the probabilities of intra prediction modes Use top and left neighbor blocks modes as input features 0.6% coding gains for keyframes

  25. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  26. Inter Coding Tools Motion vector prediction Illumination compensation Flexible motion vector precision

  27. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  28. In-Loop Restoration Tools Compressed Bitstream Predictor Info Best Predictor Inter / Intra Reconstruct Blocks Loss introduced T Q Coefficients Restore Frame Source block Prediction Residue Deblocking, Others [Many new opportunities] Restoration Info Reference Frame store Several tools in AV1 for in-loop restoration other than Deblocking CDEF (CLPF, Deringing) Loop Restoration (Wiener, Dual Self-guided filter)

  29. In-Loop Restoration Tools Compressed Bitstream Predictor Info Best Predictor Inter / Intra Reconstruct Blocks Loss introduced T Q Coefficients Restore Frame Source block Prediction Residue Deblocking, Others [Many new opportunities] Restoration Info Reference Frame store Replace CDEF and loop filter with CNN 3-4% coding gain Still exploring complexity reduction Non-Separable Wiener Filters

  30. Overview Background Technical Overview Block Partitions Transform Coding Tools Intra Coding Tools Inter Coding Tools In-Loop Restoration Tools Conclusion

  31. Conclusion About 2% additive BDRATE reduction over AV1 Most experiments still a work in progress Many more experiments to come, suggestions welcome!

More Related Content