Quality-Based Transcoding (QBT) – Simplifying Transcoders For Target Video Quality

For video streaming applications, three significant requirements are (1) high video quality, (2) a reliable system, and (3) optimal cost of the offering.

The current solutions in the market offer video transcoding with user settings for different transcoding parameters. It is difficult for media content companies to choose appropriate transcoding settings because –

  • multiple encoding parameters require expert knowledge,
  • choosing appropriate settings for different types of video content is not practical.
ashok magadum vidarka technologies

Ashok Magadum is the Founder and CEO of Vidarka Technologies; a company focused on the domain of High Quality, Cost Optimal, Scalable, and Reliable Video Streaming Solutions for VOD, Live and Low Latency Streaming covering OTT/FAST, EdTech, E-Shopping, Gaming, and many more application areas.

Quality-Based Transcoding (QBT) offers simple target quality settings for transcoding to media companies so that they can achieve the target video quality with optimal bit rates and, in turn, achieve an optimal cost.

In this article, we will share the results of Quality Based Transcoding for 1080p H.264/AVC and 720p H.264/AVC for mobile devices.

Also, do check out my earlier articles on H.264/AVC vs. HEVC and Chunk-based High-Speed Transcoding.

Quality Mode and Target VMAF for the Video transcoding for different target devices

Target Quality for 1080P H264 Video Encode Mode
ModeTarget deviceTarget VMAFContent
MobileMobile and Tablet95 (Mobile)Edtech, Sports, OTT, E-Shopping
EconomicalTV88OTT: TV Shows, Movies, Sports
StandardBigger size screen TV92Standard or High-Quality Movies
PremiumHigh-Quality Movies, Sports and Shows95Premium Movies, sports and shows
HDRHDR TVs97HDR Content

Need for Target Quality-based transcoding

Target quality-based transcoding refers to adjusting a video’s quality during transcoding to meet a specific target quality level.

There are several reasons why target quality-based transcoding is essential:

Bandwidth and Storage

Videos with higher quality require more bandwidth (CDN) and storage space. If the target quality level is too high, the video may take longer to upload or download and may take up too much storage space. Adjusting the target quality level allows you to optimize the video for its intended purpose and save on storage and bandwidth costs.

User Experience

A video that is too low in quality may be challenging to watch or understand. At the same time, a video that is too high in quality may take too long to buffer or may be incompatible with specific devices. By optimizing the video for its target quality level, you can ensure that it provides the best possible user experience.

Quality Control

Target quality-based transcoding can also help ensure the video meets specific quality standards. For example, if you are creating a professional video, you may need to ensure that it meets specific quality criteria; by setting a target quality level, you can ensure that the video meets these requirements.

Conventional Transcoding settings

Popular video transcoding engines provide support for various video formats.

The most popular video compression format is H.264/AVC.

Relatively newer formats HEVC, VP9, AV1, and VVC, are slowly gaining popularity. The settings which modify, fine-tune, and improve video coding efficiency are bitrate, IDR period/Key interval, GOP structure, Adaptive Quantization, scaling, and many more.

Challenges in choosing the options

The video quality is directly affected by several factors, and some of these are –

Key Frame interval and GOP structure.

  • Longer GOP structures can lead to higher compression ratios, resulting in smaller file sizes but lower video quality in erroneous conditions.
  • In contrast, shorter GOP structures can improve video quality but result in larger file sizes.

Complexity of the Content:

The complexity of the video content can affect the choice of encoding parameters. For example, videos with fast-moving scenes or complex textures may require shorter GOP structures to maintain high-quality video.

Available Bandwidth

The available bandwidth can also affect the choice of encoding parameters. Lower bandwidth may require more aggressive compression techniques, such as longer GOP structures or higher compression ratios. In comparison, higher bandwidth may allow for more relaxed parameters that result in higher-quality video.

Time and Computational Resources

Encoding video can be a computationally intensive process, and choosing the optimal encoding parameters may require significant time and resources to experiment with different settings and compare results.

How Target Quality-based Transcoding helps

When the encoding decision is left to the encoder based on the target quality and provided bitrate range, it can pick appropriate settings for various scenes and encode them, achieving the best quality and saving the bitrate significantly.

Two-pass encoders can achieve this by analyzing the video in the 1st Pass and encoding the video in the 2nd pass. But, it has certain limitations –

Two Pass Encoder Limitations

A two-pass encoder is a video compression technique that analyzes the video content in two passes to determine the optimal encoding parameters for the highest quality and compression efficiency. While two-pass encoding can provide superior results to single-pass encoding, it has some limitations.

Here are a few:

Time-consuming

Two-pass encoding can be more time-consuming than single-pass encoding, requiring two separate encoding passes. This can be a limitation for real-time applications or situations requiring fast encoding.

Less Flexible

Two-pass encoding requires that the video content be fully available before encoding begins. This can be a limitation for live streaming or other real-time applications where the video content is generated in real-time.

Increased Complexity

Two-pass encoding is a more complex technique than single-pass encoding, requiring additional processing and analysis of the video content.

Single Pass Encoder and Advantages

Single-pass video encoding, or one-pass encoding, is a video compression technique where the video is encoded in a single pass without needing multiple analysis passes. Here are some advantages of single-pass video encoding:

Speed

Single-pass encoding is generally faster than multi-pass encoding methods because it avoids the additional time required for multiple analysis passes. This makes it suitable for real-time encoding or situations where time is a critical factor.

Simplicity

Single-pass encoding is simpler to implement and use since it eliminates the need for complex analysis passes and the associated decision-making processes. It can be an efficient choice for applications that require straightforward and quick video encoding.

Real-time streaming

Single-pass encoding is particularly useful for real-time streaming applications like video conferencing, live broadcasting, or online gaming. It enables the encoding process to keep up with the real-time input, ensuring minimal delay and smooth playback.

Reduced computational resources

By eliminating the need for multiple analysis passes, single-pass encoding reduces the computational resources required. This can be advantageous in scenarios with limited hardware or computing power, such as on mobile devices or low-end computers.

Single pass encoder with few frames look ahead can provide the best prediction for Quality-Based Transcoding. Low complexity and faster turnaround time help to use this encoder for live video transcoding use cases.

H.264/AVC Encoder Quality Settings

VMAF (Video Multimethod Assessment Fusion) is a popular video quality metric that measures the perceived quality of a video. The optimal target VMAF (or VMAF score) for different video content can vary depending on the intended use, delivery platform, and viewer preferences. Here are some general guidelines for target VMAF scores for different types of video content:

Streaming platforms and Online video

For most streaming platforms and online video services, a target VMAF score between 86 and 92 is often considered a good quality. This range balances efficient compression and maintains a visually pleasing viewing experience.

High-definition (HD) content

HD content typically requires higher-quality targets. A target VMAF score of 88+ is desirable for good quality video, and a VMAF of 92 or above is often desirable for HD videos to ensure a clear and crisp viewing experience, especially for content with intricate details or fast motion. Higher VMAF provides ensures better quality.

HDR video designed to provide more color gamut and luminance range needs lesser compression losses to preserve the best quality.

Virtual Reality (VR), 360-degree videos, and Metaverse streaming

VR and 360-degree videos require a high level of immersion and quality to deliver an engaging experience. A target VMAF score of 92 or above is generally recommended to minimize artifacts, blurriness, or other issues that can diminish the sense of presence in VR environments.

Mobile Mode

For sports, Entertainment, Edtech, and E-shopping videos, mobile devices have a significant share. Video of 720p / 1080p with a 25 to 30 FPS frame rate provides a good experience. Being a small-screen 720p video with good quality can provide an excellent visual experience to users. The table below lists different classes of videos, Video bitrate with QBT & VMAF with on-mobile mode. The target mobile VMAF is 95.

Target BR: 1600Kbps Actual1280×720 
 Mobile: VMAFBR in KbpsBR Savings
TV Show1100.00493.0069.19%
TV Show299.23682.0057.38%
TV Show399.05896.0044.00%
TV Show499.461007.0037.06%
Movie198.48560.0065.00%
Movie299.901047.0034.56%
Movie399.511097.0031.44%
Animation: Movie499.13927.0042.06%
Animation: Movie599.05923.0042.31%
Sports196.921525.005.06%
Sports298.361098.0031.38%
Sports396.171465.0033.44%
Sports499.891423.0011.06%
Sports596.42921.0042.44%
Average 98.70  947.4338.96% 

The average bitrate for 720P is 947Kbps, where the range of bitrates is 493Kbps to 1525kbps. The average VMAF is 98.7, and min and max are 96.17 to 100. This mode of transcoding is suitable for Mobile and tablet devices. Here we can save bandwidth along with ensuring high video quality.

Economical Quality

For the Economical Mode, the target VMAF is 88 for 1080P Video. Most TV shows, movies, and News content can be streamed in this mode with good visual experience.

Target BR: 3000Kbps Actual1920×1080 
 VMAFBR in KbpsBR Savings
TV Show192.0482872.40%
TV Show289.40122459.20%
TV Show389.52161346.23%
TV Show489.69191236.27%
Movie187.92106364.57%
Movie292.81196034.67%
Movie392.45210929.70%
Animation: Movie492.02175741.43%
Animation: Movie590.21161246.27%
Sports187.3528754.17%
Sports292.78213728.77%
Sports389.00207130.97%
Sports494.5927358.83%
Sports586.46207130.97%
 Average90.45 1854.79  38.17%

For 1080P videos, the average bitrate is around 1860Kbps. The average VMAF is 90+. Min and max VMAFs are 86.46 and 92.81. Visually provides good quality.

Standard Quality

For the Standard Mode, the target VMAF is 92. In many cases, the transparent quality is considered as VMAF of 95, and +/- 3 VMAF points are nearly non-differential. Hence this mode provides very good quality for premium sports and shows.

Target BR: 4000Kbps         Actual1920×1080 
           VMAFBR in KbpsBR Savings
TV Show193.67103774.08%
TV Show292.00158860.30%
TV Show392.05208247.95%
TV Show492.47260534.88%
Movie191.83149162.73%
Movie295.02247438.15%
Movie394.68278430.40%
Animation: Movie494.14226143.48%
Animation: Movie592.64203349.18%
Sports191.79331417.15%
Sports297.10327218.20%
Sports395.45318320.43%
Sports496.9039022.45%
Sports593.28315821.05%
 Average 93.79 2513.14 37.17%

For 1080P videos, the average bitrate is around 2514Kbps. The average VMAF is 93+. Min and Max VMAFs are 92.79 and 96.9. Visually provides very good quality.

Premium Quality

For the Premium Mode, the target VMAF is 95, the transparent video quality target. This mode provides original-quality video for premium movies, sports, and shows.

Target BR: 5000Kbps5000 Kbps1920×1080
VMAFBR in KbpsBR Savings
TV Show195.84177864.44%
TV Show294.96253949.22%
TV Show394.76348230.36%
TV Show495.08410717.86%
Movie194.41213157.38%
Movie297.49390921.82%
Movie397.25409318.14%
Animation: Movie496.75380323.94%
Animation: Movie595.81328634.28%
Sports194.40412417.52%
Sports297.90407218.56%
Sports396.86396920.62%
Sports497.38445210.96%
Sports595.44395120.98%
Average96.023549.7129.01%

For 1080P videos, the average bitrate is around 3550Kbps. The average VMAF is 96+. Min and Max VMAFs are 94.76 and 97.49. Visually provides better quality.

HDR Quality

The desired quality video is very high for HDR video to retain the color information and brightness range of information. The target VMAF is 97. This mode provides the desired dynamic range of luminance and color gamut.

Target BR: 6000Kbps Actual1920×1080 
 VMAFBR in KbpsBR Savings
TV Show197.51534210.97%
TV Show296.96534510.92%
TV Show396.89515414.10%
TV Show496.8755986.70%
Movie196.78381836.37%
Movie298.0854359.42%
Movie398.1055577.38%
Animation: Movie497.7856066.57%
Animation: Movie597.6255038.28%
Sports197.5256455.92%
Sports298.5655677.22%
Sports397.9254419.32%
Sports497.316207-3.45%
Sports597.4054299.52%
 Average 97.52 5403.369.94% 

For 1080P videos, the average bitrate is around 5400Kbps. The average VMAF is 97.5. Min and Max VMAFs are 96.78 and 98.56.

Summary and Conclusion

With QBT (Quality-Based Transcoding), the desired quality of the video can be achieved along with bitrate savings. For Mobile Video, a VMAF of 95+ can be achieved with an average of 950 Kbps bitrate for 720P.

For 1080P for TV devices Economical mode achieves an average VMAF 90+ of 1850 Kbps bitrate. The standard mode achieves a VMAF of 93+ with 2513 Kbps. The Premium mode provides an average VMAF of 96 with an average bitrate of 3550 Kbps.

The HDR mode provides an average VMAF of 97.52 with an average bitrate of 5400 Kbps.

We conclude that Quality-Based Transcoding has a simple interface, single pass optimized processing, target quality videos, and bandwidth savings, making it the best solution for VOD and Live streaming applications.

    Ashok Magadum
    Ashok Magadum
    Founder and CEO at Vidarka Technologies

    Ashok Magadum is the Founder and CEO of Vidarka Technologies. He has a Master's from Manipal University and 21+ years of experience in the design and development of various Video codecs, to name a few MPEG-2/AVC/SVC/HEVC/Dolby Vision/Content Adaptive, Video Transcoding, Cloud Media Streaming, computer vision, ADAS, DSP, Embedded Systems and various Multimedia Technologies.

    He has worked for various startups such as Ittiam System, Zenverge Inc, Saranyu Technologies, and MNCs like NXP, Freescale, and Samsung for the design and development of many solutions.

    Website: www.vidarka.com

    For any queries or more details, please reach us at [email protected].

    1200x200-Pallycon

    Leave a Comment

    Your email address will not be published. Required fields are marked *

    Enjoying this article? Subscribe to OTTVerse and receive exclusive news and information from the OTT Industry.