Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Low Adoption of <track> Element #10866

Open
yashrajbharti opened this issue Dec 16, 2024 · 9 comments
Open

Low Adoption of <track> Element #10866

yashrajbharti opened this issue Dec 16, 2024 · 9 comments

Comments

@yashrajbharti
Copy link

yashrajbharti commented Dec 16, 2024

What is the issue with the HTML Standard?

Inaccessibility of Web Videos Due to Low <track> Adoption

Only 0.5% of web videos include a <track> tag (Web Almanac 2024 Report), leaving the vast majority of online video content inaccessible for individuals who rely on captions.

Impact

  • Excludes people with hearing impairments, non-native speakers, and others who benefit from captions.
  • Undermines accessibility standards on the web, which aim to make content inclusive by default.

Why This Matters

The <track> element was introduced to provide captions, but its low adoption highlights significant barriers for developers, such as:

  • Lack of awareness or knowledge about the <track> element.
  • Additional effort required to create and manage caption files.
  • Limited incentives for publishers to prioritize captions.

Gap in the Standard

The standard does not currently address how browsers or user agents might provide fallback mechanisms for captions when no <track> is present. This gap leaves accessibility largely dependent on content authors, which is not a scalable or equitable solution.

@yashrajbharti yashrajbharti changed the title Low Adoption of <track> Element Results in Accessibility Barriers for Web Videos Low Adoption of <track> Element Dec 16, 2024
@aardrian
Copy link

aardrian commented Jan 3, 2025

The standard does not currently address how browsers or user agents might provide fallback mechanisms for captions when no is present.

That feels like it's outside the scope of HTML. Partly because:

  • that involves UAs creating content from scratch (remember that captions represent more than spoken dialogue),
  • UAs and OSes already offer platform-level features to auto-generate captions from speech (I can do it on my Windows machine and Android device today, though they are far from ideal),
  • Audio Description (one of the uses of <track>) has no support in browsers today, so some of that gap is from UAs, not authors.

Here are existing bugs about lack of AD support:

To address two of your points about barriers to developers:

  • Additional effort required to create and manage caption files.

This is also true for any alternatives to media files, including AD and alternative text, which are realities of supporting all users.

  • Limited incentives for publishers to prioritize captions.

True for any accessibility considerations, and why arguments proliferate around business cases and SEO and branding, but still boil down to risk under existing laws (which incorporate or reference WCAG).

This gap leaves accessibility largely dependent on content authors, which is not a scalable or equitable solution.

I agree there is a gap from authors, but do you have a proposal for how HTML could make authors do better or how, as a technical standard, HTML could define some sort of fallback?

@yashrajbharti
Copy link
Author

Thank you for your thoughtful response! I appreciate the points you've raised. I understand the concerns about the scope of the proposal, particularly around user agents (UAs) generating content from scratch, such as captions.

However, I want to emphasize that the issue of the lack of captions in web videos is significant, with only 0.5% of web videos currently having captions, as you can see from resources like Web Almanac by HTTPArchive.

Audio Description (one of the uses of <track>) has no support in browsers today

I agree that there are gaps in support for Audio Descriptions (AD), and I'm aware that work is being done on this. However, this shouldn’t detract from the fact that captions, which are more readily available, can still play a crucial role. The track element offers a potential solution, and UAs can certainly take on the responsibility of handling cases where captions are missing.

Captions represent more than spoken dialogue

I completely agree. Captions often need to reflect non-verbal sounds or other relevant context. However, in the absence of comprehensive captions, even a basic fallback would be a huge step forward. It’s certainly not perfect, but it’s better than no captions on the web. It can also get better over time with this mindset.

Chromium: 1447858 Feature Request: text-based Audio Descriptions

I don’t feel aligned with the notion that the lack of AD support should dilute the importance of the issue at hand. While AD support is being addressed separately (Compatibility chart), captions are a crucial part of the accessibility conversation, and pursuing solutions for captions should remain a priority.

I agree there is a gap from authors, but do you have a proposal for how HTML could make authors do better or how

I’m open to feedback and ideas, and I think it’s important for the community to brainstorm solutions. The existing 0.5% and 0.1% availability rates for captions and audio captions are very low, and I don’t believe that pushing the issue out of scope will help solve the problem.

@aardrian
Copy link

aardrian commented Jan 3, 2025

So your issue is about low adoption of closed captions specifically (provided in HTML via the <track> element). Got it. And you have identified a gap but have not proposed a solution. Which is fine. And are asking for feedback and ideas. Ok.

Here is my feedback / ideas:

  • There is plenty of pre-existing content out there on best practices, how to make captions, legal requirements, etc. You can raid all of that to propose some suggestions for fallback content.
  • The Understanding doc for SC 1.2.2 offers some starting points (and guardrails for failures) for your effort.
  • Starting a discussion with the W3C working group might be a good idea. That is more likely to include dedicated accessibility stakeholders given its focus.

But absent suggestions, I still think this is out of scope for HTML.

As an aside...

I don’t feel aligned with the notion that the lack of AD support should dilute the importance of the issue at hand. While AD support is being addressed separately (Compatibility chart) [...]

I am aware of the MDN compat chart. I am responsible for its accuracy. But it does not speak to support "being addressed separately". The issues I linked above speak to that.

@ericwbailey
Copy link

ericwbailey commented Jan 3, 2025

I'm not sure this is the right place to voice this sort of thing—the lack of adoption of the standard does not speak to its efficacy.

track's usage being low is like ruby or other elements, in that it's author responsibility. If you're angling for a heuristics-based UA fallback solution it might be more effective to bring it up with browser vendors.

@yashrajbharti
Copy link
Author

yashrajbharti commented Jan 3, 2025

Thanks a lot!
I intend to take as much feedback and tips in doing so. Currently instead of the solutions, I am more focused on the problem and its use cases. And I would love to understand better how to go about it from a11y experts.

I feel the problem statement, "Only 0.5% of web videos include a <track> tag, leaving a majority of online video content inaccessible for individuals who rely on captions", is sufficient for opening an issue here.

I think, unlike ruby, low adoption of track has its implications on web accessibility.

Also, as per https://whatwg.org/faq#adding-new-features the kind of issue that the WHATWG process prefers: A problem description, without any proposed solution required or expected, and I shall be looking to draft feature requests to various vendors as well.

@eric-carlson
Copy link

  • Audio Description (one of the uses of <track>) has no support in browsers today, so some of that gap is from UAs, not authors.

WebKit has experimental support for both Standard and Extended Audio Descriptions. It uses the text to speech engine to render cues from a WebVTT track who's kind is descriptions.

When Extended AD is enabled, playback is automatically paused if it takes longer to speak a cue's text than the cue's specified duration. If playback is paused, it is automatically resumed when the utterance finishes.

When only Standard AD is enabled, playback is not automatically paused even if the text to speech engine has not finished speaking the cue text after the cue's end time.

These features are not enabled by default yet, but you can try them by selecting Settings from the "Safari" menu, selecting the Feature Flags pane, typing "description" in the Search box, and checking one or both features.

@eric-carlson
Copy link

Only 0.5% of web videos include a <track> tag (Web Almanac 2024 Report), leaving the vast majority of online video content inaccessible for individuals who rely on captions.

One thing that is difficult to measure is the number of web videos that have captions but do not include a <track> element because the page authors chose to render the captions from JavaScript instead of using the browser's native caption support. This is done by some of the largest video-centric sites, e.g. YouTube, Netflix, BBC, etc

@aardrian
Copy link

aardrian commented Jan 9, 2025

@eric-carlson

WebKit has experimental support for both Standard and Extended Audio Descriptions. […]

I very much appreciate your feedback. I followed your steps. No luck. As this is not the right venue to sort out WebKit support (or my bad test code), I documented my steps on the WebKit issue.

@eric-carlson
Copy link

@aardrian

I very much appreciate your feedback. I followed your steps. No luck. As this is not the right venue to sort out WebKit support (or my bad test code), I documented my steps on the WebKit issue.

Thank you for filing the WebKit issue. There is nothing wrong with your test code, I just forgot one important step in my description above - you need to enable Audio Descriptions in system preferences:

  • Open System Settings
  • Open the Accessibility panel
  • Click on Audio Descriptions
  • Enable the Play audio descriptions when available toggle

I added this to the WebKit issue you filed, but I am adding it here as well for those that don't click through.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

4 participants