Risk
3/4/2010
04:52 PM
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
50%
50%

YouTube Gets Auto-Captioning From Google Speech Tech

The video sharing site now allows anyone to generate captions for videos with coherent spoken English.

In a move to make its massive store of video content more accessible, Google's YouTube is making automated caption generation available to all YouTube users.

YouTube initially added a caption feature in 2008. Last November, it introduced auto-captioning for a select group of partners.

Now, any video created with a clear audio track -- unless disallowed, an option for some of YouTube's content partners -- can be captioned automatically, thanks to the speech-to-text algorithms that power Google Voice Search.

What's more, those captions can be translated from English into one of 50 supported languages at the viewer's discretion.

At the moment, auto-captioning only works in videos with spoken English, but Google product manager Hiroto Tokusei says in a blog post that YouTube plans to support the captioning of more languages in the months ahead.

In a related effort, Google is also working to turn Android phones into universal translators through a combination of speech-to-text and translation technology.

"For content owners, the power of auto-captioning is significant," said Tokusei. "With just a few quick clicks your videos can be accessed by a whole new global audience. And captions can make is easier for users to discover content on YouTube."

Captions, as text content, are useful to Google as a way to improve search relevancy. And with the volume of information that Google has to manage -- over 20 hours of video are uploaded to YouTube every minute -- every improvement helps.

Although speech-to-text conversion isn't perfect, Tokusei says that Google's technology is getting better. Video owners can also improve caption files by downloading them, making corrections, and then uploading them back to YouTube.

Other Google accessibility projects include a talking RSS reader for Android devices, support for WAI-ARIA, the Accessible Rich Internet Applications Suite, in Google Chrome, and support for the AxsJAX framework.

About 650 million people live with a disability, according to the UN.

By 2015, Professor Adrian Davis of the British MRC Institute of Hearing Research estimates that more than 700 million people will be suffering from hearing loss of more than 25 dB, a consequence both of aging and of exposure to noise, among other causes.

Last October, Google consolidated its accessibility resources at a single Web address.

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
Register for Dark Reading Newsletters
White Papers
Video
Cartoon Contest
Write a Caption, Win a Starbucks Card! Click Here
Latest Comment: This comment is waiting for review by our moderators.
Current Issue
Security Operations and IT Operations: Finding the Path to Collaboration
A wide gulf has emerged between SOC and NOC teams that's keeping both of them from assuring the confidentiality, integrity, and availability of IT systems. Here's how experts think it should be bridged.
Flash Poll
New Best Practices for Secure App Development
New Best Practices for Secure App Development
The transition from DevOps to SecDevOps is combining with the move toward cloud computing to create new challenges - and new opportunities - for the information security team. Download this report, to learn about the new best practices for secure application development.
Slideshows
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2017-0290
Published: 2017-05-09
NScript in mpengine in Microsoft Malware Protection Engine with Engine Version before 1.1.13704.0, as used in Windows Defender and other products, allows remote attackers to execute arbitrary code or cause a denial of service (type confusion and application crash) via crafted JavaScript code within ...

CVE-2016-10369
Published: 2017-05-08
unixsocket.c in lxterminal through 0.3.0 insecurely uses /tmp for a socket file, allowing a local user to cause a denial of service (preventing terminal launch), or possibly have other impact (bypassing terminal access control).

CVE-2016-8202
Published: 2017-05-08
A privilege escalation vulnerability in Brocade Fibre Channel SAN products running Brocade Fabric OS (FOS) releases earlier than v7.4.1d and v8.0.1b could allow an authenticated attacker to elevate the privileges of user accounts accessing the system via command line interface. With affected version...

CVE-2016-8209
Published: 2017-05-08
Improper checks for unusual or exceptional conditions in Brocade NetIron 05.8.00 and later releases up to and including 06.1.00, when the Management Module is continuously scanned on port 22, may allow attackers to cause a denial of service (crash and reload) of the management module.

CVE-2017-0890
Published: 2017-05-08
Nextcloud Server before 11.0.3 is vulnerable to an inadequate escaping leading to a XSS vulnerability in the search module. To be exploitable a user has to write or paste malicious content into the search dialogue.

Dark Reading Radio
Archived Dark Reading Radio
In past years, security researchers have discovered ways to hack cars, medical devices, automated teller machines, and many other targets. Dark Reading Executive Editor Kelly Jackson Higgins hosts researcher Samy Kamkar and Levi Gundert, vice president of threat intelligence at Recorded Future, to discuss some of 2016's most unusual and creative hacks by white hats, and what these new vulnerabilities might mean for the coming year.