RJI Futures Lab #171: IBM Watson Speech to Text

November 4, 2016

autoEdit is an open source tool that can help journalists transcribe video and audio.

Reporting by Jon Doty and Rachel Wise.

IBM Watson Speech to Text is a service that uses machine intelligence to convert the spoken word into written transcriptions. Pietro Passarelli, a Knight-Mozilla Fellow at Vox Media, has integrated this technology into an open-source tool that can turn video interviews into edited stories.
Reporting by Jon Doty

For more information:

The tool Pietro Passarelli describes is AutoEdit, an open-source project created as part of his Knight-Mozilla fellowship with the Vox Media product team.
AutoEdit takes no more than five minutes to transcribe an interview. According to Passarelli, the application breaks apart interviews, transcribes the speech, then reassembles all the pieces.
The IBM Speech to Text service provides an API that allows users to add speech transcription capabilities into applications. To transcribe accurately, “the service leverages machine intelligence to combine information about grammar and language structure with knowledge of the composition of the audio signal. The service continuously returns and retroactively updates the transcription as more speech is heard.”
The IBM Watson Speech to Text service uses speech recognition capabilities to convert Arabic, English, Spanish, French, Brazilian Portuguese, Japanese and Mandarin speech into text. It supportsuncompressed audio files up to 100MB.

Rachel Wise is an editor at the Futures Lab at the Reynolds Journalism Institute and co-producer of the weekly Futures Lab video update.

RJI Futures Lab web banner The Reynolds Journalism Institute’s Futures Lab video update features a roundup of fresh ideas, techniques and developments to help spark innovation and change in newsrooms across all media platforms. Visit the RJI website for the full archive of Futures Lab videos, or download the iPad app to watch the show wherever you go. You can also sign up to receive email notification of each new episode.

Comments are closed.

Who We Are

MediaShift is the premier destination for insight and analysis at the intersection of media and technology. The MediaShift network includes MediaShift, EducationShift, MetricShift and Idea Lab, as well as workshops and weekend hackathons, email newsletters, a weekly podcast and a series of DigitalEd online trainings.

About MediaShift »
Contact us »
Sponsor MediaShift »
MediaShift Newsletters »

Follow us on Social Media

@MediaShiftorg
@Mediatwit
@MediaShiftPod
Facebook.com/MediaShift

RJI Futures Lab #171: IBM Watson Speech to Text

Who We Are

Follow us on Social Media

MediaShift

Mark Glaser