Reporting by Jon Doty and Rachel Wise.
IBM Watson Speech to Text is a service that uses machine intelligence to convert the spoken word into written transcriptions. Pietro Passarelli, a Knight-Mozilla Fellow at Vox Media, has integrated this technology into an open-source tool that can turn video interviews into edited stories.
For more information:
- The tool Pietro Passarelli describes is AutoEdit, an open-source project created as part of his Knight-Mozilla fellowship with the Vox Media product team.
- AutoEdit takes no more than five minutes to transcribe an interview. According to Passarelli, the application breaks apart interviews, transcribes the speech, then reassembles all the pieces.
- The IBM Speech to Text service provides an API that allows users to add speech transcription capabilities into applications. To transcribe accurately, “the service leverages machine intelligence to combine information about grammar and language structure with knowledge of the composition of the audio signal. The service continuously returns and retroactively updates the transcription as more speech is heard.”
- The IBM Watson Speech to Text service uses speech recognition capabilities to convert Arabic, English, Spanish, French, Brazilian Portuguese, Japanese and Mandarin speech into text. It supports uncompressed audio files up to 100MB.
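
The split, transcribe and reassemble flow Passarelli describes can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not AutoEdit's actual code: the names `Segment`, `split_interview` and `transcribe_interview` are hypothetical, the 60-second chunk length is arbitrary, and the `transcribe_chunk` callback stands in for whatever call sends a piece of audio to a speech-to-text service. Running the chunks concurrently is also an assumption about why splitting might speed things up.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Segment:
    """A slice of the interview, in seconds from its start (hypothetical type)."""
    start: float
    end: float


def split_interview(duration: float, chunk_seconds: float = 60.0) -> List[Segment]:
    """Break an interview of `duration` seconds into consecutive chunks."""
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")
    segments = []
    start = 0.0
    while start < duration:
        end = min(start + chunk_seconds, duration)
        segments.append(Segment(start, end))
        start = end
    return segments


def transcribe_interview(
    duration: float,
    transcribe_chunk: Callable[[Segment], str],
    chunk_seconds: float = 60.0,
) -> List[Tuple[float, str]]:
    """Transcribe each chunk, then reassemble the text in timeline order.

    `transcribe_chunk` is a placeholder supplied by the caller; it is assumed
    to return the text spoken during the given segment.
    """
    segments = split_interview(duration, chunk_seconds)
    # Assumption: chunks are independent, so they can be processed in parallel.
    # executor.map returns results in the same order as its input.
    with ThreadPoolExecutor(max_workers=4) as executor:
        texts = list(executor.map(transcribe_chunk, segments))
    # Pair each chunk's start time with its text so the pieces stay in order.
    return [(seg.start, text) for seg, text in zip(segments, texts)]


if __name__ == "__main__":
    # Stand-in transcriber that only reports which slice it was given.
    def fake(seg: Segment) -> str:
        return f"speech from {seg.start:.0f}s to {seg.end:.0f}s"

    for start, text in transcribe_interview(125, fake):
        print(start, text)
```

Keeping the start time alongside each piece of text is what lets the pieces go back together in order, and it is also the kind of information an editing tool would need to link a line of transcript back to its place in the video.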
Rachel Wise is an editor at the Futures Lab at the Reynolds Journalism Institute and co-producer of the weekly Futures Lab video update.
The Reynolds Journalism Institute’s Futures Lab video update features a roundup of fresh ideas, techniques and developments to help spark innovation and change in newsrooms across all media platforms. Visit the RJI website for the full archive of Futures Lab videos, or download the iPad app to watch the show wherever you go. You can also sign up to receive email notification of each new episode.