-
Website
http://mashable.com/ -
Original page
http://mashable.com/2006/03/14/castingwords-podcast-transcriptions/ -
Subscribe
All Comments -
Community
-
Top Commenters
-
Robert Basil
142 comments · 8 points
-
Jennifer Van Grove
149 comments · 23 points
-
r0cketman22
317 comments · 52 points
-
rajagiri4
160 comments · 2 points
-
barringtonarch
150 comments · 4 points
-
-
Popular Threads
-
Enter the Zappos Sharing Happiness $3,000 Shopping Spree Giveaway Contest
5 hours ago · 94 comments
-
Head to Head: Chrome for Mac vs. Chrome for Windows
1 hour ago · 7 comments
-
iPhone App Offers Instant Speech-to-Text Transcription
3 hours ago · 17 comments
-
Your Next Car Radio Might Be Pandora
4 hours ago · 21 comments
-
Google Launches Chrome for Mac
6 hours ago · 28 comments
-
Enter the Zappos Sharing Happiness $3,000 Shopping Spree Giveaway Contest
I worked on a voice transcription automation project for about 3 months as a proof of concept. The project was trying to evaluate whether the voice to text tools available could take a recorded business meeting and transcribe it so that it could be readable and searchable on the corporate intranet portal. After much mucking with the tools, trying to train them and testing different scenarios, we had to conclude that it wasn't a viable option within the current abilities of the technologies.
Looking at the percentage of correct transcription even on a highly trained transcriber with only a single person speaking, it just didn't work very well.
Adding the problem of having it figure out which of several speakers was speaking was even further beyond the technology. And the fact that there were so many industry specific terms (acronyms or even more common words that mean different things in the context of the industry and so were mis-transcribed easily), just complicated things even more.
Voice recognition works fine for a very limited vocabulary, for instance, when you call the bank and it asks you to say things to choose options. Those are single or small numbers of words that it's already expecting. Transcription is fast paced and not predefined.
Yes, things have probably changed in 2.5 years, but I'd be suprised if these issues have been solved. If they had I would have expected a lot more buzz about voice to text. Instead it seems like that area is pretty dead.
A human transcribing a podcast, especially if there are several people speaking and specialized terminology, is really the only viable option, I think. Casting Words seems to have been relatively successful so far with the accuracy of their transcriptions (at least according to their presentation at ETech).
Too bad it's not easier and automatable, but it's a nice test case for Mechanical Turk for sure.
HORRIBLE business model
Hopefully they employ mashable for some strategy consulting before it's too late.
We hope so.
Yeah, sorry - I shouldn't have used such an old quote. I love the idea of using MTurk to power a company - I hope it works out well!
Interesting.
Regards,
Steve.