Speech-to-Text: Dictation software for OS X
[Last updated 5 June 2009]
Speech-to-text software, sometimes known as dictation software, is something that lets you talk to the computer in some form and have the computer react appropriately to what you are saying. This is totally different to text-to-speech software, which is software can read out text already in the computer.
There are two types of speech-to-text software available. One type is called “command and control” and it lets you speak commands to your computer to control it; hence the name. For example, a command that the computer understands might be, “go to the Apple website” or, “tell me the time”. Each command is pre-programmed and the computer will only recognise those commands it’s been programmed for; you can’t use this software to write an email or use iChat for example.
Command and control software for the Mac - known as “Speakable Items” (or sometimes, confusingly, “spoken commands”) - is already built into every OS X computer, although most people don’t know about it. You don’t need to download, buy, or install anything to get this software to work, just a microphone that works with your computer. The main drawback is that the Speakable Items software programmed for English with a standard American accent, and has significant trouble with any other accent. It doesn’t function at all with languages other than English.
Some resources for getting you up and running with Speakable Items include:
- MacWorld Article about Speakable Safari Bookmarks and other aspects of OS X Speakable Items
- Apple Support Articles about Speakable Items
The other type of speech-to-text software is usually called “dictation” software. This is the type that lets you write an article like this one, type stuff to your friends in iChat, or type an email. There used to be a version of IBM’s ViaVoice for OS X but it hasn’t been updated for several years and is no longer available. There is only one dictation-capable speech-to-text software available for OS X which is being updated and developed and it’s MacSpeech Dictate. Dictate is the successor to a program named iListen which MacSpeech used to produce.
Like all dictation-capable text-to-speech products, MacSpeech Dictate works very well for some people and very badly for others. Whether it will work for you depends on many things including: how much effort you’re willing to put into learning it, how good your microphone is, your age (text to speech usually works less well for children), how much your accent matches what the program expects, and whether your voice changes a lot through the day.
MacSpeech Dictate is also still fairly new software - it was only released on the 15th of February, 2008. In comparison, the premiere speech recognition program for Windows is Dragon Naturally Speaking which has been in development since the 1980s[1].
When MacSpeech Dictate was originally released it had several major problems which made it unusable for people with disabilities, but most of these have now been resolved:
- There was no good help functions inside the application - this was rectified in Dictate version 1.3
- It didn’t learn from corrections - this was rectified in Dictate version 1.2
- Couldn’t spell words out by voice - this was rectified in Dictate version 1.2
- Couldn’t request individual key presses (such as command-s or command-option-escape) by voice - this was rectified in Dictate version 1.3
- Couldn’t be taught new words, such as names or jargon specific to your profession - this was largely rectified in Dictate version 1.2, although some words still resist training
- There is no way to control the mouse by voice - still outstanding, although mouse clicks by voice can be accomplished with a consumer-level add-on.
I tried using the old iListen program a few years ago and could not get results that were useful, an on-screen keyboard was the best solution at the time. Although MacSpeech Dictate is in its early days as a program, its recognition of my particular voice is hugely better than iListen’s was. This is not surprising though, as MacSpeech Dictate’s speech recognition engine is based on the same engine used by Windows’ Dragon NaturallySpeaking - widely recognised as the best consumer speech recognition available.
MacSpeech Dictate- US Spelling
- American
- American - Inland Northern
- American - Southern
- American - Teens
- Australian
- British
- Indian
- Latino
- Southeast Asian
- UK Spelling
- Australian
- British
- Indian
- Southeast Asian
Website: MacSpeech Dictate
- Ricky Buchanan











I am an old rascal (pilot WW II). I am in pretty good shape for my age, with one exception, I am profoundly deaf. What I need to communicate is something like an ipod with speech to text. I’m sure there are may people with the same problem. The problem is social isolation which lowers the quality of life.
HELP!
@Alan: Unfortunately, the technology isn’t good enough for what you need yet. The iPods can’t do speech to text and even if you carried a laptop around it’s speech to text functions have to be trained for a specific person’s voice, and that person has to speak very clearly and carefully for the speech to text to work. So the sucky thing is that the computers just aren’t smart enough for you yet
I’m not sure there’s anything technology wise that can help you with chatting with a random person. I can’t think of anything that would really be better than a pencil and paper and getting people to write stuff down for you.
I’m at Accessing Higher Ground and heard of a nice device called the Ubi Duo (demo at Grant Laird, Jr’s Blog).
It looks pretty cool!
@conniereece http://tinyurl.com/787fbw http://tinyurl.com/9cvnod
I’m not sure is this is the right place to post this plea/request for help. I guess…in the 9 months or so I’ve developed what I thought was mild carpal tunnel…it probably was…but due to a downsizing and incompetent work environ ( i teach pro apps, etc)…It’s morphed into something…else.
Maybe fybromyalgia, who knows…
Point is. I can’t afford an intel mac, and my livelihood…and all my past 12 year docs and stuff are mac based.
So I’m really stuck in Leopard/PPc G4 land…
what do i do?
i do think ilisten or something would help a little even if it’s not great, but you can’t buy it anymore.
Any help you can think of would be much appreciated. I’ve looked … legacy sites, questionably legal torrents, memory maximizers, on and on…
And can’t do much more.
So anyway, if this sounds a little desperate, it is. And I’m definitely over my typing quotient for the day.
Thanks either way. This is a great site.
Eleo
@Eleo: I suspect if you contacted the MacSpeech people they may be able to help get you an old copy of iListen, or there may be one available on eBay. You should also look into text expansion utilities such as TextExpander and TypeIt4Me where you can define shortcuts for words. There are several other free and low cost strategies listed in the article Seven ways to stop your arms hurting. Best of luck.
I have physical challenges that would make a good speech to text program very helpful to me as a writer. I’ve tried an old version of naturally speaking (4) but it doesn’t work very well for poetry as it doesn’t understand the form (line breaks instead of paragraphs, for example) and is more trouble than help in many ways- ok for a first draft of individual poem but no good for editing or a large manuscript.
Would Macspeech dictate be any better for me?
@Gail: I’ve never tried using Macspeech Dictate for poetry but I don’t think there would be a big problem with it. You could always use ‘keystroke shift enter’ for inserting line breaks, if the usual commands weren’t to your satisfaction. I don’t think you’ll really know unless you try it though, unfortunately.