Posts Tagged with 'mac-speech'

Speech-to-Text: Dictation software for Mac OS X

A microphone

Speech-to-text software, sometimes known as dictation software, is something that lets you talk to the computer in some form and have the computer react appropriately to what you are saying. This is totally different to text-to-speech software, which is software can read out text already in the computer.

Command and Control Software

There are two types of speech-to-text software available. One type is called "command and control" and it lets you speak commands to your computer to control it; hence the name. For example, a command that the computer understands might be, "go to the Apple website" or, "tell me the time". Each command is pre-programmed and the computer will only recognise those commands it's been programmed for; you can't use this software to write an email or use iChat for example.

Command and control software for the Mac - known as "Speakable Items" (or sometimes, confusingly, "spoken commands") - is already built into every OS X computer, although most people don't know about it. You don't need to download, buy, or install anything to get this software to work, just a microphone that works with your computer. The main drawback is that the Speakable Items software programmed for English with a standard American accent, and has significant trouble with any other accent. It doesn't function at all with languages other than English.

Some resources for getting you up and running with Speakable Items include:

Dictation Software

The other type of speech-to-text software is usually called "dictation" software. This is the type that lets you write an article like this one, type stuff to your friends in iChat, or type an email. The most common Windows software for speech to text dictation - you've probably heard of it - is Dragon NaturallySpeaking. There is only one dictation-capable speech-to-text software available for OS X which is being updated and developed and it's [msd]. Dictate is the successor to a program named iListen which MacSpeech used to produce.

MacSpeech Dictate iconLike all dictation-capable text-to-speech products, MacSpeech Dictate works very well for some people and very badly for others. Whether it will work for you depends on many things including: how much effort you're willing to put into learning it, how good your microphone is, your age (text to speech usually works less well for children), how much your accent matches what the program expects, and whether your voice changes a lot through the day.

MacSpeech Dictate is also still fairly new software - it was only released on the 15th of February, 2008. In comparison, the premiere speech recognition program for Windows is Dragon NaturallySpeaking which has been in development since the 1980s[1].

When MacSpeech Dictate was originally released it had several major problems which made it unusable for people with disabilities, but most of these have now been resolved:

  • There was no good help functions inside the application - this was rectified in Dictate version 1.3
  • It didn't learn from corrections - this was rectified in Dictate version 1.2
  • Couldn't spell words out by voice - this was rectified in Dictate version 1.2
  • Couldn't request individual key presses (such as command-s or command-option-escape) by voice - this was rectified in Dictate version 1.3
  • Couldn't be taught new words, such as names or jargon specific to your profession - this was largely rectified in Dictate version 1.2, although some words still resist training
  • There was no way to control the mouse by voice - this was finally rectified in Dictate version 2.0.

I tried using the old iListen program a few years ago and could not get results that were useful, an on-screen keyboard was the best solution at the time. Although MacSpeech Dictate is in its early days as a program, its recognition of my particular voice is hugely better than iListen's was. This is not surprising though, as MacSpeech Dictate's speech recognition engine is based on the same engine used by Windows' Dragon NaturallySpeaking - widely recognised as the best consumer speech recognition available.

[msd] requires the requires Intel-based Macintosh hardware and requires Mac OS X 10.5.6 (Leopard) and higher. Thirteen English dialects/accents are supported, and US and UK spelling options. These are:

  • US Spelling
    • American
    • American - Inland Northern
    • American - Southern
    • American - Teens
    • Australian
    • British
    • Indian
    • Latino
    • Southeast Asian
  • UK Spelling
    • Australian

    • British

    • Indian

    • Southeast Asian

Specialised versions - Dictate Medical and Dictate Legal - are available for dictating in these language areas, and Dictate International is now available and recognises speech in French, German, and Italian. MacSpeech have strongly hinted that Spanish language recognition is next on their agenda.

MacSpeech Dictate is a great program for dictation and some computer control, but it is not something that will let you control the computer completely "hands free". For quadriplegic users and others who need full computer control, you will need to supplement Dictate with use of a mouth stick and keyboard, or a program such as SwitchXS for switch access to functions not available by voice. I highly recommend Dictate though, it's part of my suite of accessibility technology and I use it whenever I am able to.

Website: [msd]

- Ricky Buchanan

[msddisclaim]

[msdbanner]

Dragon Dictate 2.0 Review

Icon for Dragon DictateI've been using Dragon Dictate 2.0 for Mac for about two weeks along with Growl. Overall, I'm glad I upgraded although there are still some problems with the system.

Installing the upgrade was very easy. In fact, I still have MacSpeech Dictate 1.5.9 on my computer because while I paid for an upgrade what I got was a new copy of [dd] 2.0. When you go to upgrade one of your profiles from a MacSpeech Dictate profile you will have to redo the training but it doesn't take long.

Once you have Dragon Dictate 2.0 installed you will notice minor cosmetic changes in the interface. Overall, you should be able to dictate just as you did before. Most of the changes seem to be additions rather than changes or subtractions. Also, Growl version 1.2 can be used to notify recognised text rather than having a window always visible on the screen. Later versions of Growl do not work as well at this time.

I primarily access my computer using Zoom and was hoping that the new Mouse Grid feature would enable me to use my mouse less. While Mouse Grid is fairly easy to use, even for someone with low vision, it depends on there being "clicks" of the mouse. So, I haven't been able to use it to look around my screen.

Dragon Dictate is much more stable than MacSpeech Dictate. I can now use websites that involve Flash, such as YouTube, with Dragon Dictate running. It is still a good idea to save your profile regularly but crashes seem more infrequent.

The Available Commands Window has had some changes. More choices are now listed. Some of the commands seem shorter. This makes it easier for someone like me whose disability affects their voice and breathing. Most of the commands work beautifully. The only one I've had trouble with is "reply to this message" in Mail.

Lisa using Dragon Dictate on her Mac

Lisa using Dragon Dictate on her Mac

One thing that has not been fixed is that I'm still experiencing small words like "the" and "A" being recognized when I am not speaking. Admittedly, part of this may be caused by my use of a Plantronics 995 headset, which is not approved. Clearing those words with "scratch that" can cause glitches with what I'm doing. Oddly, muting the microphone with the headset doesn't always stop this problem so I strongly recommend putting Dragon Dictate to sleep when you won't be using it for a few minutes.

Overall, I would recommend Dragon Dictate 2.0. The increased stability alone makes dictation easier. Plus, with the increased number of available commands and Mouse Grid I think that Dragon Dictate will be more useful for those who have trouble using their hands rather than just preferring to speak.

- Lisa Connor

[msddisclaim]

[msdbanner]

Dragon Dictate for Mac 2.0 Announced

Icon for MacSpeech DictateNuance Communications today announced the release of Dragon Dictate for Mac 2.0, a paid and rebranded upgrade for MacSpeech Dictate.

This is a major upgrade, bringing Dictate much closer to the Windows based Dragon NaturallySpeaking product. Major features include:

  • Uses the same speech recognition engine as the new Dragon NaturallySpeaking 11
  • Mouse movement with voice commands using a 3 by 3 grid system is now built in.
  • Mouse clicking with voice commands including clicks with modifiers, double clicking, etc., is now built in.
  • Proofreading documents with the Mac's built in text-to-speech commands is now also included.
  • More than one microphone can now be attached to a single profile.
  • New editing commands have been added so they match the commands that will be familiar to Windows Naturally Speaking users.

This sounds like it could now function as a complete keyboard replacement for disabled Mac users, which is great news!

Unfortunately my computer is still off being fixed (the first fix only worked for a few days), so I haven't had the chance to try this new version. As soon as is humanly possible, I will be getting myself a copy and testing it out. Meanwhile, Dan Cohen at GearDiary has reviewed Dragon Dictate and declares it 'awesome' - not a bad start!

Dragon Dictate for Mac costs US$199 including a basic microphone. The upgrade costs US$49 for a downloadable version, more if you need the upgrade on CD or want to purchase a new microphone at the same time.

If you use this banner to purchase your upgrade online I will get a small portion of your upgrade price, which will help support me and ATMac:

[msdbanner]

Have you upgraded yet? Are you planning to upgrade soon, or later, or not at all? And what new feature are you most excited about?

- Ricky Buchanan

Improve MacSpeech Dictate Recognition With One Easy Step

Icon for MacSpeech DictateWhen trying to dictate using MacSpeech, things can be very frustrating at first. Trying to learn to use a new program and memorise many commands at the same time as you are learning the art of dictation is always going to be difficult. Happily, there is one thing which you can do that will greatly increase the chances that you will learn to use [msd] usefully.

Select 'Preferences' from the Dictate menu to open MacSpeech Dictate's preference pane. Click on the recognition icon and check the box labelled "Always show recognition window when dictating". Make sure the other checkbox, the one labelled "Close recognition window after each choice", is unchecked as shown here:

macspeech-recognition-prefs

You may also wish to increase the maximum number of alternatives shown in the recognition window, and if you have a fast computer then moving the slider towards the end marked "Accuracy" may also improve your recognition.

Having the recognition window open whenever you are dictating will make it much more likely that you will use your voice for corrections. This will, in turn, allow MacSpeech dictate to fine tune your voice profile each time you use the program. Having a better voice profile will reduce the number of correction which are needed as Dictate will recognise your voice more accurately.

- Ricky Buchanan

[msddisclaim]

[msdbanner]

MacSpeech Dictate Legal Released

Icon for MacSpeech Dictate LegalHot on the heels of the [msd] version 1.5 upgrade and MacSpeech Dictate Medical, there's another member of the MacSpeech family: MacSpeech Dictate Legal.

Dictate Legal sports 30,000 legal words and terms, and retails with a higher quality microphone than the plain MacSpeech Dictate version. There are some cosmetic changes to the program's colours (yellow rather than blue), but otherwise this is the same product as the MacSpeech Dictate 1.5.* series, just with added legal vocabulary.

Reports I've seen from around the web have been very positive although few people actually have the software yet. There's a good review of MacSpeech Dictate Legal written by Victor Medina.

MacSpeech Dictate Legal currently retails for US$595 including a high quality noise-cancelling microphone. MacSpeech Dictate 1.5 customers can crossgrade for US$345, and customers with previous MacSpeech Dictate for US$395.

Website: [msd]

- Ricky Buchanan
[msddisclaim]

[msdbanner]