« October 2005 | Main | December 2005 »

November 18, 2005

Podcasts Converted to Text

With that in mind, a few companies are trying to make search engines actually listen to big audio and video files. From there, speech-to-text software can generate written transcripts, which are searched in addition to metadata.

Posted by Michelle at 04:00 PM | Comments (0) | TrackBack

War-Zone Test for Babel-Fish Tool

Funded by Darpa, the system would allow troops to communicate in Arabic through a laptop computer equipped with voice recognition and translation software. Troops could speak in English and have their words instantly translated into Iraqi Arabic, "spoken" by a computerized man's voice. The program also translates Arabic into English.

"Will it replace the need for an interpreter when you're having some sort of high-level conversation? Absolutely not," said Kristin Precoda, speech technology research lab director at SRI International and one of the developers of the program that got underway in May. "But it is absolutely to the point where it could be useful in some carefully chosen situations."

The effort to combine machine translation and voice recognition, a project Darpa calls TranTac, is an extension of earlier research spearheaded by SRI that culminated in the Phraselator. That device, essentially a PDA programmed to translate English phrases into other languages, is widely used by troops and medical workers in Afghanistan and Iraq.

But the Phraselator still has serious limitations, Precoda said. It can be impractical in countries with low literacy rates, such as Afghanistan. Additionally, many people are unfamiliar with the PDA interface.

Posted by Michelle at 03:51 PM | Comments (0) | TrackBack

November 16, 2005

The Voice Business Market Outlook: Key Technologies and Emerging Market Opportunities

Business Insights is offering a report (for $1530) on the market outlook for voice recognition and related technologies:

The growing number of customer success stories and benchmarks has helped convince more businesses that speech is a viable business solution that provides measurable ROI benefits; by 2008, the global voice business market will be worth $3.1bn, with increasing growth rates through 2007, at which point growth rates will begin to stabilize and fall off as the market matures. While the platform market as a whole will continue to grow, it will have the lowest CAGR for 2004-2008 of any value chain segment. Speech recognition and text-to-speech engines will account for the lion’s share of revenues within the segment, although voice authentication will increase in prominence, through 2005 and onwards. The applications segment is expected to become the biggest portion of the voice business pie by 2008, representing 32% of total revenues. Voice business services will be one of the fastest growing value chain segments through 2008, with a CAGR of 35%.

Posted by Michelle at 02:31 PM | Comments (0) | TrackBack

New electronic toys give control back to the children

The buzzword for hot electronic toys this season is "control."

In short, kids (of all ages) love stuff they can command with the touch of a button, the sounds of their voices and the music they play.

Increasingly sophisticated voice- and touch-recognition technology means that everything from fuzzy creatures to fearsome dinosaurs can respond to orders.

Posted by Michelle at 02:26 PM | Comments (0) | TrackBack

MacSpeech Adds PowerPoint ScriptPak

MacSpeech announced the immediate availability of its new ScriptPak for PowerPoint 2004 on Tuesday. The new ScriptPak adds voice recognition features to iListen 1.6.5 for PowerPoint 2004. It includes over 200 commands, including the ability to voice-control slide shows. The PowerPoint ScriptPak is available for download at the MacSpeech web site, and is priced at US$19.

Posted by Michelle at 02:21 PM | Comments (0) | TrackBack

November 15, 2005

Dictaphone Introduces New Microphone with Full Mouse Capabilities for Speech Recognition Applications

This new microphone from Dictaphone sounds pretty useful... well, if you're using Dictaphone voice recognition products, that is...

Dictaphone Corporation has introduced the PowerMicII microphone, combining advanced dictation and review functions with full Microsoft Windows mouse capabilities to support physicians who are dictating and self-correcting reports generated with speech recognition technology. The PowerMicII reduces keyboard and mouse reliance for caregivers using Dictaphone's Enterprise Workstation and PowerScribe Workstation speech recognition systems, which in turn dramatically increases ease of use and productivity.

Posted by Michelle at 05:55 PM | Comments (0) | TrackBack

Nightingale Speech-Enables EMR Solution With Dragon NaturallySpeaking Medical From Nuance; Nightingale Taps Dragon NaturallySpeaking to Deliver Complete EMR Solution; Reduces Costs and Improves Overall Patient Care

Canadians may soon see their friendly local Emergency Medical Technicians using voice recognition to speed their medical care:

Nuance Communications, Inc., formerly ScanSoft, Inc., the leading provider of speech and imaging solutions for businesses and consumers around the world, today announced that Nightingale, the leading provider of ASP-based solutions to Canadian healthcare providers, has selected Dragon NaturallySpeaking Medical to enable speech recognition within its Nightingale electronic medical records (EMR) solutions. The new integration will enable Nightingale's customers to dictate directly into the myNightingale EMR system, dramatically lowering administrative costs, errors, and turnaround time typically associated with manually transcribing critical medical records.

Posted by Michelle at 05:42 PM | Comments (0) | TrackBack

November 11, 2005

Military Bloggers Compete to Support Wounded

In recognition of the continuing need to supply voice-activated laptops to wounded military personnel, Valour-IT has mounted a fundraising drive among major military and civilian bloggers. Representing the Air Force, Marines, Army and Navy, military and civilian bloggers with thousands of readers have raised over $40,000 in the first 5 days of a 10-day drive.

Posted by Michelle at 04:26 PM | Comments (0) | TrackBack

The Voice-Activated R2-D2

This motorized replica of the headstrong little droid from the Star Wars films responds to voice commands, navigates rooms and hallways, and makes any home feel like it has been transported to a galaxy far, far away. R2 obeys more than 40 voice commands ("Turn around!" "Move forward two units!") and he plays games like tag, using an infra-red sensor to search for people in a room. His sensor helps him follow behind you, or it can be set to detect motion, turning R2 into a room sentry that sounds an alarm when a secured area is invaded. R2's lights, swiveling dome top, and distinctive happy and sad sounds faithfully mimic the real thing, right down to his occasional "bad mood." (A simple command of, "R2, behave yourself!" snaps him out of it.) R2 can also replay sounds and dialog from "Star Wars" movies, answer yes-or-no questions, and dance while playing the famed cantina music.

Posted by Michelle at 04:20 PM | Comments (0) | TrackBack

November 09, 2005

AccuNurse 2.0 Provides Instant Communication Between Senior Care Centers and Families

AccuNurse is a voice-activated, hands-free documentation and communication system that the nursing staff of senior care facilities uses to review resident care plans, and document care as it takes place. The system features some of the most advanced technology integration on the market today. Sophisticated speech recognition technology, combined with VoIP technology, allows staff members to quickly communicate resident information directly to the AccuNurse database, which allows staff members to report information faster and more accurately than ever before. AccuNurse also includes extensive reporting capabilities that lead to improved staff management, resident care, and compliance with state and federal regulatory requirements. Unlike other automated documentation systems, all staff members can easily learn and use AccuNurse.

Posted by Michelle at 12:38 PM | Comments (0) | TrackBack

Symposium to showcase voice-operated wheelchair

Every Tuesday morning for nine months, four UNM students have been working to make a voice-activated wheelchair a reality.

"This can really help people in the end. It can really serve a purpose later on and help somebody's life out," said Patrick Thomas, a UNM senior.

SAATI--a speech activated, all-terrain, interactive wheelchair--is one of 468 projects to be showcased at the University on Nov. 21 at the Undergraduate Research and Creativity Symposium.

Thomas and his team members, Chris Martinez, J.P. Sena and Leanne Storey, have been working on the prototype for their senior design class, which is a two-semester course in the Electrical and Computer Engineering Department. The wheelchair is operated with a microphone and speaker that recognizes distinct commands from the user, Martinez said.

Posted by Michelle at 12:34 PM | Comments (0) | TrackBack

Speech-activated services support Windows Mobile devices

Some more information about the new MobileSpeech package (see the original post here) released by Lab9, with screenshot and examples of voice commands.

Posted by Michelle at 11:15 AM | Comments (0) | TrackBack

November 06, 2005

MacSpeech Announces iListen Transcription Package

MacSpeech announced the immediate availability of two new package solutions for iListen on Thursday, the iListen Transcription Solution DS-2, and the iListen Plantronics Bundle. Both include the latest iListen software, along with an upgraded microphone headset.

Posted by Michelle at 03:52 PM | Comments (0) | TrackBack

BBC launches website for disabled

The BBC, working with computing and disability charity Ability Net, has launched a site designed to help people with disabilities get the most out of the World Wide Web.

My Web, My Way is intended to equip anyone using their computer with the tools and understanding to enable them to make the most of the internet, whatever their ability or disability, and regardless of the operating system (Windows, Mac or Linux) they use.

The site provides help to people who would benefit from making changes to their browser, operating system or computer by giving advice on a wide range of specialist hardware such as alternative keyboards and mice, and software such as voice recognition, screen reading and word prediction.

Posted by Michelle at 03:29 PM | Comments (0) | TrackBack

Expo showcases aids for the disabled

Seebacher, 40, learned how she can borrow a voice recognition device that will allow her to dictate rather than type at her computer.

...At Thursday's event, she discovered the Assistive Technology Lending Library. The program, based at Temple University, offers 3,000 items, such as an oversized computer keypad for people with vision problems or limited motor skills.

The free program is available to all Pennsylvanians with disabilities. It lets them try devices before buying them.

Posted by Michelle at 03:27 PM | Comments (0) | TrackBack

Sick of automation? Dial 0 for human

The widespread proliferation of automated customer service systems is part of a profound change in the way American businesses deal with customers. A lot of attention has been focused on how consumers end up speaking to call centers in India or other countries when they phone for help. But Gartner Inc., a market research firm in San Francisco, said many companies are bypassing call centers altogether by asking their customers to serve themselves with the help of technology.

Self-service activities range from customers scanning and bagging their own groceries to consumers using automated voice systems or websites to purchase tickets, submit insurance claims, manage bank accounts, or adjust financial portfolios. By 2010, Gartner says, self-service will account for 58 percent of all service interactions, up from 35 percent today.

Posted by Michelle at 03:05 PM | Comments (0) | TrackBack

MobileSpeech 1.0 for Windows Mobile Smartphone and Pocket PC

Great news, a competitor to the the widely popular Voice Command from Microsoft is now available. MobileSpeech 1.0 offers the ability to dial and start applications from not only Windows Mobile Pocket PC but also Windows Mobile Smartphone devices. The new application is priced at a very reasonable $15.95.

Lab9 Solutions Inc. announced today the general availability of MobileSpeech 1.0 for WinCE SmartPhones and Pocket PC's. MobileSpeech is the first in a series of Integrated, Speech Activated Services providing Contact and Digit Dialing, Application Launching, and Contact Lookups, featuring innovative hands-free and eyes-free operation that users can customize to their skill and experience level.

Posted by Michelle at 02:57 PM | Comments (0) | TrackBack

November 04, 2005

Live speech-translation technology unveiled

Technology that provides live translation of speech from one language to another has been revealed by scientists from the US and Europe.

This and other translation technologies were demonstrated publicly for the first time at Carnegie Mellon University in Pittsburgh, US, last Thursday. They were developed by researchers from the International Center for Advanced Communication Technologies (InterACT), a collaboration between Carnegie Mellon and the University of Karlsruhe in Germany.

Posted by Michelle at 02:22 AM | Comments (0) | TrackBack

Social Security Offers Voice-Recognition Phone Feature

The speech recognition technology allows a computer to understand what callers are saying and to conduct an automated "dialogue" with them. Using this technology, callers to Social Security's national 800 number can update their addresses or telephone numbers; change direct deposit information; request a replacement Medicare card; and request verification of Social Security benefits.

Posted by Michelle at 01:57 AM | Comments (0) | TrackBack

November 03, 2005

Parrot Introduces the Industry's First Bluetooth Hands-Free Car Stereo: Parrot Rhythm n' Blue

No word on whether you can control the music playing functions by voice though! Now that would be handy. (Yeah, I know, background noise. But it mutes itself when you make a phone call, why can't it temporarily mute itself while you issue the "Skip" command?)

The Parrot RHYTHM N'BLUE features a premium AM/FM/CD/MP3 stereo receiver. In addition to tuning in your favorite radio stations, Parrot Rhythm n' Blue lets you listen to your CDs and MP3s while pairing with most Bluetooth mobile phones on the market, allowing the driver to make calls, take calls, and access a mobile address book totally hands-free via Parrot's renowned voice-recognition technology. A quick glance at your receiver provides Caller ID, and the radio or music source is automatically muted as the phone audio is routed through the vehicle's speakers for crisp, clear audio... and, because it's Bluetooth, you can do this all without being tethered to your phone.

Posted by Michelle at 01:49 AM | Comments (0) | TrackBack

November 02, 2005

It's VoIP, Jim, but not as we know it: nVidia CEO predicts tech developments

The future of wireless communications could be a world without mobile phones, where Wi-Fi signals pick up your voice commands from a chip-enabled lapel on your breast pocket, then VoIP (voice over IP) converts the voice signals to data and sends them across the internet to powerful servers that can identify the caller's voice and connect them immediately to the person they're trying to find.

Posted by Michelle at 11:37 PM | Comments (0) | TrackBack

speak4CAD Voice Recognition Software Doubles the Pace of Computer-Aided Design 'CAD'

Enact Technology, LLC has announced the release of its speak4CAD recognition software, the first software product that dramatically improves AutoCAD productivity. There are over 3 million AutoCAD users worldwide making it the dominant CAD software. During speak4CAD's beta-user tests, productivity was measured by comparing manual drawings to those created by spoken drawing commands and dimensions (video demo). Bruce Swan, Enact Technology Senior Partner says, "We are extremely pleased at the reception speak4CAD software is receiving. One of our beta-test users, Architect Fernando Andrade's response is typical of what we're hearing from everyone now using our product. He says that speak4CAD has entirely changed the way he uses AutoCAD because it enables him to focus on drawing instead of the keyboard--a more natural way to work."

Posted by Michelle at 11:19 PM | Comments (0) | TrackBack