ERCIM News 64

January 2006
Contents

Cover ERCIM News 64

This issue in pdf
(76 pages; 12Mb)

Subscription

Archive:
Cover ERCIM News 63
previous issue:
Number 63
October 2005:

previous issues online

Next issue:
April 2006

Next Special theme:
Space Exploration

Call for the next issue

About ERCIM News

W3C to Internationalize and Secure Voice Browsing

Following the successful technical Workshop held in Beijing last November 2005, and taking into account valuable inputs from the VoiceXML Forum, the W3C announced new work on extensions to components of the Speech Interface Framework which will both extend Speech Synthesis Markup Language functionality to Asian and other languages, and include speaker verification features into the next version of VoiceXML, version 3.0. Addressing both areas expands both the reach and functionality of the framework.

The Speech Synthesis Markup Language (SSML), a W3C Recommendation since 2004, is designed to provide a rich, XML-based markup language for assisting the generation of synthetic speech in Web and other applications. The essential role of the markup language is to provide authors of synthesizable content a standard way to control aspects of speech such as pronunciation, volume, pitch, rate, etc. across different synthesis-capable platforms.

While these attributes are critical, additional attributes may be even more important to specific languages. For example, Mandarin Chinese, the most widely spoken language in the world today, also has the notion of tones - the same written character can have multiple pronunciations and meanings based on the tone used. Given the profusion of cellphones in China - some estimate as high as over one billion - the case for extending SSML for Mandarin is clear in terms of sheer market forces. Including extensions for Japanese, Korean and other languages will ensure that a fuller participation possible of the world on the Web.

Users of telephony services and the Web are also demanding speaker verification. Identity theft, fraud, phishing, terrorism, and even the high cost of resetting passwords have heightened interest in deploying biometric security for all communication channels, including the telephone. Speaker verification and identification is not only the best biometric for securing telephone transactions and communications, it can work seamlessly with speech recognition and speech synthesis in VoiceXML deployments.

Link:
Voice Browser Activity: http://www.w3.org/Voice/