W3C Uniquitous Web Domain

The Voice Browser Working Group

The Voice Browser Working Group's mission is to support browsing the web by voice. The web is much more than just the web pages you can see, it is also the web pages you can hear and speak to. While end users are familiar with interacting with visual html web pages rendered in their browser of choice, many users might be surprised to realize that today they regularly interact with the voice web through VoiceXML (VXML) and other technologies developed and standardized by the Voice Browser Working Group. Just as many sites have an html presence on the web for visual browsing, most large companies have a vxml presence on the web for voice browsing, which is most often accessed by calling the companies phone number. Unlike most visual web browsers, voice web browsers are typically without chrome and run in the cloud, so they are often transparent to the end user. But otherwise, all the normal power of the web applies including taking advantage of web services, markup, linking, uris, cacheing, standards, accessibility, and cross-browser support.

There are a suite of independent standards that are also supported as a parts of VoiceXML. These standards can be, and are being, used alone in non-VXML contexts; however, they achieve a powerful synergy when used in support of VXML. The latest recommendation of these web standards are:

VoiceXML (VXML)
a language for for creating audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony, and mixed initiative conversations;
Speech Grammar Recognition Specification (SRGS)
a document language that can be used by developers to specify the words and patterns of words to be listened for by a speech recognizer or other grammar processor;
Semantic Interpretation for Speech Recognition (SISR)
a document format that represents annotations to grammar rules for extracting the semantic results from recognition;
Pronunciation Lexicon Specification (PLS)
a representation of phonetic information for use in speech recognition and synthesis;
Speech Synthesis Markup Language (SSML)
a markup language for rendering a combination of prerecorded speech, synthetic speech, and music;

In addition to recommendations that are used as part of VoiceXML, there are a couple of powerful specifications that are used to interact and control voice sessions (as well as control many other types of sessions and processes). These are:

Call Control (CCXML)
a markup language to enable fine-grained control of speech (signal processing) resources and telephony resources to perform scenarios such as call screening, whisper call waiting, and call transfer;
State Chart XML (SCXML)
a markup language to simply and precisely represent the semantics of state machines;

The W3C Voice Browser Working Group (members only) is chartered to develop the next generation of the voice web. Currently the group's main two areas of focus in the near term are on driving SCXML to last call status and on the next version of VoiceXML (3.0).

News

05 July 2011: Call Control eXtensible Markup Language (CCXML) Version 1.0 is a W3C Recommendation

The Voice Browser Working Group is pleased to announce that CCXML is now a W3C Recommendation!

CCXML is designed to provide telephony call control support for dialog systems, such as VoiceXML.

26 April 2011: State Chart XML (SCXML): nineth Working Draft is published

9th WD of SCXML is published. A diff-marked versionis also available for comparison purposes. The main difference from the previous draft is corrections to the interpretation algorithm.

4-5 June 2011: Workshop on Mobile and Web Technologies in Social and Economic Development

The World Wide Web Foundation is organising the Workshop on Mobile and Web Technologies in Social and Economic Development in Tanzania. The workshop is about themes that are dear to the foundation and related to voice-browsing in developing countries. See also the official announcement on the foundation's site.

16 December 2010: State Chart XML (SCXML): eighth Working Draft is published

8th WD of SCXML is published. A diff-marked versionis also available for comparison purposes. The main difference from the previous draft is the removal of profiles.

16 December 2010: Voice Extensible Markup Language (VoiceXML) 3.0: eighth Working Draft is published

8th WD of VXML 3.0 is published. A diff-marked versionis also available for comparison purposes. The main differences from the previous draft are described in Appendix F.

7 September 2010: Speech Synthesis Markup Language (SSML) Version 1.1 is a W3C Recommendation

SSML 1.1 is an official recommendation. See the press release for more about this important milestone.

31 August 2010: Voice Extensible Markup Language (VoiceXML) 3.0: Seventh Working Draft is published

7th WD of VXML 3.0 is published. A diff-marked versionis also available for comparison purposes. The main differences from the previous draft are described in Appendix F.

30 June 2010: Workshop on Conversational Applications

The summary of the Workshop on Conversational Applications in Somerset, New Jersey, US on 18-19 June 2010 is now available. Participants from 12 organizations fucused discussion on the use cases of possible conversational applications and clarified limitations of the current W3C language model in order to develop a more comprehensive one. Detailed minutes are also available.

17 June 2010: Voice Extensible Markup Language (VoiceXML) 3.0: Sixth Working Draft is published

6th WD of VXML 3.0 is published. A diff-marked versionis also available for comparison purposes. The main differences from the previous draft are described in Appendix F.

13 May 2010: State Chart XML (SCXML): Seventh Working Draft is published

7th WD of VXML 3.0 is published. A diff-marked versionis also available for comparison purposes. The main differences from the previous draft are the removal of the <anchor> element, a revision of the interpretation algorithm and addition of a brief description on DOM Event I/O Processor.

Matt Womer (mdw@w3.org), Team Contact for Voice Browser Working Group
$Id: Overview.html,v 1.665 2011/07/12 14:26:50 mdw Exp $.
This page was generated using XSLT. The XML source is also available for viewing on an XSLT-enabled browser.