Another abandoned server code base... this is kind of an ancestor of taskrambler.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 

1442 lines
52 KiB

<?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type"
content="text/html; charset=iso-8859-1" />
<title>W3C Multimodal Interaction Working Group</title>
<style type="text/css">
/*<![CDATA[*/
<!--
body {
margin-left: 8%;
margin-right: 5%;
background-color: white;
font-family: Trebuchet, Arial, sans-serif
}
h1 { margin-left: -4%; color: rgb(0,92,160) }
h2 { margin-left: -4%; color: rgb(0,92,160)}
h3 { margin-left: 0% }
h3.notoc {margin-left: 2% }
h4 { margin-left: 0% }
a.toc {font-style: italic; text-decoration: none}
dt {color: rgb(0,92,160); font-weight: bold}
pre { color: green; font-weight: bold }
em { font-style: italic }
strong { text-transform: uppercase; font-weight: bold }
b { color:#990066}
th
{
background: #005a9c;
color: #fff;
}
p.splash { color: #990066}
p.info {text-align: center }
p.linktotheend {
margin-left: 10%;
margin-right: 10%;
}
p.banner { margin-left: -3% }
b.black { color: black}
b.pink {color:#990066}
p.pink {color:#990066}
blockquote { color: black; font-style:italic; font-family: sans-serif;}
pre,tt { font-family: monospace; font-weight: normal }
pre { margin-left: 4% }
div.disclaimer {margin-left: -8%}
p.mission {
font-style: italic;
margin-left: 20%;
margin-right: 20%;
}
div.color {
background: rgb(255,204,255);
padding: 0.5em;
border: none;
width: 80%;
font-family: Trebuchet, Arial, sans-serif;
}
blockquote.color {
background: rgb(255,204,255);
padding-left: 0.5em;
padding-right: 0.5em;
padding-top: 0.2em;
padding-bottom: 0.2em;
border: none;
margin-right: 5%;
font-style: normal;
font-family: serif;
}
li {margin-top: 0.2em; margin-bottom: 0}
li.important {
margin-top: 0.4em;
margin-bottom: 0.2em;
background: rgb(255,255,204);
}
blockquote h3 {text-align: center}
.navbar {text-align: center}
.question {font-style: italic}
:visited { color: navy }
:link { color: blue }
a:hover { background-color: #ffffcc }
a:active { color: red }
//-->
/*From http://www.w3.org/2005/09/table.css*/
table
{
border-collapse: collapse;
margin: 1em auto;
}
table caption
{
margin-left: auto;
margin-right: auto;
}
table, tr, th, td { border: 1px solid black; }
th, td { padding: 5px 1em; }
th
{
/* background: #005a9c; */
/* color: #fff; */
background: rgb(200,200,200);
color: #000;
}
.note { font-size: 120%; font-style: italic; color: red }
/*]]>
*/</style>
</head>
<body bgcolor="white" text="black" link="blue">
<p class="banner"><a href="http://www.w3.org/"><img alt="W3C"
width="72" height="48" border="0" src="/Icons/w3c_home.gif" /></a>
<a href="/UbiWeb/">
<img src="/Icons/ubi212" width="212" alt="Ubiquitous Web Domain" border="0" />
</a></p>
<h1>Multimodal Interaction Activity</h1>
<p class="mission"><em>Extending the Web to support multiple
modes of interaction.</em></p>
<p class="navbar"><a href="#intro">Introduction</a> |
<a href="#status">Current Situation</a> |
<a href="#plan">Revised publication plan</a> |
<a href="#work">Work in Progress</a> |
<a href="http://lists.w3.org/Archives/Public/www-multimodal/">Email Archive</a> |
<a href="Group">Working&nbsp;Group</a> |
<a href="/2004/01/pp-impl/34607/status">Patent&nbsp;Disclosures</a> |
<a href="/2009/05/mmi-charter.html">Charter</a> |
<a href="Activity.html">Activity&nbsp;Statement</a> |
<a href="#contacts">Contacts</a></p>
<blockquote class="color">
<p style="text-align: center">News</p>
<ul>
<li>20 September 2011:
<a href="/TR/2011/REC-InkML-20110920/">
Ink Markup Language (InkML)
</a>
is a W3C Recommendation.
</li>
<li>6 September 2011: The second Last Call Working Draft
of <a href="/TR/2011/WD-mmi-arch-20110906/"> Multimodal Architecture
and Interfaces</a> is published. <br/> The main normative change from
the previous draft is removing the 'immediate' field from the
following Life Cycle Events: CancelRequest, PauseRequest.
</li>
<li>10 May 2011:
<a href="/TR/2011/PR-InkML-20110510/">
Ink Markup Language (InkML)
</a>
is a W3C Proposed Recommendation.
</li>
<li>8 April 2011:
The Last Call Working Draft of <a href="http://www.w3.org/TR/2011/WD-emotionml-20110407/">Emotion Markup Language (EmotionML) 1.0</a> is published.
The main change from the previous draft is the inclusion of a
mechanism for defining emotion vocabularies.
A <a href="http://www.w3.org/TR/2011/WD-emotionml-20110407/#changelog"> precise list of changes from the previous draft</a> is also available for comparison purposes.
<br/>
Also <a href="http://www.w3.org/TR/2011/WD-emotion-voc-20110407/">Vocabularies for EmotionML</a> is published as a First Public Working Draft.
This document represents a public collection of emotion vocabularies that can be used with EmotionML to represent emotions and related states. It was originally part of an earlier draft of the EmotionML specification, but was moved out of it so that we can easily update, extend and correct the list of vocabularies as required.
</li>
<!--
<li>4-5 June 2011:
<a href="http://www.webfoundation.org/">The World Wide Web Foundation</a>
is organising
<a href="http://public.webfoundation.org/2011/01/MW4D_WS/">
the Workshop on Mobile and Web Technologies in Social and Economic Development</a>
in Tanzaniaunce.
The workshop is about themes that are dear to the foundation and
related to multi-channel data collection using Web technologies.
# both your groups: voice-browsing in developing countries,
# and multi-channel data collection using web technologies.
See also the <a href="http://www.webfoundation.org/2011/02/announcing-the-workshop-on-mobile-and-web-technologies-in-social-and-economic-development/">official announcement</a> on the foundation's site.
</li>
-->
<li>1 Mach 2011:
<a href="http://www.w3.org/TR/2011/NOTE-mmi-mcbp-20110301/">
Best practices for creating MMI Modality Components
</a>
is published as a Working Group Note.
</li>
<li>25 January 2011:
The Last Call Working Draft of <a href="/TR/2011/WD-mmi-arch-20110125/">
Multimodal Architecture and Interfaces</a> is published.
<br/>
The main change from the previous draft is the tightening of the
language to make the requirements more precise.
A
<a href="/TR/2011/WD-mmi-arch-20110125/diff.html">
diff-marked version</a> is also available for comparison purposes.
</li>
<li>
11 January 2011:
<a href="http://www.w3.org/TR/2011/CR-InkML-20110111/">
Ink Markup Language (InkML)</a>
is a W3C Candidate Recommendation
(See also
the group's
<a href="http://www.w3.org/2002/mmi/2011/inkml-irp/">Implementation Report Plan</a>).
</li>
<li>5-6 October 2010:
<a href="http://www.w3.org/2010/10/emotionml/cfp.html">
The EmotionML Workshop</a>
was held in Paris, France, hosted by Telecom ParisTech.
The <a href="/2010/10/emotionml/summary.html">summary</a>
and <a href="/2010/10/emotionml/minutes.html">
detailed minutes</a> are available online.
Participants from 12 organizations discussed use cases of possible
emotion-ready applications and clarified several key requirements for
the current EmotionML to make the specification even more useful.
</li>
<li>21 September 2010:
The seventh Working Draft of <a href="/TR/2010/WD-mmi-arch-20100921/">
Multimodal Architecture and Interfaces</a> is published.
<br/>
The main changes from the previous draft are (1) the inclusion of
state charts for modality components, (2) the addition of a
'confidential' field to life-cycle events and (3) the removal of the
'media' field from life-cycle events.
A
<a href="/TR/2010/WD-mmi-arch-20100921/diff.html">
diff-marked version</a> is also available for comparison purposes.
</li>
<li>29 July 2010:
The second Working Draft of
<a href="http://www.w3.org/TR/2010/WD-emotionml-20100729/">
Emotion Markup Language (EmotionML) 1.0
</a> is published.
A
<a href="http://www.w3.org/TR/emotionml/diffmarked.html">
diff-marked version</a> is also available for comparison purposes.
<br />
Please send your comments to the Multimodal Interaction public mailing
list (&lt;<a href="mailto:www-multimodal@w3.org">www-multimodal@w3.org</a>&gt;).
</li>
<li>18-19 June 2010:
<a href="http://www.w3.org/2010/02/convapps/cfp.html">
The workshop on Conversational Applications</a>
was held in Somerset, NJ, (USA), hosted by Openstream.
The <a href="/2010/02/convapps/summary.html">summary</a>
and <a href="/2010/02/convapps/minutes.html">
detailed minutes</a> are available online.
Participants from 12 organizations fucused discussion on the use
cases of possible conversational applications and clarified
limitations of the current W3C language model in order to
develop a more comprehensive one.</li>
<!--
<li class="important">18-19 June 2010:
<a href="http://www.w3.org/2010/02/convapps/cfp.html">
Workshop on Conversational Applications</a>
will be held in Somerset, NJ, (USA), hosted by Openstream.
<br/>
<br/>
<span class="note">
*** The deadline to send position papers is now extended to April 30. ***
</span>
<br/>
<br/>
Please see the
<a href="http://www.w3.org/2010/02/convapps/cfp.html">
Call for Participation</a> for details.
To participate in the Workshop, please submit a position paper
(either as an individual or organization)
to
&lt;<a href="mailto:team-convapps-ws-submit@w3.org">team-convapps-ws-submit@w3.org</a>&gt;
by 11:59 EDT on 30 April 2010.
<br/>
<br/>
To help with planning, please let us know as soon as possible if you
are interested in attending by sending the following information to
&lt;<a href="mailto:team-convapps-ws-submit@w3.org">team-convapps-ws-submit@w3.org</a>&gt;:
<ul>
<li>that a representative from your organization plans to submit a position paper</li>
<li>how many participants you want to send to the workshop (either one or two)</li>
<li>whether or not you wish to make a presentation during the workshop</li>
</ul>
</li>
<li>27 May 2010:
The second Last Call Working Draft of
<a href="http://www.w3.org/TR/2010/WD-InkML-20100527/">
Ink Markup Language (InkML)</a> is published.
<br/>
This draft incorporates a small number of extensions to the
<a href="http://www.w3.org/TR/2006/WD-InkML-20061023">
previous version</a>, including that channels can now report values of
floating point type, and brush properties may be specified.
A
<a href="http://www.w3.org/TR/2010/WD-InkML-20100527/diff.html">
diff-marked version</a> is also available for comparison purposes.
<br />
Please send your comments to the Multimodal Interaction public mailing
list (&lt;<a href="mailto:www-multimodal@w3.org">www-multimodal@w3.org</a>&gt;) by 17 June 2010.
When sending e-mail, please put the text "[ink]" in the subject,
preferably like this: "[ink] .summary of comment."
</li>
<li>15 December 2009:
<a href="http://www.w3.org/TR/2009/NOTE-emma-usecases-20091215/">
Use Cases for Possible Future EMMA Features
</a>
is published as a Working Group Note.
</li>
<li>29 October 2009:
The sixth Working Draft of <a href="/TR/2009/WD-mmi-arch-20091201/">
Multimodal Architecture and Interfaces</a> is published.
<br/>
The document as a whole has changed significantly and the group
welcomes review.
The main changes from the previous draft are (1) clarifying the
relationship to EMMA, (2) simplifying the architecture constituents,
(3) adding a description on HTTP transport of lifecycle events and (4)
adding an example of handwriting recognition modality component.
A
<a href="http://www.w3.org/TR/2009/WD-mmi-arch-20091201/diff.html">
diff-marked version</a> is also available for comparison purposes.
</li>
<li>29 October 2009:
<a href="http://www.w3.org/TR/2009/WD-emotionml-20091029/">
Emotion Markup Language (EmotionML) 1.0</a>
is published as a First Public Working Draft.
</li>
<li class="important">19 August 2009:
The Multimodal Interaction Working Group is relaunched.
The updated <a href="/2009/05/mmi-charter.html">charter</a>
is available on the W3C Web server.
</li>
<li class="important">11 June 2009:
<a href="http://www.w3.org/2009/03/ink/cfp.html">
The Workshop on using Ink in Multimodal Applications within the W3C's Multimodal Architecture and Interfaces</a> is canceled due to limited participation caused by the influenza pandemic.
</li>
-->
<li>10 February 2009:
<a href="http://www.w3.org/TR/2009/REC-emma-20090210/">
EMMA: Extensible MultiModal Annotation markup language</a>
is a W3C Recommendation.
(<a href="http://www.w3.org/2009/02/emma-pressrelease.html">press release</a>)
</li>
<!--
<li>15 December 2008:
<a href="/TR/2008/PR-emma-20081215/">
EMMA: Extensible MultiModal Annotation markup language
</a>
is a W3C Proposed Recommendation.
</li>
<li>16 October 2008:
<a href="/TR/2008/WD-mmi-arch-20081016/">
Multimodal Architecture and Interfaces</a>: Fifth Working Draft is published.
</li>
<li>23 September, 2008:
<a href="/2002/mmi/2007/emma-irp/">Implementation Report Plan</a>
of
<a href="http://www.w3.org/TR/2007/CR-emma-20071211/">
Extensible MultiModal Annotation markup language (EMMA)
Candidate Recommendation
</a>
is modified.
Following test assertions have been removed from the Implementation
Report Plan document, because they are actually not described in the
EMMA specification: 801, 902, 903 and 1501.
Please see
<a href="http://lists.w3.org/Archives/Public/www-multimodal/2008Sep/0004.html">
the announcement sent to the Multimodal Interaction public list
</a>
for the details on the modification.
</li>
<li>2 July 2008:
<a href="/TR/2008/NOTE-mmi-auth-20080702/">
Authoring Applications for the Multimodal Architecture
</a>: First Public Working Group Note is published.
</li>
<li>28 April 2008:
A
<a href="http://lists.w3.org/Archives/Public/www-multimodal/2008Apr/0006.html">
reminder
</a>
on the implementation reports for the
<a href="http://www.w3.org/TR/emma/">
Extensible MultiModal Annotation markup language (EMMA)
</a>
specification sent out.
The Multimodal Interaction Working Group very much welcome
implementation reports.
The reports should be sent to
<a href="http://lists.w3.org/Archives/Public/www-multimodal/">
www-multimodal@w3.org
</a>
in the format described in the
<a href="http://www.w3.org/2002/mmi/2007/emma-irp">
implementation report plan</a>.
</li>
<li>14 April 2008:
<a href="/TR/2008/WD-mmi-arch-20080414/">
Multimodal Architecture and Interfaces</a>: Fourth Working Draft is published.
</li>
<li>
3-7 March, 2008:
MMIWG f2f meeting was held in Orlando, US, on 3-7 March 2008, hosted
by Voxeo. MMI Architecture workshop feedback, topics for the next MMI
Architecture and the need of guidelines for integrating a modality
into the MMI Architecture were discussed jointly with the VBWG.
</li>
<li>
22 January, 2008:
<a href="http://www.w3.org/2002/mmi/2007/emma-irp/#intro">
Implementation Report Plan</a>
of
"EMMA: Extensible MultiModal Annotation markup language"
Candidate Recommendation is modified with following point:
<ul>
<li>The minimum Candidate Recommendation period in the
Implementation Report Plan is modified from "?? 2007" to "14 April
2008" as defined in the
<a href="http://www.w3.org/TR/2007/CR-emma-20071211/#status">
Candidate Recommendation</a>.</li>
</ul>
</li>
<li>
11 December, 2007:
<a href="/TR/2007/CR-emma-20071211/">
EMMA: Extensible MultiModal Annotation markup language</a>
is a W3C Candidate Recommendation.
(See also
<a href="/2002/mmi/2007/emma-irp/">Implementation Report Plan</a>)</li>
<li>
16-17 November, 2007:
Workshop on W3C's Multimodal Architecture and Interfaces was held at
Keio University in Japan.
The
<a href="http://www.w3.org/2007/08/mmi-arch/cfp.html">Call for Participation</a>,
the
<a href="http://www.w3.org/2007/08/mmi-arch/agenda.html">Agenda</a>,
the
<a href="http://www.w3.org/2007/08/mmi-arch/minutes.html">Minutes</a>,
the
<a href="http://www.w3.org/2007/08/mmi-arch/summary.html">Summary</a>
are available.
The
<a href="http://www.w3.org/2007/08/mmi-arch/topics.html">Issue List</a>
raised during the workshop is also available.
</li>
<li>
28 August 2007:
<a href="http://www.w3.org/2007/08/mmi-arch/cfp.html">
The Workshop on W3C's Multimodal Architecture and Interfaces
</a>
will be held at Keio University in Fujisawa, Japan on
16-17 November 2007.
Position papers are due 5 October.
</li>
<li>
11 May 2007:
Jerry Carter (Nuance), Rafah Hosn (IBM) and Kaz Ashimura (W3C)
gave talks on
<a href="http://www.w3.org/2007/05/w3c-track.html#mmi">
"Multimodal Web to Expand Universal Access"
</a>
at W3C Track in WWW2007 Conference, Banff, Canada.
</li>
<li>
9 April 2007:
<a href="/TR/2007/WD-emma-20070409/">Second Last Call
Working Draft for
EMMA: Extensible MultiModal Annotation markup language
</a></li>
<li>
30 March 2007:
<a href="http://www.w3.org/News/2007#item59">
The Multimodal Interaction Working Group
is relaunched</a>.
The updated
<a href="/2006/12/mmi-charter.html">charter</a> is available
on the W3C Web server.</li>
<li><a href="/TR/2006/WD-mmi-arch-20061211/">Third Working
Draft for the Multimodal Architecture and Interfaces</a>,
11 Decembrer 2006</li>
<li>
<a href="/2006/Talks/1024-srig-inkml/InkML-IWFHR06.pdf">
InkML slides presented at IWFHR 2006 are available</a>, 24 October
2006</li>
<li><a href="/TR/2006/WD-InkML-20061023/">Last Call Working Draft for
the Ink Markup Language (InkML)
</a>, 23 October 2006 (Comments invited by 18 December 2006)</li>
<li><a href="/TR/2006/NOTE-mmi-suggestions-20060911/">Common
Sense Suggestions for Developing Multimodal User Interfaces</a>,
11 September 2006</li>
<li><a href="/TR/2006/NOTE-mmi-dev-feedback-20060414/">Working
Group Note on Multimodal Application Developer Feedback</a>,
14 April 2006</li>
<li><a href="/TR/2006/WD-mmi-arch-20060414/">Second Working
Draft for the Multimodal Architecture and Interfaces</a>,
14 April 2006</li>
<li><a href="/TR/emma/">Last Call Working Draft for the Extensible
Multimodal Annotation Markup Language specification</a> - 16 September
2005 (Comments invited by 28 October 2005)</li>
<li><a href="/2005/03/MWeb-seminar.html">W3C Seminar on Multimodal
Web Applications for Embedded Systems</a> - 21 June 2005</li>
<li><a href="/TR/2004/WD-DPF-20041122/">Last Call Working Draft
for the Dynamic Properties Framework specification</a>
- 11 November 2004 (formerly called the System and Environment
framework)</li>
<li><a href="/TR/2004/WD-InkML-20040928/">InkML specification</a>
- 28 September 2004</li>
<li><a href="http://www.w3.org/2004/02/mmi-workshop-cfp.html">W3C
Workshop on Multimodal Interaction</a> - 19-20 July 2004,
Sophia Antipolis, France. (<a href="/2004/02/mmi-workshop/Agenda">schedule</a>, <a href="/2004/02/mmi-workshop/papers">papers</a>.)</li>
<li><a href="/TR/modality-interface/">Modality Component Interface
Requirements and Capabilities</a> - 10 May 2004</li>
<li><a href="/TR/mmi-framework/">Multimodal Interaction Framework</a>
- 6 May 2003</li>
<li><a href="/TR/inkreqs/">Ink Requirements</a>
- 22 January 2003</li>
<li><a href="/TR/EMMAreqs/">EMMA Requirements</a>
- 13 January 2003</li>
<li><a href="/TR/mmi-reqs/">Multimodal Interaction Requirements</a>
- 8 January 2003</li>
<li><a href="/TR/mmi-use-cases/">Multimodal Interaction Use Cases</a>
- 4 December 2002</li>
-->
</ul>
</blockquote>
<h2 id="intro">Introduction</h2>
<h3>The Mission</h3>
<!--
<p>The Multimodal Interaction Activity is focused on developing
open standards that enable the following vision:</p>
<ul>
<li>Extending the Web to allow multiple modes of interaction:
<ul class="sub-bullet">
<li>GUI, Speech, Vision, Pen, Gestures, Haptic interfaces, ...</li>
</ul>
</li>
<li>Anyone, Anywhere, Any device, Any time
<ul class="sub-bullet">
<li>Accessible through the user's preferred modes of interaction
with services that adapt to the device, user and environmental
conditions</li>
</ul>
</li>
</ul>
<p>Users will be able to provide input via speech, handwriting,
or keystrokes, with output presented via displays, pre-recorded
and synthetic speech, audio, and tactile mechanisms such as
mobile phone vibrators and Braille strips. Application developers
will be able to provide an effective user interface for whichever
modes the user selects.</p>
<p>This vision won't be realized by one working group in isolation,
and the plan is to work in close cooperation with a range of other
W3C working groups and external standards organizations.</p>
-->
<p>The Multimodal Interaction Activity seeks to extend the Web to
allow users to dynamically select the most appropriate mode of
interaction for their current needs, including any disabilities,
whilst enabling developers to provide an effective user interface
for whichever modes the user selects. Depending upon the device,
users will be able to provide input via speech, handwriting, and
keystrokes, with output presented via displays, pre-recorded and
synthetic speech, audio, and tactile mechanisms such as mobile
phone vibrators and Braille strips.</p>
<p>Multimodal interaction offers significant ease of use benefits
over uni-modal interaction, for instance, when hands-free operation
is needed, for mobile devices with limited keypads, and for controlling
other devices when a traditional desktop computer is unvailable to
host the application user interface. This is being driven by advances
in embedded and network-based speech processing that are creating
opportunities for integrated multimodal Web browsers and for solutions
that separate the handling of visual and aural modalities, for example,
by coupling a local XHTML user agent with a remote VoiceXML user agent.</p>
<h3 id="audience">Target Audience</h3>
<p>The <a href="Group/">Multimodal Interaction Working Group</a>
(<a href="http://cgi.w3.org/MemberAccess/AccessRequest">member only
link</a>) should be of interest to a range of organizations in
different industry sectors:</p>
<dl>
<dt>Mobile</dt>
<dd>Multimodal applications are of particular interest for mobile
devices. Speech offers a welcome means to interact with smaller
devices, allowing one-handed and hands-free operation. Pen input
enables handwriting, gestures, drawings and specialized notations.
Users benefit from being able to choose which modalities they find
convenient in any situation. The Working Group should be of interest
to companies developing smart phones and personal digital assistants
or who are interested in providing tools and technology to support
the delivery of multimodal services to such devices.</dd>
<dt>Automotive Telematics</dt>
<dd>With the emergence of dashboard integrated high resolution color
displays for navigation, communication and entertainment services,
W3C's work on open standards for multimodal interaction should be
of interest to companies working on developing the next generation
of in-car systems.</dd>
<dt>Multimodal interfaces in the office</dt>
<dd>Multimodal has benefits for desktops, wall mounted interactive
displays, multi-function copiers, and other office equipment,
offering a richer user experience and the chance to use speech
and pens as alternatives to the mouse and keyboard. W3C's
standardization work in this area should be of interest to
companies developing client software and application authoring
technologies, and who wish to ensure that the resulting standards
live up to their needs.</dd>
<dt>Multimodal interfaces in the home</dt>
<dd>In addition to desktop access to the Web, multimodal interfaces
are expected to add value to remote control of home entertainment
systems, as well as finding a role for other systems around the home.
Companies involved in developing embedded systems and consumer
electronics should be interested in W3C's work on multimodal
interaction.</dd>
</dl>
<h2 id="status">Current Situation</h2>
<p>The Multimodal Interaction Working Group was launched in 2002
following a joint workshop between the W3C and the WAP Forum.
<!--Relevant W3C Member contributions have been received on SALT and X+V.-->
The Working Group's initial focus was on <a
href="/TR/mmi-use-cases/">use cases</a> and <a
href="/TR/mmi-reqs/">requirements</a>. This led to the publication
of the <a href="/TR/mmi-framework/">W3C Multimodal Interaction
Framework</a>, and in turn to work on <a href="/TR/emma/">extensible
multi-modal annotations</a> (EMMA), and <a href="/TR/InkML/">InkML</a>,
an XML language for ink traces. The Working Group has also worked
on integration of composite multimodal input; dynamic adaptation
to device configurations, user preferences and environmental
conditions (now transferred to the <a href="/2001/di/">Device
Independence Activity</a>); modality component interfaces; and a
study of current approaches to interaction management. The Working
Group has now been re-chartered through 31 January 2009 under the
terms of the <a href="/Consortium/Patent-Policy-20040205/">W3C
Patent Policy</a> (5 February 2004 Version). To promote the widest
adoption of Web standards, W3C seeks to issue Recommendations that
can be implemented, according to this policy, on a Royalty-Free
basis. The Working Group is chaired by <a
href="mailto:dahl@conversational-technologies.com">Deborah Dahl</a>.
The W3C Team Contact is <a href="mailto:ashimura@w3.org">Kazuyuki
Ashimura</a>.</p>
<h3>We want to hear from you!</h3>
<p>We are very interested in your comments and suggestions.
If you have implemented multimodal interfaces, please share
your experiences with us, as we are particularly interested
in reports on implementations and their usability for both
end-users and application developers. We welcome comments on any
of our published documents. If you have a proposal for
multimodal authoring language, please let us know. To <a
href="mailto:www-multimodal-request@w3.org?subject=subscribe">subscribe</a>
to the discussion list send an email to www-multimodal-request@w3.org
with the word subscribe in the subject header. Previous discussion
can be found in the public <a href=
"http://lists.w3.org/Archives/Public/www-multimodal/">archive</a>.
To <a
href="mailto:www-multimodal-request@w3.org?subject=unsubscribe">unsubscribe</a>
send an email to www-multimodal-request@w3.org with the word
unsubscribe in the subject header.</p>
<h3 id="join">How to join the Working Group</h3>
<p>If your organization is already a <a
href="/Consortium/Member/List">member of W3C</a>, ask your <a
href="/Member/ACList">W3C Advisory Comittee Representative</a> (<a
href="http://cgi.w3.org/MemberAccess/AccessRequest">member only
link</a>) to fill out the online registration form to confirm
that your organization is prepared to commit the time and expense
involved in particpating in the group. You will be expected to
attend all Working Group meetings (about 3 or 4 times a year) and
to respond in a timely fashion to email requests. Further details
about joining are available on the <a
href="./Group/Overview.html">Working Group</a> (<a
href="http://cgi.w3.org/MemberAccess/AccessRequest">member only
link</a>) page. Requirements for patent disclosures, as well as terms
and conditions for licensing essential IPR are given in the W3C
<a href="/Consortium/Patent-Policy-20040205/">Patent Policy</a>.</p>
<p>More information <a href="http://www.w3.org/Consortium/">about the W3C</a>
is available, as is information about <a
href="http://www.w3.org/Consortium/Prospectus/Joining">joining W3C</a>.</p>
<h3>Patent Disclosures</h3>
<p>W3C maintains a <a rel="disclosure"
href="http://www.w3.org/2004/01/pp-impl/34607/status">public list of
any patent disclosures</a> made in connection with the deliverables of
the group; that page also includes instructions for disclosing a
patent.</p>
<!--
The following organizations are currently participating in the <a
href="/2002/mmi/Group/">Working Group</a>:</p>
<blockquote>Access, Alcatel, Apple, Aspect, AT&amp;T, Avaya, BeVocal,
Brooktrout, Canon, Cisco, Comverse, EDS, Ericsson, France Telecom,
Fraunhofer Institute, HP, IBM, INRIA, Intel, IWA/HWG, KAIT, Kirusa,
Loquendo, Microsoft, Mitsubishi Electric, NEC, Nokia, Nortel Networks,
Nuance Communications, OnMobile Systems, Openstream, Opera Software,
Oracle, Panasonic, ScanSoft, Siemens, Sun Microsystems, Telera, Tellme
Networks, T-Online International, Toyohashi University of Technology,
V-Enable, Vocalocity, VoiceGenie Technologies, Voxeo</blockquote>
-->
<h2 id="plan">Revised publication target dates</h2>
<table class="roadmap" width="80%">
<tbody>
<tr>
<th rowspan="1" colspan="1">Specification</th>
<th rowspan="1" colspan="1"><acronym title="First Working Draft">FPWD</acronym></th>
<th rowspan="1" colspan="1"><acronym title="Last Call Working Draft">LC</acronym></th>
<th rowspan="1" colspan="1"><acronym title="Canditate Recommendation">CR</acronym></th>
<th rowspan="1" colspan="1"><acronym title="Proposed Recommendation">PR</acronym></th>
<th rowspan="1" colspan="1"><acronym title="Recommendation">Rec</acronym></th>
</tr>
<tr>
<th rowspan="1" colspan="1">
Multimodal Architecture and Interfaces
</th>
<td class="WD1" rowspan="1" colspan="1">
- <a href="/TR/2005/WD-mmi-arch-20050422/">Completed</a><br/>
- <a href="/TR/2006/WD-mmi-arch-20060414/">2nd WD</a><br/>
- <a href="/TR/2006/WD-mmi-arch-20061211/">3rd WD</a><br/>
- <a href="/TR/2008/WD-mmi-arch-20080414/">4th WD</a><br/>
- <a href="/TR/2008/WD-mmi-arch-20081016/">5th WD</a><br/>
- <a href="/TR/2009/WD-mmi-arch-20091201/">6th WD</a><br/>
- <a href="/TR/2010/WD-mmi-arch-20100921/">7th WD</a>
</td>
<td class="LC" rowspan="1" colspan="1">
<a href="/TR/2011/WD-mmi-arch-20110125/">Completed</a>
</td>
<td class="CR" rowspan="1" colspan="1">TBD</td>
<td class="PR" rowspan="1" colspan="1">TBD</td>
<td class="REC" rowspan="1" colspan="1">TBD</td>
</tr>
<tr>
<th rowspan="1" colspan="1">EMMA 2.0</th>
<td class="WD1" rowspan="1" colspan="1">4Q 2009</td>
<td class="LC" rowspan="1" colspan="1">January 2011</td>
<td class="CR" rowspan="1" colspan="1">TBD</td>
<td class="PR" rowspan="1" colspan="1">TBD</td>
<td class="REC" rowspan="1" colspan="1">TBD</td>
</tr>
<tr>
<th rowspan="1" colspan="1">EMMA</th>
<td class="WD1" rowspan="1" colspan="1">Completed</td>
<td class="LC" rowspan="1" colspan="1">Completed</td>
<td class="CR" rowspan="1" colspan="1">Completed</td>
<td class="PR" rowspan="1" colspan="1">Completed</td>
<td class="REC" rowspan="1" colspan="1">Completed<br/>(10 Feb. 2009)</td>
</tr>
<tr>
<th rowspan="1" colspan="1">InkML</th>
<td class="WD1" rowspan="1" colspan="1">Completed</td>
<td class="LC" rowspan="1" colspan="1">
<a href="http://www.w3.org/TR/2006/WD-InkML-20061023">
1st LC: Completed
</a>
<br/><br/>
<a href="http://www.w3.org/TR/2010/WD-InkML-20100527/">
2nd LC: Completed
</a>
</td>
<td class="CR" rowspan="1" colspan="1">
<a href="http://www.w3.org/TR/2011/CR-InkML-20110111/">
Completed
</a>
</td>
<td class="PR" rowspan="1" colspan="1">April 2010</td>
<td class="REC" rowspan="1" colspan="1">June 2010</td>
</tr>
<tr>
<th rowspan="1" colspan="1">EmotionML</th>
<td class="WD1" rowspan="1" colspan="1">
<a href="http://www.w3.org/TR/2009/WD-emotionml-20091029/">Completed</a><br/>
- <a href="http://www.w3.org/TR/2010/WD-emotionml-20100729/">2nd</a>
</td>
<td class="LC" rowspan="1" colspan="1">
<a href="http://www.w3.org/TR/2011/WD-emotionml-20110407/">Completed</a>
</td>
<td class="CR" rowspan="1" colspan="1">June 2011</td>
<td class="PR" rowspan="1" colspan="1">TBD</td>
<td class="REC" rowspan="1" colspan="1">TBD</td>
</tr>
<tr>
<th rowspan="1" colspan="1">Ink Modality Component Definition</th>
<td class="WD1" rowspan="1" colspan="1">
<a href="http://www.w3.org/TR/2011/NOTE-mmi-mcbp-20110301/">
Completed (as a WG Notes)
</a>
</td>
<td class="LC" rowspan="1" colspan="1">-</td>
<td class="CR" rowspan="1" colspan="1">-</td>
<td class="PR" rowspan="1" colspan="1">-</td>
<td class="REC" rowspan="1" colspan="1">-</td>
</tr>
<tr>
<th rowspan="1" colspan="1">Voice Modality Component Definition</th>
<td class="WD1" rowspan="1" colspan="1">December 2009<br/>(as a WG Notes)</td>
<td class="LC" rowspan="1" colspan="1">-</td>
<td class="CR" rowspan="1" colspan="1">-</td>
<td class="PR" rowspan="1" colspan="1">-</td>
<td class="REC" rowspan="1" colspan="1">-</td>
</tr>
</tbody>
</table>
<h2 id="work">Work in Progress</h2>
<p>This is intended to give you a brief summary of each of the
major work items under development by the Multimodal Interaction
Working Group. The suite of specifications is known as the <i>W3C
Multimodal Interaction Framework</i>.</p>
<ul>
<li>Introduction,
6 May 2003. The <a href="/TR/mmi-framework/">Multimodal Interaction
Framework</a> introduces a general framework for multimodal
interaction, and the kinds of markup languages being considered.</li>
<li>Use cases,
4 December 2002. <a href="/TR/mmi-use-cases/">Multimodal Interaction
Use Cases</a> describes several use cases that are helping us to
better understand the requirements for multimodal interaction.</li>
<li>Core requirements,
8 January 2003. <a href="/TR/mmi-reqs/">Multimodal Interaction
Requirements</a> describes fundamental requirements for the
specifications under development in the W3C Multimodal Interaction
Activity.</li>
</ul>
<h3 id="current">Current Work</h3>
<p>The following indicates current work items. Additional work is
expected on topics described in <a
href="/2004/03/mmi-charter.html#scope">section 4</a> of the charter,
including multimodal authoring, modality component interfaces,
composite multimodal input, and coordinated multimodal output.</p>
<h4 id="mmi-architecture">Multimodal Architecture</h4>
<h5 id="mmi-arch">Main Architecture draft</h5>
<ul>
<li><a href="/TR/modality-interface/">Requirements and Capabilities</a>, 10 May 2004</li>
<li><a href="/TR/2005/WD-mmi-arch-20050422/">First Public Working Draft</a>, 22 April 2005</li>
<li><a href="/TR/2006/WD-mmi-arch-20060414/">Second Working Draft</a>, 14 April 2006</li>
<li><a href="/TR/2006/WD-mmi-arch-20061211/">Third Working Draft</a>, 11 December 2006</li>
<li><a href="/TR/2008/WD-mmi-arch-20080414/">Fourth Working Draft</a>, 14 April 2008</li>
<li><a href="/TR/2008/WD-mmi-arch-20081016/">Fifth Working Draft</a>, 16 October 2008</li>
<li><a href="/TR/2009/WD-mmi-arch-20091201/">Sixth Working Draft</a>, 1 December 2009</li>
<li><a href="/TR/2010/WD-mmi-arch-20100921/">Seventh Working Draft</a>, 21 September 2010</li>
<li><a href="/TR/2011/WD-mmi-arch-20110125/">Last Call Working Draft</a>, 25 January 2011</li>
</ul>
<p>A loosely coupled architecture for the <a
href="/TR/mmi-framework/">Multimodal Interaction Framework</a>
that focuses on providing a general means for components to
communicate with each other, plus basic infrastructure for
application contrl and platform services. Work is continuing
on how the architecture can be realized in terms of well defined
component interfaces and eventing models.</p>
<h5 id="mmi-auth">MMI Authoring</h5>
<ul>
<li><a href="/TR/2006/NOTE-mmi-dev-feedback-20060414/">Multimodal Application Developer Feedback Working Group Note</a>, 14 April 2006</li>
<li><a href="/TR/2006/NOTE-mmi-suggestions-20060911/">Common Sense Suggestions for Developing Multimodal User Interfaces Working Group Note</a>, 11 September 2006</li>
<li><a href="http://www.w3.org/TR/2008/NOTE-mmi-auth-20080702/">Authoring Applications for the Multimodal Architecture Working Group Note</a>, 2 July 2008</li>
</ul>
<h5 id="mmi-mcbp">MMI Authoring</h5>
<ul>
<li><a href="http://www.w3.org/TR/2011/NOTE-mmi-mcbp-20110301/">Best practices for creating MMI Modality Components Working Group Note</a>, 1 March 2011</li>
</ul>
<h4 id="emma">Extensible Multi-Modal Annotations (EMMA)</h4>
<h5>- EMMA 1.0</h5>
<ul>
<li><a href="/TR/EMMAreqs/">Requirements</a>, 13 January 2003</li>
<li><a href="/TR/emma/">Last Call Working Draft</a>, 16 September
2005</li>
<li>
<a href="/TR/2007/WD-emma-20070409/">Second Last Call
Working Draft</a>, 9 April 2007
</li>
<li>
<a href="/TR/2007/CR-emma-20071211/">
Candidate Recommendation</a>, 11 December 2007
</li>
<li>
<a href="/TR/2008/CR-emma-20081215/">
Proposed Recommendation</a>, 15 December 2008
</li>
<li>
<a href="/TR/2009/REC-emma-20090210/">
Recommendation</a>, 10 February 2009
</li>
</ul>
<p>EMMA has been developed as a data exchange format for the interface
between input processors and interaction management systems. It will
define the means for recognizers to annotate application specific data
with information such as confidence scores, time stamps, input mode
(e.g. key strokes, speech or pen), alternative recognition hypotheses,
and partial recognition results etc. EMMA is a target data format for
the
<a href="/TR/semantic-interpretation/">semantic interpretation
specification</a> being developed in the <a href="/Voice/">Voice
Browser Activity</a>, and which describes annotations to speech
grammars for extracting application specific data as a result
of speech recognition. EMMA supercedes earlier work on the
natural language semantics markup language in the Voice Browser
Activity.</p>
<h5>- EMMA 2.0</h5>
<ul>
<li>
<a href="/TR/2009/NOTE-emma-usecases-20091215/">
Use Cases for Possible Future EMMA Features
</a> Working Group Note, 15 December 2009
</li>
</ul>
<p>
Since
<a href="/TR/2009/REC-emma-20090210/">EMMA 1.0</a> became a W3C
Recommendation, a number of new possible use cases for the EMMA
language have emerged. These include the use of EMMA to represent
multimodal output, biometrics, emotion, sensor data, multi-stage
dialogs, and interactions with multiple users.
So the Working Group have decided to work on a document capturing use
cases and issues for a series of possible extensions to EMMA, and
published a Working Group Note to seek feedback on the various
different use cases.
</p>
<h4 id="ink">InkML - an XML language for digital ink traces</h4>
<ul>
<li><a href="/TR/inkreqs/">Requirements</a>, 22 January 2003</li>
<li><a href="/TR/2004/WD-InkML-20040928/">
Working Draft</a>, 28 September 2004</li>
<li><a href="http://www.w3.org/TR/2006/WD-InkML-20061023/diff.html">
Last Call Working Draft</a>, 23 October 2006</li>
<li><a href="http://www.w3.org/TR/2010/WD-InkML-20100527/">
Second Last Call Working Draft</a>, 27 May 2010</li>
</ul>
<p>This work item sets out to define an XML data exchange format
for ink entered with an electronic pen or stylus as part of a
multimodal system. This will enable the capture and server-side
processing of handwriting, gestures, drawings, and specific
notations for mathematics, music, chemistry and other fields,
as well as supporting further research on this processing. The
<a href="ink.html">Ink subgroup</a> maintains a separate public
page devoted to W3C's work on pen and stylus input.</p>
<h4 id="emotion">Emotion Markup Language (EmotionML) 1.0</h4>
<ul>
<li><a href="/TR/2009/WD-emotionml-20091029/">First Public Working Draft</a>, 29 October 2009</li>
<li><a href="/TR/2010/WD-emotionml-20100729/">Second Working Draft</a>, 29 July 2010</li>
<li><a href="/TR/2011/WD-emotionml-20110407/">Last Call Working Draft</a>, 7 April 2011</li>
</ul>
<p>
EmotionML will provide representations of emotions and related states
for technological applications.
As the web is becoming ubiquitous, interactive, and multimodal,
technology needs to deal increasingly with human factors, including
emotions.
The language is conceived as a "plug-in" language suitable for use in
three different areas: (1) manual annotation of data; (2) automatic
recognition of emotion-related states from user behavior; and (3)
generation of emotion-related system behavior.
</p>
<h2 id="refs">Related Materials</h2>
<h3 id="workshops">Workshops</h3>
<ul>
<li>
<a href="http://www.w3.org/2010/10/emotionml/cfp.html">
Workshop on Emotion Markup Language</a>
was held
on 5-6 October 2010 in Paris, France, hosted by Telecom ParisTech.
The <a href="/2010/10/emotionml/summary.html">summary</a>
and <a href="/2010/10/emotionml/minutes.html">
detailed minutes</a> are available online.</li>
<li>
<a href="/2010/02/convapps/cfp.html">
Workshop on Conversational Applications
</a>
was held on 18-19 June 2010 in Somerset, New Jersey, US hosted by Openstream.
The
<a href="/2010/02/convapps/agenda.html">Agenda</a>,
the
<a href="/2010/02/convapps/minutes.html">Minutes</a> and
the
<a href="/2010/02/convapps/summary.html">Summary</a>
are available.
</li>
<li>
<a href="http://www.w3.org/2007/08/mmi-arch/cfp.html">
Workshop on W3C's Multimodal Architecture and Interfaces
</a>
was held on 16-17 November, 2007 at Keio University in Japan.
The
<a href="http://www.w3.org/2007/08/mmi-arch/agenda.html">Agenda</a>,
the
<a href="http://www.w3.org/2007/08/mmi-arch/minutes.html">Minutes</a>,
the
<a href="http://www.w3.org/2007/08/mmi-arch/summary.html">Summary</a>
and
the
<a href="http://www.w3.org/2007/08/mmi-arch/topics.html">Issue List
raised during the workshop
</a>
are available.
</li>
</ul>
<h3 id="presentations">MMI related presentations</h3>
<ul>
<li>
Jerry Carter (Nuance), Rafah Hosn (IBM) and Kaz Ashimura (W3C)
gave talks on
<a href="http://www.w3.org/2007/05/w3c-track.html#mmi">
"Multimodal Web to Expand Universal Access"
</a>
on 11 May 2007 during W3C Track in WWW2007 Conference, Banff, Canada.
</li>
<li>
<a href="/2006/Talks/1024-srig-inkml/InkML-IWFHR06.pdf">
InkML slides</a>
were presented
on 24 October 2006 at IWFHR 2006 Conference.
</li>
<li>
<a href="/2005/03/MWeb-seminar.html">
W3C Seminar on Multimodal Web Applications for Embedded Systems
</a> was held on 21 June 2005.
</li>
<li>
<a href="http://www.w3.org/2004/02/mmi-workshop-cfp.html">
W3C Workshop on Multimodal Interaction
</a>
was held on 19-20 July 2004 in
Sophia Antipolis, France. (<a href="/2004/02/mmi-workshop/Agenda">schedule</a>, <a href="/2004/02/mmi-workshop/papers">papers</a>.)
</li>
<li>
<a href="/2004/MWeb/">IST-FP6-001895 "Multimodal Web
Interaction" (MWeb) Project</a>: A W3C initiative funded by the
European Commission in support of the development and adoption
of W3C standards that enable multimodal Web access via mobile
devices. MWeb includes European outreach and the development
of demonstrators.
</li>
<li>
<a href="http://www.openstream.com/w3c/mmi.swf">Openstream
Multimodal Interaction use case demo</a> (Macromedia Flash video).
</li>
<li>
The W3C <a href="/Voice/Group/">Voice Browser working group</a>
published a set of <a href="/TR/multimodal-reqs">requirements for
multimodal interaction</a> in July 2000. The working group also
invited participants to demonstrate proof of concept examples of
multimodal applications. A number of such demonstrations were shown
at the working group's face to face meeting held in Paris in May
2000.
</li>
<li>
To get a feeling for future work, the W3C together with the WAP
Forum held a joint <a href="/2000/09/Papers/Agenda.html">workshop
on the Multimodal Web</a> in Hong Kong on late 2000. This workshop
addressed the convergence of W3C and WAP standards, and the
emerging importance of speech recognition and synthesis for the
Mobile Web. The <a
href="/2000/09/Papers/Meeting/recommendations.html">workshop's
recommendations</a> encouraged W3C to set up a multimodal working
group to develop standards for multimodal user interfaces for the
Web.
</li>
<li>
The IETF <a href=
"http://www.ietf.org/html.charters/speechsc-charter.html">
Speech Services Control (SpeechSC)</a> working group is developing
protocols to support distributed speech recognition, speech
synthesis and speaker verification services, and expects to take
advantage of W3C's work on the speech recognition grammar
specification (<a href="/TR/speech-grammar/">SRGS</a>), the speech
synthesis markup language (<a href="/TR/speech-synthesis/">SSML</a>),
semantic interpretetation (<a href="/TR/semantic-interpretation/">SI</a>)
and extensible multimodal markup annotations (<a href=
"#emma">EMMA</a>). ETSI's <a href=
"http://portal.etsi.org/stq/kta/DSR/dsr.asp">STQ Aurora project</a>
is looking at codecs optimized for distributed speech recognition.
See also <a href="/2005/05/DSR.pdf">David Pearce's presentation on
DSR</a> to the W3C VB/MMI working groups on 25th May 2005.
</li>
<li>
ETSI standard <a href=
"http://docbox.etsi.org/EC_Files/EC_Files/es_202076v010102p.pdf">ES
202076</a> defines a generic spoken command vocabulary for
controlling common operations such as calling someone by saying their
name, browsing through a voice mail box, adjusting the volume, muting
the microphone and other device properties. ETSI provide bindings for
the vocabulary to a variety of human languages. This suggests the
possibility of device-based recognition for common spoken commands
together with network based recognition for other vocabularies.
</li>
<li>
Another idea is to couple a local graphical user interface
with a remote voice dialog engine, perhaps based upon <a href=
"/TR/voicexml20/">VoiceXML</a>. Here the idea is to allow events
to be passed between the device and the remote dialog engine. To
the application developer, these events would look just the same
whether they originated locally or remotely. In this approach,
events can be used to initiate a range of actions, for instance,
changing the focus of interaction, setting the value of a form
field, loading a new page, or altering the current page via the
<a href="/DOM/">DOM</a>. W3C work on <a href="/TR/rex/"
title="Remote Events for XML">REX</a> aims to provide an XML
grammar for DOM events with a view to supporting distribution
of events, and in principle, could be used to couple different
modality components.
</li>
<li>
SIP can also be used to synchronize several devices, for
instance to update the display on a PDA, automotive or desktop
system in concert with the much smaller display on a cellphone.
When it comes to setting up a session that potentially involves
multiple devices and servers, SIP looks like it will provide an
effective solution together with server-side scripts. The Voice
Browser working group's work on <a href="/Voice/#callctrl">call
control</a> may prove valuable.
</li>
<li>
<a href="http://docbox.etsi.org/EC_Files/EC_Files/eg_202191v010101p.pdf">ETSI
EG 202 191 - V1.1.1 - Human Factors (HF); Multimodal interaction,
communications and navigation guidelines</a> (PDF). A study of design
principles for multimodal applications with a focus on accessibility.
Published August 2003.
</li>
<li>
<a href="http://www.w3.org/2002/08/InkXML/">InkXML specification</a>
(<a href="http://cgi.w3.org/MemberAccess/AccessRequest">W3C Members
only</a>) contributed to W3C on 16th August 2002 by IBM, Intel, the
International Unipen Foundation, and Motorola, Inc. InkXML is a markup
language for the exchange of virtual ink, conveying such information
as the kind of pen, the color of the ink and the nature of the medium,
the pressure applied to the pen, its position and speed. InkXML can
be used to exchange virtual ink among devices, such as handhelds,
laptops, desktops, and servers. InkXML is intended to provide the
ink component of Web-based multimodal applications. The working group
consensus process will determine which ideas in InkXML will be taken
up within W3C. W3C Members can view the <a href="
http://lists.w3.org/Archives/Member/w3c-archive/2002Aug/0037.html">
contribution letter</a>.
</li>
<li>
<a href="2002/MM-Arch-Maes-20010820.pdf">Multimodal browser
architecture</a> (PDF) by St&eacute;phane Maes (IBM), dated 20th August
2001. Makes the case for using the model-view-controller paradigm
and presents a variety of architectures for synchronization across
modalities and devices. This is the presentation (T2-010705) that
St&eacute;phane gave to the 3GPP T2 meeting in September 2001.
</li>
<li>
<a href="2002/siemens-26nov01.pdf">Multimodal access position
paper</a> (PDF) by Nathalie Amann, Laurent Hue and Klaus Lukas
(Siemens), dated 26th November 2001. Describes a possible
architectures architecture for multimodal interaction based upon
coupling a visual client with a VoiceXML interpreter.
</li>
<li>
<a href="2002/smil-rexx.pdf">Towards SMIL as a foundation for
for multimodal, multimedia applications</a> (PDF), by Jennifer
Beckham (University of Wisconsin), Giuseppe Di Fabbrizio, and Nils
Klarlund (AT&amp;T Labs), dated 1 October 2001. Shows how SMIL can
provide fine grained synchronization control for multimodal
interaction. The approach combines SMIL with markup for control of
speech engines.
</li>
<li>
<a href="/Submission/2001/13/">XHTML+Voice</a>, W3C Submission
by IBM, Motorola and Opera Software. Dated 30th November 2001.
Shows how markup for XHTML and VoiceXML can be combined to support
multimodal interaction. An <a href=
"http://www.ibm.com/software/pervasive/multimodal/x%2Bv/11/spec.htm">
updated version</a> was contributed to the Voice Browser and
Multimodal Interaction working groups on 11th March 2003, see the
<a href="/Submission/2001/13/Comment.html">Team Comment</a> for
details of associated IPR disclosures. W3C Members can view the <a href=
"http://lists.w3.org/Archives/Member/w3c-mmi-wg/2003Mar/0044.html">
contribution letter</a>.
</li>
<li>
The <a href="http://www.saltforum.org/">SALT Forum</a> was
launched on 15th October 2001 with a mission to develop standards
for speech enabling HTML, XHTML and SMIL. More recently, it has
been applied to speech enabling <a href="/Graphics/SVG/">SVG</a>.
The <a href="http://www.saltforum.org/saltforum/downloads/SALT1.0.pdf">SALT
1.0 specfication</a> was contributed to the Multimodal Interaction
and Voice Browser working groups on 31rd July 2002, and the
working group consensus process will determine which ideas in SALT
will be taken up within W3C. W3C Members can view the <a href=
"http://lists.w3.org/Archives/Member/w3c-mmi-wg/2002Jul/0123.html">
contribution letter</a>. The SALT+SVG profile was provided as a
subsequent contribution.
</li>
<li>
<a href="http://www.3gpp.org/">3GPP</a> is studying different
ways to include speech-enabled services comprising both speech-only
and multimodal services in 3G networks. One option for distributed
speech recognition is based on the <a
href="http://portal.etsi.org/stq/kta/DSR/dsr.asp">ETSI's STQ
Aurora</a> developments. Other options are dependent on the general
study on speech enabled services. 3GPP may be interested in working
on integrating remote access to speech synthesis resources. W3C
should keep a watching brief. There is a possible connection to the
<a href="http://www.ietf.org/html.charters/speechsc-charter.html">IETF
Speech Services Control Working Group</a> (SpeechSC), which is
developing protocols for distributed access to speech synthesis,
recognition and speaker verification services (<acronym title=
"Media Resource Control Protocol">MRCP</acronym>)
</li>
</ul>
<p>For more details on other organizations see the <a
href="/2009/05/mmi-charter.html">Multimodal Interaction
Charter</a>.</p>
<h3 id="contacts">W3C Team Contact</h3>
<address>
<a href="mailto:kazuyuki@w3.org">Kazuyuki Ashimura</a> &lt;<a
href="mailto:ashimura@w3.org">ashimura@w3.org</a>&gt;
- Multimodal Interaction Activity Lead
</address>
<hr />
<div class="policyfooter"><small><a rel="Copyright"
href="/Consortium/Legal/ipr-notice#Copyright">Copyright</a> &copy;
2002-2008 <a href="/"><acronym
title="World Wide Web Consortium">W3C</acronym></a><sup>&reg;</sup>
(<a href="http://www.csail.mit.edu/"><acronym
title="Massachusetts Institute of Technology">MIT</acronym></a>, <a
href="http://www.ercim.org/"><acronym
title="European Research Consortium for Informatics and Mathematics">ERCIM</acronym></a>,
<a href="http://www.keio.ac.jp/">Keio</a>), All Rights Reserved. W3C <a
href="/Consortium/Legal/ipr-notice#Legal_Disclaimer">liability</a>, <a
href="/Consortium/Legal/ipr-notice#W3C_Trademarks">trademark</a>, <a
rel="Copyright" href="/Consortium/Legal/copyright-documents">document
use</a> and <a rel="Copyright"
href="/Consortium/Legal/copyright-software">software licensing</a>
rules apply. Your interactions with this site are in accordance with
our <a href="/Consortium/Legal/privacy-statement#Public">public</a> and
<a href="/Consortium/Legal/privacy-statement#Members">Member</a>
privacy statements. This page was last updated on
<!--===================================================================-->
$Date: 2011/10/18 10:06:03 $</small></div>
</body>
</html>