DAISY Consortium Logo - Link to Home Page

Z39.86 Advisory Committee Face-to-face Meeting Summary: Copenhagen February 2007

Author: George Kerscher
Last revised: 14 March 2007
Status: draft 3

Introduction

 

The Advisory Committee for the DAISY/NISO Standard, formally the ANSI/NISO Z39.86 Specifications for the Digital Talking Book, met in Copenhagen February 7-9 2007. This was the first face-to-face meeting since the Spring of 2004. The committee has been meeting monthly to address primarily maintenance issues. This face-to-face meeting was called to analyze the current state of the standard, it's relation to other standards and technological developments, and to consider if the standard required revision.

Thank You

We would like to gratefully thank the organizations who made these experts available to attend this meeting. We would also like to thank DBB for providing their facilities for the meeting; Anita Schoenbaum, from DBB, was instrumental in making the various arrangements. Finally we would like to thank the experts for all of their hard work. Without this contribution, the standard could not survive.

Attending

Expert's Observations

The Zed committee, as it has become known, took a high level approach to the analysis that needed to be conducted. All aspects of the standard, its strengths and weaknesses, the user experience, including the cost of playback systems, and production of audio and text content were considered. It was an open and honest series of discussions and debates, in a professional and constructive environment.

Adoption

DAISY 2.02, with audio plus the navigation center has enjoyed huge success that surpasses any assistive technology used in the blindness field. It is estimated that 100,000 screen readers are in use worldwide. There are at least double or triple that number of DAISY hardware players that have been sold. Include the software players, and the number of people who are using MP3 players (inferior, but still usable) to read DAISY content, it is clear that we have a widely adopted standard that is entrenched in the blindness community and in libraries serving persons with disabilities.

From a technical perspective, The DAISY/NISO 2005 Standard is clearly superior but there has been a lack of adoption. There is a lack of authoring tools and support in reading systems. Also, the benefits would need to be extremely significant for organizations to bear the costs of upgrading. Upgrading includes not only the player upgrades, but the support to the end users, the content upgrades, and the internal systems needed in the transition process.

With the majority of content in the DAISY world being audio plus navigation, the benefits of upgrading to the DAISY/NISO Standard are just not there. When an organization starts to focus on text, then the benefits of the XML in the Zed standard are extremely clear. It has been noted that organizations, like DBB, will author exclusively in Zed, but then downgrade to DAISY 2.02 for distribution, because of the support in players; they keep their collection in the Zed standard. Organizations who have a text focus use the Zed standard at the center of their production process.

Player manufacturers recognize that the DAISY Standards, even 2.02 are complex. This requires more hardware resources to be available. It also requires more software development. Simple MP3 players have dedicated chips for MP3 playback and do not require much additional hardware resources for playing music or the consecutive playback of pages or chapters; their reading experience is also not as good. So there is a trade off between the cost of players and their capabilities. It was observed that DAISY like capabilities could be provided by a play list approach, but there are many of these specifications and no clear leader. Also, most of these are geared towards music, and not the reading of publications.

Persons with dyslexia and other reading issues are not well served by the current implementations. Images for example can be supported inline, but not fixed in a SMIL region; the Zed specification does not define layout regions, which the larger SMIL spec does. Persons who can see want a good visual experience. Some have attempted to do this, but under the current spec it is unclear on how best to do this.

A Specification that Supports Profiles and Modularization

Consider three categories of content: one that is primarily audio, one that is text, and one that is audio synchronized with text and images. Let's call these content profiles. If we took the existing Zed specification and broke it into modules, these three types of content profiles could be created from the modules.

An audio profile could be created that provides the same functionality that we have in the rich Zed specification. However, in addition to the pure Zed approach that uses SMIL, a series of fallbacks could be defined that enables a player or reading system to pick the one they understand. A fallback could be defined that does not require SMIL processing, but still enables hierarchical and page navigation. Other fallbacks could include the existing competing specifications for reading audio boo markup-like content. All of the formats could exist on a single media, like CD-ROM, or it could be downloaded for online distribution. All the fallback files can be generated by the rich Zed formatted content. Online delivery would also allow content negotiation so that only the files needed for the end users' player or reading system would be delivered.

A fly weight or feather weight player could be used to play this content. It would not necessarily use the full richness of the Zed formatted content, but it would still work effectively. A more powerful player would select the highest level specification to provide a full featured reading experience. This also addresses the difficult issue of extremely complex books. If a player does not have enough resources to process something like a TV listings or a train schedule, it could use the fallback mechanisms and not fail or take too long to respond. Of course, this approach is designed to have players and reading systems with different price points and competing feature sets.

A text profile could also be defined that does not require the use of SMIL. The International Digital Publishing Forum (IDPF) is already building a spec that is modular and is looking to adopt the NCX and dtbook (the text for Zed content). It would be much easier for everybody involved if the various components of the existing Zed specification were modularized and abstracted for more general use.

The SMIL working group, operating under the W3C, has been developing a DAISY multimedia profile that we can use. This would support the development of a full text and audio profile.

In addition, the modularization of the specification would allow us to resolve problems that we have today; we could:

Two Parallel Development Tracks

A lengthy discussion developed that came to the conclusion that there are essentially two development tracks:

  1. Modularization of existing spec to support profiles. Includes full profile (equivalent to Z39.86-2005), audio/featherweight profile, text-only profile, with support for fallbacks and scalability.
  2. Gathering of requirements for next version of Zed spec. Focus on wider range of disability groups and publication types.

NISO has undergone a complete administration change, including a new president and administrative staff. George has had a conference call with the new president Todd Carpenter and primary contact Karen Wetzel. They have our letter and their Board decision making the DAISY Consortium the maintenance agency for the Z39.86 Standard, but they would like to work with us to revise the agreement and bring the operations of the committee into alignment with their new policies and procedures. Regarding the development of the MathML extension, they expressed concern that this was not publicized to NISO membership. They want it to be open, transparent, and moved into the mainstream. All of these criteria provide excellent opportunities for advancing the standard. It also is a perfect opportunity to start the next spec development under the new administration. A lengthy discussion followed which is reflected in the action items.

Time Line

  1. Delivery of this document to the DAISY Board: March 2007
  2. Consideration, direction, approval by DAISY Board: April 31, 2007
  3. NISO Committee Reconstitution: due July 2007
  4. Announcement of Specification Development : due June 2007
  5. Requirements Gathering Launch: June 2007
  6. Architecture of Modular Specification published: October 2007 (Techshare)
  7. Alpha of New Standard: March 2008
  8. Beta of New Standard (requirements close): December 2008
  9. New Standard launched simultaneously with production tools, playback and reading system support, validation tools, sample content, guidelines, upgrade utilities, and training materials: October 2009

Action Items

Maintenance activities
A sub-committee will meet to address submitted issues. Proposed resolutions will be provided to the full committee by email. A time limit will be specified for comment and if an issue warrants consideration by the full committee, it will be placed on the agenda. Members include: Ole, JP, Linus, GK, Lynn Leith, Kathy or Laurie.
 
NISO Formalization
  1. Contact NISO and set up a time line for resolving administrative issues. George, Peter, Lynn, and Kathy to work on this.
  2. A letter of understanding between NISO and DAISY as the maintenance agency is needed.
  3. A document which describes the committee formalities is also needed. It was suggested that the number of voting members could be between 9 and 15 with no distinction between members and invited experts. Any vacancies will be announced to NISO and the committee will consider applicants. A limit of two representatives from any company or organization with two year terms was suggested.
  4. Committee members would be: James Pritchett, Markus Gylling, Rob Longstaff, George Kerscher, Thomas Kjellberg Christensen, Ole Holst Andersen, Linus Ericson, Marisa DeMeglio, Boris Goldowsky, Neil and Lloyd from NLS. These people need to visit with their bosses to confirm continued participation.
  5. Revision of the spec, a possible new name, processes for fast track, alpha and beta releases, and calls for participation need to be discussed with NISO.
 
Codec Research
We need to review what codecs we should include in the future specification. It is important that we keep the number of codecs to the minimum and only include realistic options that will be used. MP3 is a given one, although MP3 files might be too large in some cases. AMR was put forward as a suggestion. It should give the same quality as MP3 using half the bit rate. Before we decide what codecs to include we need to evaluate e.g. Speex. FYI, the DVD "consortium" has decided to only support 4 codecs: PCM, DTS, MPEG L2, Dolby AC3.
Dominic Labbé will lead the group doing codec research. RNIB will provide resources.
Submit an issue into the issues tracking database about AAC. We should provide information that we do not recommend the implementation of this codec.
 
XML Text Production Support
DAISY needs to make more effort to show best-practices on how to create DTBook from other formats. More training sessions are needed.