RT-02 Annotation Data Samples
The goal of the RT-02 evaluation is to automatically build "rich transcripts" which means that recognition systems must generate both word sequences and higher levels of annotation.
There are three sets of data, one for each RT domain, Switchboard, Broadcast News and Meeting Room. For each domain there are three, nominally 100 second samples. As new annotation types are defined by NIST for the RT evaluation, the transcripts for each sample will be updated. Currently, speaker change information is the only annotation type.
Page Created: September 18, 2007
is part of
NIST is an agency of the U.S. Department of Commerce
Accessibility Statement | Disclaimer | FOIA