<%@LANGUAGE="JAVASCRIPT" CODEPAGE="65001"%> NIST Speech Group Website
Information Technology Lab, Information Access Division NIST: National Institute of Standards and Technology


  • Multimodal Information Group Home
  • Benchmark Tests
  • Tools
  • Test Beds
  • Publications
  • Links
  • Contacts
  • Topic Detection Task

    The topic detection task is an experimental abstraction of a story clustering TDT system. The goal of a detection system is to group together stories that discuss the same event. In the graphic at the bottom, the red circles represent stories discuss one event, and the green diamonds are stories that discuss another event.

    While considerable clustering research has been based on global clustering, (i.e. clustering stories over an entire data set), TDT clustering is done incrementally which means that for a given source file of stories, you can look ahead only by a specified number of 'days' before making a final decision.

    Incremental clustering can be broken down into two phases: detecting when a new event is seen and putting stories that discuss previously seen events into appropriate clusters. Since the first phase is a difficult and interesting challenge, the First Story Detection evaluation task was instantiated to evaluate how well systems can detect the first story of an event.

    To learn more about technology used to do TDT topic detection, or to learn how well these systems perform, there were several system description papers submitted for the 1999 TDT workshop. Also, the TDT Research Links contains pointers to related research.

     

     

    Page Created: August 21, 2007
    Last Updated: November 4, 2008

    Multimodal Information Group is part of IAD and ITL
    NIST is an agency of the U.S. Department of Commerce
    Privacy Policy | Security Notices|
    Accessibility Statement | Disclaimer | FOIA