Questions and Answers concerning the Information Retrieval Task.

Q: What are the file formats used for the SDR99 Information Retrieval Task ?

A: All SDR retrieval must be submitted to NIST in the standard TREC ret format as follows :

The ret format is a space delimited ASCII table :

Here is a sample of a ret file:
23 Q0 19980104_1130_1200_CNN_HDL.0034 1 4238 ibm-cr-att-s1
23 Q0 19980105_1800_1830_ABC_WNT.0143 2 4223 ibm-cr-att-s1
23 Q0 19980105_1130_1200_CNN_HDL.1120 3 4207 ibm-cr-att-s1
23 Q0 19980515_1630_1700_CNN_HDL.0749 4 4194 ibm-cr-att-s1
23 Q0 19980303_1600_1700_VOA_WRP.0061 5 4189 ibm-cr-att-s1
...

For the Unknown Story Boundary condition, the ret file must be generated by UIDmatch.pl. The input format for UIDmatch.pl is basically the same as ret. Only the StoryID field is replaced by a timeID TimeId = <episodeID>:<Time-in-seconds.hundredths>
Example : 19980104_1130_1200_CNN_HDL:34.14

Back to top.

Q: What are the tools available ?

A: Different tools are available, as follows:

Back to top.

Q: How can I score my results ?

Back to top.