Why Information Retrieval Visualizations Sometimes Fail
|
Emile L. Morse elm2@sis.pitt.edu |
Michael Lewis ml@sis.pitt.edu |
Department of Information Science
University of Pittsburgh
Pittsburgh, PA 15260
Abstract
Full-featured visual interfaces that assist in performing the tasks of information retrieval are widely distributed. The proliferation of sources of information on the Internet and the fact that these resources are large and poorly structured led us to propose that adapting some of the tools that have already been developed for browsing and organizing information spaces might be worthwhile.
In our experimental environment we can only test and evaluate the use of visual information retrieval interfaces (VIRls) by naive or briefly trained subjects. For users of this type, full-featured VIRIs have led to poorer performance and satisfaction ratings than simple textual retrieval interfaces. Our experiments with de-featured VIRls are significant because:
Introduction
Researchers in information retrieval (IR) have long searched for ways to make their system more accessible to end users and to develop new ways for users to explore data. Visualization techniques (computer methods for displaying large quantities of information graphically) appear promising as a means for achieving both goals. Information visualization can make multidimensional relationships that are difficult to extract from tabular data apparent to a trained searcher. However, unlike scientific visualizations which are largely developed and used within elite specialties, IR visualizations are targeted toward guiding the public through newly accessible oceans of online information as well. Work on IR visualization systems is at a relatively early stage. Systems such as Bead [1, 2], InfoCrystal [12, 13], and LyberWorld [3], have recently been developed as visual information exploration tools to aid in retrieval tasks. Researchers at the University of Pittsburgh have contributed to the development of information visualization systems with VIBE [11], GUIDO [6, 10], and BIRD [4, 5].
Related Research
Previous user study research into IR visualization systems has largely focused on formative usability evaluations to provide feedback for system enhancement. Newby [9] tested subjects with the SPACE IR visualization system and standard text-based system called Prism. His findings showed that subjects preferred the Prism system for many of the tasks performed. Spoerri [12, 13] tested InfoCrystal’s interface with a standard Boolean interface. Ten users performed query recognition tasks and query generation tasks. Results show that new users could use the InfoCrystal interface after receiving a short tutorial, InfoCrystal performed as well as the more traditional Boolean interface, and the user feedback was positive.
Koshman [7, 8] conducted extensive comparisons between VIBE and AskSam, a commercially available text-based retrieval interface, using expert online searchers recruited from local libraries and novice searchers. Results showed that performance differences between AskSam and VIBE were minimal. Novices and experts performed the test tasks with similar speed and accuracy. There was some evidence that particular tasks were better suited to performance in the graphical environment while others were more appropriate for the text-based tool.
Results of preference measures were disturbing in that users in both groups rated the textual interface slightly higher while we had expected a preference for the graphical interface whatever its effect on performance. We believe that the expanded capabilities offered by visual browsing are masked in comparisons of this sort which limit tasks to those which can be performed using text-based interfaces. For tasks expressible as Boolean queries a text-based interface may be both more direct and less complicated and less difficult to learn than a full featured VIRI.
Kim [4, 5] examined BIRD usability in three experiments. The first assessed correlations between performance with BIRD and reasoning skill test scores. The second experiment examined users' performance in remembering how to use the system. The third experiment compared subjects' performance in constructing Boolean queries using BIRD and an informationally equivalent text-based interface for contextually constrained (shapes) and unconstrained (vitamins) data sets. Subjects displayed unexpected difficulties in using the VIRI and reported preferences favoring the text-based interface. In reviewing these results and details of the interface design Kim [4, 5] identified a variety of features that might be related to the difficulties. These include the complexity of sequential query construction as compared to nonsequential specification, the irreversibility of some operations, and lack of integration between the representation of term presence through color bars and the representation and manipulation of sets through use of the bins and the separator.
The Interfaces
Bird and J. BIRD – query formulation tools
The original version of this interface is called BIRD (Browsing Interface for Retrieval of Documents) [4, 5]. Figure 1 shows a typical BIRD display. Its primary function is to support query formation. The Boolean operators are notoriously difficult to use correctly, even for users who are mathematically trained. The basic idea behind the BIRD interface is to represent documents as sets; the AND operator is effected by a device that takes a funnel shape. The OR and NOT operators are implied from the layout of a 2 x 2 output matrix.
The J. Bird interface is shown in Figure 2. Simplification was achieved by representing documents as sheets of paper, the funnel was replaced by a sorter and the matrix by 2 bins indicating 'matches' and 'does not match.’ To use the device a user selects a query term, then drags the stack of unsorted papers to the sorter. The results are displayed as proportional sized stacks in the output bins.
VIBE and WebVIBE -- inferencing tools
Korfhage and Olsen [6, 11] developed VIBE (Visual Information Browsing Environment) at the University of Pittsburgh. VIBE (Figure 3) represents points of interest (POIs) or query terms as nodes. Documents are represented by various sized rectangles suspended within the surface bounded by the POls. The internal representation of a document is a term vector, which is usually weighted, and the interface position of the document is calculated as the vector sum of the components indicated by the POls. Seven special-purpose modes are hidden in the menu system.
WebVIBE (Figure 4) retains much of the appearance of the original VIBE but the metaphor has been changed to a naive physics view of magnetism. The user is called upon to view the POIs as magnets and the documents are suspended in a magnetic force field. The position of a particular document can be thought of as being determined by the relative 'pull' of terms that are present in the document.
Methods
Both J. Bird and WebVIBE were developed using an iterative design approach and were tested after each of two design phases. Task sets were developed for each interface. Subjects were recruited from the University community and were paid for participating in the study. Participants were given a printed task list that included spaces for answers. No verbal or written instructions were given. Subjects were encouraged to talk aloud during the experiment and these verbal protocols were monitored by one of the investigators. Each subject was also given a post-test questionnaire. This survey was constructed to determine which features were problematic and to gather general impressions about system usability and attractiveness as a search tool.
Results
J. Bird
The migration from BIRD to J. Bird entailed a total reorganization of the interface including a remapping of the machine sort metaphor. The first test of the interface brought up several classes of perceived shortcomings. These issues were addressed in the subsequent redesign. The second test showed a substantially improved system as indicated by the post-test questionnaire. The most interesting outcome of the testing, however, was the results of the actual task performance. Although 8 of the 10 test subjects indicated that they believed that they understood how the Boolean operators worked, only 1 of the 10 could successfully apply them to two problems given in the post-test. These subjects, however, performed Boolean operations with the J. Bird interface with over 90% correct answers!
WebVIBE
The process of simplifying VIBE to produce the WebVIBE interface was distinctly different from that used for BIRD. The difference lies in using magnetism as a metaphor bolstered by portraying POIs with magnet icons. Behavior of the interface was changed to enhance this metaphor by animating influenced documents to move with the movement of a POI rather than postponing movement until the mouse button was released. The special-purpose modes and features were eliminated from the interface in order to make it more accessible and less confusing to naive walk-up users. Subjects found document animation to be very useful in performing tasks that required them to discriminate between documents that were positioned in regions that were potentially ambiguous.
Discussion
Overall, task performance was similar on both trials for the pair of tests. The differences between the first and second round of testing were highly salient for the post-test questionnaires. For both interfaces there were decreases in negative comments and increases in positive ones at the second test. Considering that there were no instructions for using the interface apart from those that were embedded in the view, the fact that novice searchers could accurately form queries using J. Bird and could correctly form desired inferences using WebVIBE attests to the utility of these interfaces for their intended audience.
References
Figure 1. Fully-featured BIRD interface showing a term list box, result matrix, separating funnel and partial set storage bins. This is the initial presentation; no document set has yet been opened.
Figure 2. J. Bird Web Application showing a set of documents waiting to be sorted into documents that match a query and those that do not. The ‘criteria’ box has popped up but the user has not yet selected any terms.
Figure 3. Fully-featured VIBE display showing the results of a query on five terms. Several document icons have been lassoed and the titles of those documents are shown in the popup dialog box. The most noticeable features of the application are the prominent multi-item menu and tool bars.
Figure 4. WebVIBE Application. The display shows the results of a five term query. Two POI icons are highlighted in response to the sprite’s moving over a document that had these terms as relevant attributes. Notice the simplified menu and tool bars when compared with Figure 3.