ABSTRACT

Many existing information retrieval (IR) systems are surprisingly ineffective at finding documents relevant to particular topics. Traditional systems are extremely brittle, failing to retrieve relevant documents unless the user's exact search string is found. They support only the most primitive trial-and-error interaction with their users and are also static. Even systems with so-called "relevance feedback" are incapable of learning from experience with users. SCALIR (a Symbolic and Connectionist Approach to Legal Information Retrieval) -- a system for assisting research on copyright law -- has been designed to address these problems. By using a hybrid of symbolic and connectionist artificial intelligence techniques, SCALIR develops a conceptual representation of document relationships without explicit knowledge engineering. SCALIR's direct manipulation interface encourages users to browse through the space of documents. It then uses these browsing patterns to improve its performance by modifying its representation, resulting in a communal repository of expertise for all of its users.

SCALIR's representational scheme also mirrors the hybrid nature of the Anglo-American legal system. While certain legal concepts are precise and rule-like, others -- which legal scholars call "open-textured" -- are subject to interpretation. The meaning of legal text is established through the parallel and distributed precedence-based judicial appeal system. SCALIR represents documents and terms as nodes in a network, capturing the duality of the legal system by using symbolic (semantic network) and connectionist links. The former correspond to a priori knowledge such as the fact that one case overturned another on appeal. The latter correspond to statistical inferences such as the relevance of a term describing a case. SCALIR's text corpus includes all federal cases on copyright law.

The hybrid representation also suggests a way to resolve the apparent incompatibility between the two prominent paradigms in artificial intelligence, the "classical" symbol-manipulation approach and the neurally-inspired connectionist approach. Part of the book focuses on a characterization of the two paradigms and an investigation of when and how -- as in the legal research domain -- they can be effectively combined.

chapter 1|10 pages

Introduction

chapter 2|25 pages

Humans, Computers, and Finding Information

chapter 4|33 pages

Approaches to Information Retrieval

chapter 6|25 pages

Hybrid Vigor

chapter 7|20 pages

The Structure of SCALIR

chapter 8|29 pages

The Retrieval Process

chapter 9|24 pages

Feedback and Learning

chapter 10|25 pages

Interacting With SCALIR

chapter 11|24 pages

Performance Evaluation

chapter 12|13 pages

Discussion