Identifying Useful Passages in Documents Based on Annotation Patterns

Abstract

Many readers annotate passages that are important to their work. If we
understand the relationship between the types of marks on a passage and the passage’s
ultimate utility in a task, then we can design e-book software to facilitate
access to the most important annotated parts of the documents. To investigate
this hypothesis and to guide software design, we have analyzed annotations collected
during an earlier study of law students reading printed case law and
writing Moot Court briefs. This study has allowed us to characterize the relationship
between the students’ annotations and the citations they used in their final
written briefs. We think of annotations that relate directly to the written brief as
high-value annotations; these annotations have particular, detectable characteristics.
Based on this study, we have designed a mark parser that analyzes freeform
digital ink to identify such high-value annotations.