Publications

FXPAL publishes in top scientific conferences and journals.

2003
Publication Details
  • Proceedings of INTERACT '03, pp. 583-590.
  • Sep 1, 2003

Abstract

Close
In a meeting room environment with multiple public wall displays and personal notebook computers, it is possible to design a highly interactive experience for manipulating and annotating slides. For the public displays, we present the ModSlideShow system with a discrete modular model for linking the displays into groups, along with a gestural interface for manipulating the flow of slides within a display group. For the applications on personal devices, an augmented reality widget with panoramic video supports interaction among the various displays. This widget is integrated into our NoteLook 3.0 application for annotating, capturing and beaming slides on pen-based notebook computers.
Publication Details
  • Proceedings of Hypertext '03, pp. 124-125
  • Aug 26, 2003

Abstract

Close
Existing hypertext systems have emphasized either the navigational or spatial expression of relationships between objects. We are exploring the combination of these modes of expression in Hyper-Hitchcock, a hypervideo editor. Hyper-Hitchcock supports a form of hypervideo called "detail-on-demand video" due to its applicability to situations where viewers need to take a link to view more details on the content currently being presented. Authors of detail-on-demand video select, group, and spatially arrange video clips into linear sequences in a two-dimensional workspace. Hyper-Hitchcock uses a simple spatial parser to determine the temporal order of selected video clips. Authors add navigational links between the elements in those sequences. This combination of navigational and spatial hypertext modes of expression separates the clip sequence from the navigational structure of the hypervideo. Such a combination can be useful in cases where multiple forms of inter-object relationships must be expressed on the same content.

Identifying Useful Passages in Documents based on Annotation Patterns.

Publication Details
  • 7th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2003) Trondheim, Norway, August 17-22, 2003
  • Aug 17, 2003

Abstract

Close
Many readers annotate passages that are important to their work. If we understand the relationship between the types of marks on a passage and the passage's ultimate utility in a task, then we can design e-book software to facilitate access to the most important annotated parts of the documents. To investigate this hypothesis and to guide software design, we have analyzed annotations collected during an earlier study of law students reading printed case law and writing Moot Court briefs. This study has allowed us to characterize the relationship between the students' annotations and the citations they use in their final written briefs. We think of annotations that relate directly to the written brief as high-value annotations; these annotations have particular, detectable characteristics. Based on this study we have designed a mark parser that analyzes freeform digital ink to identify such high-value annotations.

Discourse Structure and Sentential Information Structure An Initial Proposal

Publication Details
  • Journal of Logic, Language and Information, Kluwer Academic Publishers, Dordrecht, The Netherlands
  • Aug 15, 2003

Abstract

Close
In this article we argue that discourse structure constrains the set of possible constituents in a discourse that can provide the relevant context for structuring information in a target sentence, while information structure critically constrains discourse structure ambiguity. For the speaker, the discourse structure provides a set of possible contexts for continuation while information structure assignment is independent of discourse structure. For the hearer, the information structure of a sentence together with discourse structure instructs dynamic semantics how rhematic information should be used to update the meaning representation of the discourse (Polanyi and van den Berg, 1996).
Publication Details
  • IEEE International Conference on Multimedia and Expo, v. I, pp. 221-224
  • Jul 7, 2003

Abstract

Close
A novel method is presented for inaudibly hiding information in an audio signal by subtly applying time-scale modification to segments of the signal. The sequence, duration, and degree of the time-scale modifications are the parameters which encode information in the altered signal. By comparing the altered signal with a reference copy, compressed and expanded regions can be identified and the hidden data recovered. This approach is novel and has several advantages over other methods: it is theoretically noiseless, it introduces no spectral distortion, and it is robust to all known methods of reproduction, compression, and transmission.
Publication Details
  • IEEE International Conference on Multimedia and Expo, v. II, pp. 77-80
  • Jul 7, 2003

Abstract

Close
We created an improved layout algorithm for automatically generating visual video summaries reminiscent of comic book pages. The summaries are comprised of images from the video that are sized according to their importance. The algorithm performs a global optimization with respect to a layout cost function that encompasses features such as the number of resized images and the amount of whitespace in the presentation. The algorithm creates summaries that: always fit exactly into the requested area, are varied by containing few rows with images of the same size, and have little whitespace at the end of the last row. The layout algorithm is fast enough to allow the interactive resizing of the summaries and the subsequent generation of a new layout.
Publication Details
  • IEEE International Conference on Multimedia and Expo, v. II, pp. 753-756
  • Jul 7, 2003

Abstract

Close
We created an alternative approach to existing video summaries that gives viewers control over the summaries by selecting hyperlinks to other video with additional information. We structure such summaries as "detail-on-demand" video, a subset of general hypervideo in which at most one link to another video sequence is available at any given time. Our editor for such video, Hyper-Hitchcock, provides a workspace in which an author can select and arrange video clips, generate composites from clips and from other composites, and place links between composites. To simplify dealing with a large number of clips, Hyper-Hitchcock generates iconic representations for composites that can be used to manipulate the composite as a whole. In addition to providing an authoring environment, Hyper-Hitchcock can automatically generate multi-level hypervideo summaries for immediate use or as the starting point for author modification.
Publication Details
  • 2003 International Conference on Multimedia and Expo
  • Jul 6, 2003

Abstract

Close
This paper presents an information-driven audiovisual signal acquisition approach. This approach has several advantages: users are encouraged to assist in signal acquisition; available sensors are managed based on both signal characteristics and users' suggestions. The problem formulation is consistent with many well-known empirical approaches widely used in previous systems and may provide analytical explanations to these approaches. We demonstrate the use of this approach to pan/tilt/zoom (PTZ) camera management with field data.
Publication Details
  • HCI International 2003
  • Jun 22, 2003

Abstract

Close
A basic objective of ubiquitous computing research is ubiquitous information: the ability to utilize any content or service, using devices that are always at hand, over networks that don't tie us down. Although much progress has been made, the ideal remains elusive. This paper reflects on the interrelations among three dimensions of ubiquitous information: content, devices, and networks. We use our understanding of these dimensions to motivate our own attempt to create a ubiquitous information system by combining unlimited World Wide Web content with mobile phones and mobile phone networks. We briefly describe a middleware proxy system we developed to increase the usefulness of very small devices as Internet terminals. We conclude with a post-mortem analysis highlighting lessons learned for others interested in information systems for very small devices.
Publication Details
  • HCI International 2003
  • Jun 22, 2003

Abstract

Close
Everywhere we go, we are surrounded by shared devices: TVs, stereos, and appliances in the home; copiers, fax machines, and projectors in the office; phones and vending machines in public. Because these devices don't know who we are, they provide the same user interface and functionality to everyone. This paper describes a system for personalizing workplace document devices- projectors, public displays, and multi-function copiers-that has been in use for over two years in our organization. We compare user interfaces that are embedded (i.e., integrated or co-located with the shared device) versus portable (i.e., accessible via portable devices such as mobile phones or PDAs). We summarize lessons learned for others designing interfaces for shared ubiquitous devices.
Publication Details
  • Business Process Management Journal, Volume 9, Number 3, 2003, pages 337-353
  • Jun 9, 2003

Abstract

Close
Purveyors of knowledge management software have a disconcerting tendency to promote the myth that all problems may be solved by more powerful tools for the exchange of information in the workplace. This fallacy is based on the faulty assumption that knowledge management is about the management of knowledge (as if knowledge were a commodity that could be managed), as opposed to the management of people whose work depends critically on what they know. The origins of knowledge management are far more firmly rooted in the psychological legacy of organizational communication than they are in the technological legacy of information management systems. However, even organizational communication is an inadequate foundation, since various schools of thought in social theory, particularly the structuration theory of Anthony Giddens, inform us that interaction (in the workplace or in any other social setting) is not strictly limited to communication. Knowledge management thus requires moving beyond simplistic models of information exchange to more challenging problems of leveraging social interaction to the advantage of the enterprise. This paper focuses on the claim of structuration theory that the dimension of communication should be supplemented with additional dimensions of power and sanction. This perspective is then examined in light of a case study of crisis management practices, and the case study provides a basis for addressing implications for technological support.

Agent Supported Cooperative Work.

Publication Details
  • Mass,USA: Kluwer Academic Publishers, 2003
  • Jun 1, 2003

Abstract

Close
This is a volume, edited by Ye and Churchill. The chapters detail the design of agent-baed technologies in service of collaborative and cooperative work practices.

AttrActive Windows: Dynamic Windows for Digital Bulletin Boards

Publication Details
  • CHI 2003
  • Apr 7, 2003

Abstract

Close
In this paper we describe AttrActive Windows, a novel interface for presenting live, interactive, multimedia content on a network of public, digital, bulletin boards. Implementing a paper flyer metaphor, AttrActive Windows are paper-like in appearance and are attached to a virtual corkboard by virtual pushpins. Windows can therefore appear in different orientations, creating an attractive, informal look. Attractive Windows can also have autonomous behaviors that are consistent with the corkboard metaphor, like fluttering in the wind. We describe the AttrActive Windows prototype, and offer the results of an initial evaluative user study.
Publication Details
  • CHI 2003
  • Apr 7, 2003

Abstract

Close
Shared freeform input is a technique for facilitating note taking across devices during a meeting. Laptop users enter text with a keyboard, whereas PDA and Tablet PC users input freeform ink with their stylus. Users can quickly reuse text and freeform ink already entered by others. We show how a new technique, freeform pasting, allowed us to deal with a variety of design issues such as quick and informal ink sharing, screen real estate, privacy and mixing ink-based and textual material.
Publication Details
  • Proc. SPIE Storage and Retrieval for Multimedia Databases, Vol. 5021, pp. 167-75
  • Jan 20, 2003

Abstract

Close
We present a framework for analyzing the structure of digital media streams. Though our methods work for video,text,and audio,we concentrate on detecting the structure of digital music files. In the first step,spectral data is used to construct a similarity matrix calculated from inter-frame spectral similarity. The digital audio can be robustly segmented by correlating a ernel along the diagonal of the similarity matrix. Once segmented, spectral statistics of each segment are computed.In the second step,segments are clustered based on the self- similarity of their statistics. This reveals the structure of the digital music in a set of segment boundaries and labels.Finally,the music can be summarized by selecting clusters with repeated segments throughout the piece. The summaries can be customized for various applications based on the structure of the original music.

AttrActive Windows: Active Windows for Pervasive Computing Applications

Publication Details
  • ACM Intelligent User Interface (IUI) 2003, Miami Beach, FL, pp 326
  • Jan 12, 2003

Abstract

Close
We introduce the AttrActive Windows user interface, a novel approach for presenting interactive content on large screen, interactive, digital, bulletin boards. Moving away from the desktop metaphor, AttrActive Windows are dynamic, non-uniform windows that can appear in different orientations and have autonomous behaviours to attract passers-by and invite interactions.
2002
Publication Details
  • IEEE Multimedia Signal Processing Workshop
  • Dec 11, 2002

Abstract

Close
We present a novel approach to automatically ex-tracting summary excerpts from audio and video. Our approach is to maximize the average similarity between the excerpt and the source. We first calculate a similarity matrix by comparing each pair of time samples using a quantitative similarity measure. To determine the segment with highest average similarity, we maximize the summation of the self-similarity matrix over the support of the segment. To select multiple excerpts while avoiding redundancy, we compute the non-negative matrix factorization (NMF) of the similarity matrix into its essential structural components. We then build a summary comprised of excerpts from the main components, selecting the excerpts for maximum average similarity within each component. Variations integrating segmentation and other information are also discussed, and experimental results are presented.
Publication Details
  • ACM Multimedia 2002
  • Dec 1, 2002

Abstract

Close
We present methods for automatic and semi-automatic creation of music videos, given an arbitrary audio soundtrack and source video. Significant audio changes are automatically detected; similarly, the source video is automatically segmented and analyzed for suitability based on camera motion and exposure. Video with excessive camera motion or poor contrast is penalized with a high unsuitability score, and is more likely to be discarded in the final edit. High quality video clips are then automatically selected and aligned in time with significant audio changes. Video clips are adjusted to match the audio segments by selecting the most suitable region of the desired length. Besides a fully automated solution, our system can also start with clips manually selected and ordered using a graphical interface. The video is then created by truncating the selected clips (preserving the high quality portions) to produce a video digest that is synchronized with the soundtrack music, thus enhancing the impact of both.
Publication Details
  • ACM Multimedia 2002
  • Dec 1, 2002

Abstract

Close
FlySPEC is a video camera system designed for real-time remote operation. A hybrid design combines the high resolution possible using an optomechanical video camera, with the wide field of view always available from a panoramic camera. The control system integrates requests from multiple users with the result that each controls a virtual camera. The control system seamlessly integrates manual and fully automatic control. It supports a range of options from untended automatic to full manual control, and the system can learn control strategies from user requests. Additionally, the panoramic view is always available for an intuitive interface, and objects are never out of view regardless of the zoom factor. We present the system architecture, an information-theoretic approach to combining panoramic and zoomed images to optimally satisfy user requests, and experimental results that show the FlySPEC system significantly assists users in a remote inspection tasks.
Publication Details
  • ACM 2002 Conference on Computer Supported Cooperative Work
  • Nov 16, 2002

Abstract

Close
Technology can play an important role in enabling people to interact with each other. The Web is one such technology with the affordances for sharing information and for connecting people to people. In this paper, we describe the design of two social interaction Web sites for two different social groups. We review several related efforts to provide principles for creating social interaction environments and describe the specific principles that guided our design. To examine the effectiveness of the two sites, we analyze the usage data. Finally, we discuss approaches for encouraging participation and lessons learned.