Publications

FXPAL publishes in top scientific conferences and journals.

2002
Publication Details
  • Journal of Library Administration, 35:1-2, 99-123, Haworth
  • Jun 7, 2002

Abstract

Close
In the emerging world of electronic publishing how we create, distribute, and read books will be in a large part determined by an underlying framework of content standards that establishes the range of technological opportunities and constraints for publishing and reading systems. But efforts to develop content standards based on sound engineering models must skillfully negotiate competing and sometimes apparently irreconcilable objectives if they are to produce results relevant to the rapidly changing course of technology. The Open eBook Forum's Publication Structure, an XML-based specification for electronic books, is an example of the sort of timely and innovative problem solving required for successful real-world standards development. As a result of this effort, the electronic book industry will not only happen sooner and on a larger scale than it would have otherwise, but the electronic books it produces will be more functional, more interoperable, and more accessible to all readers. Public interest participants have a critical role in this process.
Publication Details
  • CHI 2002
  • Apr 22, 2002

Abstract

Close
Shared text input is a technique we implemented into a note taking system for facilitating text entry on small devices. Instead of writing out words on the tedious text entry interfaces found on handheld computers, users can quickly reuse words and phrases already entered by others. Sharing notes during a meeting also increases awareness among note takers. We found that filtering the text to share was appropriate to deal with a variety of design issues such as screen real estate, scalability, privacy, reciprocity, and predictability of text location
Publication Details
  • CHI 2002
  • Apr 22, 2002

Abstract

Close
In this paper, we describe an evaluation of the Palette, a presentation tool that was reported at CHI '99. The Palette allows presenters to quickly access digital presentations using physical cards that have unique barcodes printed on them. The Palette has been in use in our lab for over three years, and has been released as a product in Japan. Our evaluation consists of an analysis of usage logs, an expert walkthrough review, and observations and interviews with users, non-users and the system administrator. The findings reveal benefits and drawbacks of the technology, and offers design ideas for further work on tangible tools of this kind.
Publication Details
  • International Journal of Human-Computer Studies, 56, pp. 75-107
  • Feb 1, 2002

Abstract

Close
We describe our experiences with the design, implementation, deployment, and evaluation of a Portholes tool which provides group and collaboration awareness through the Web. The research objective was to explore how such a system would improve communication and facilitate a shared understanding among distributed development groups. During the deployment of our Portholes system, we conducted a naturalistic study by soliciting user feedback and evolving the system in response. Many of the initial reactions of potential users indicated that our system projected the wrong image so that we designed a new version that provided explicit cues about being in public and who is looking back to suggest a social rather than information interface. We implemented the new design as a Java applet and evaluated design choices with a preference study. Our experiences with different Portholes versions and user reactions to them provide insights for designing awareness tools beyond Portholes systems. Our approach is for the studies to guide and to provide feedback for the design and technical development of our system.
2001
Publication Details
  • In Workshop on Identifying Objects Across Variations in Lighting: Psychophysics & Computation, Proc. IEEE Intl. Conf. on Computer Vision & Pattern Recognition 2001.
  • Dec 12, 2001

Abstract

Close
In this paper, we document an extension to traditional pattern-theoretic object templates to jointly accommodate variations in object pose and in the radiant appearance of the object surface. We first review classical object templates accommodating pose variation. We then develop an efficient subspace representation for the object radiance indexed on the surface of the three dimensional object template. We integrate the low-dimensional representation for the object radiance, or signature, into the pattern-theoretic template, and present the results of orientation estimation experiments. The experiments demonstrate both estimation performance fluctuations under varying illumination conditions and performance degradations associated with unknown scene illumination. We also present a Bayesian approach for estimation accommodating illumination variability.

Work/place: mobile technologies and arenas of activity

Publication Details
  • ACM SIGGROUP Bulletin, Volume 22, Issue 3, Pp3-9, Publisher ACM Press, New York, NY, USA
  • Dec 8, 2001

Abstract

Close
The increasing number of wireless, portable devices has led inevitably to lyrical rhetorics of business cost-cutting and increased efficiency as workers can be productive while on the and offices become streamlined areas of efficient activity. In this short paper, we raise a number if issues that have been appearing in common discourses the (most) modern office, and the impact of wireless technologies thereupon. We also present an overview of a workshop held at ECSCW in Bonn in September of 2001 on this topic, giving an overview of the comments and discussions that took place at the workshop.

Framing Mobile Collaborations and Mobile Technologies.

Publication Details
  • In B. Brown, N. Green, R. Harper (Eds.) Wireless World: Social and Interactional Aspects of Wireless Technology, London, UK: Springer-Verlag.
  • Dec 1, 2001

Abstract

Close
Recent years have seen a marked increase in the production and promotion of portable, wireless communication devices: mobile phones with internet access, wireless PDAs such as the Palm VII and smart pagers such as RIM's 850 and 950. Some claim the presence of such devices in the hands, bags and pockets of so many people heralds a new world of work in which people can be reached and information accessed "anywhere, anytime". Whether or not access to information in itself can promote new working practices, individuals whose lives revolve around movement between work sites have been singled out as an obvious market for such portable wireless communication devices. Using these devices such “mobile workers” can be in touch with colleagues, collaborators and clients "24/7", and still sustain non-work social relationships due, apparently, to their constant connectedness whilst mobile. In this chapter we have two goals. The first is to address the design of mobile technologies. This second is to illustrate our design approach, wherein we consider local practices of technology use, but also the broader cultural context in which technologies are designed, produced, bought, sold, used and redesigned. Our ultimate design aim is to build upon existing practices, but also to consider possibilities for the development of innovative technologies that enable new, complementary, practices.
Publication Details
  • In Proceedings of the International Conference on Image Processing, Thessaloniki, Greece. October 7-10, 2001.
  • Oct 7, 2001

Abstract

Close
In this paper, we present a novel framework for analyzing video using self-similarity. Video scenes are located by analyzing inter-frame similarity matrices. The approach is flexible to the choice of similarity measure and is robust and data-independent because the data is used to model itself. We present the approach and its application to scene boundary detection. This is shown to dramatically outperform a conventional scene-boundary detector that uses a histogram-based measure of frame difference.
Publication Details
  • Proceedings of ACM Multimedia 2001, Ottawa, Canada, Oct. 5, 2001.
  • Oct 5, 2001

Abstract

Close
Given rapid improvements in storage devices, network infrastructure and streaming-media technologies, a large number of corporations and universities are recording lectures and making them available online for anytime, anywhere access. However, producing high-quality lecture videos is still labor intensive and expensive. Fortunately, recent technology advances are making it feasible to build automated camera management systems to capture lectures. In this paper we report our design of such a system, including system configuration, audio-visual tracking techniques, software architecture, and user study. Motivated by different roles in a professional video production team, we have developed a multi-cinematographer single-director camera management system. The system performs lecturer tracking, audience tracking, and video editing all fully automatically, and offers quality close to that of human-operated systems.
Publication Details
  • Proc. ACM Multimedia 2001, Ottawa,CA, Oct. 2001.
  • Sep 30, 2001

Abstract

Close
We describe a system called FlyAbout which uses spatially indexed panoramic video for virtual reality applications. Panoramic video is captured by moving a 360° camera along continuous paths. Users can interactively replay the video with the ability to view any interesting object or choose a particular direction. Spatially indexed video gives the ability to travel along paths or roads with a map-like interface. At junctions, or intersection points, users can chose which path to follow as well as which direction to look, allowing interaction not available with conventional video. Combining the spatial index with a spatial database of maps or objects allows users to navigate to specific locations or interactively inspect particular objects.
Publication Details
  • Proc. International Conference on Computer Music (ICMC), Habana, Cuba, September 2001.
  • Sep 12, 2001

Abstract

Close
This paper presents a novel approach to visualizing the time structure of musical waveforms. The acoustic similarity between any two instants of an audio recording is displayed in a static 2D representation, which makes structural and rhythmic characteristics visible. Unlike practically all prior work, this method characterizes self-similarity rather than specific audio attributes such as pitch or spectral features. Examples are presented for classical and popular music.
Publication Details
  • IEEE Computer, 34(9), pp. 61-67
  • Sep 1, 2001

Abstract

Close

To meet the diverse needs of business, education, and personal video users, the authors developed three visual interfaces that help identify potentially useful or relevant video segments. In such interfaces, keyframes-still images automatically extracted from video footage-can distinguish videos, summarize them, and provide access points. Well-chosen keyframes enhance a listing's visual appeal and help users select videos. Keyframe selection can vary depending on the application's requirements: A visual summary of a video-captured meeting may require only a few highlight keyframes, a video editing system might need a keyframe for every clip, while a browsing interface requires an even distribution of keyframes over the video's full length. The authors conducted user studies for each of their three interfaces, gathering input for subsequent interface improvements. The studies revealed that finding a similarity measure for collecting video clips into groups that more closely match human perception poses a challenge. Another challenge is to further improve the video-segmentation algorithm used for selecting keyframes. A new version will provide users with more information and control without sacrificing the interface's ease of use.

Recording the Region of Interest from FlyCam Panoramic Video

Publication Details
  • Proc. International Conference on Image Processing, Thessaloniki, Greece, September 2001.
  • Sep 1, 2001

Abstract

Close
A novel method for region of interest tracking and recording video is presented. The proposed method is based on the FlyCam system, which produces high resolution and wide-angle video sequences by stitching the video frames from multiple stationary cameras. The method integrates tracking and recording processes, and targets applications such as classroom lectures and video conferencing. First, the region of interest (which typically covers the speaker) is tracked using a Kalman filter. Then, the Kalman filter estimation results are used for virtual camera control and to record the video. The system has no physical camera motion and the virtual camera parameters are readily available for video indexing. The proposed system has been implemented for real time recording of lectures and presentations.

The Beat Spectrum: A New Approach to Rhythm Analysis

Publication Details
  • In Proceedings of the International Conference on Multimedia and Expo 2001 (ICME), Tokyo, Japan. August 22-25, 2001.
  • Aug 25, 2001

Abstract

Close
We introduce the beat spectrum, a new method of automatically characterizing the rhythm and tempo of music and audio. The beat spectrum is a measure of acoustic self-similarity as a function of time lag. Highly structured or repetitive music will have strong beat spectrum peaks at the repetition times. This reveals both tempo and the relative strength of particular beats, and therefore can distinguish between different kinds of rhythms at the same tempo. We also introduce the beat spectrogram which graphically illustrates rhythm variation over time. Unlike previous approaches to tempo analysis, the beat spectrum does not depend on particular attributes such as energy or frequency, and thus will work for any music or audio in any genre. We present tempo estimation results for a variety of musical genres, which are accurate to within 1%. This approach has a variety of applications, including music retrieval by similarity and automatically generating music videos.
Publication Details
  • In Proceedings of Conference on Modeling and Design of Wireless Networks (ITCOM2001), Denver, Colorado, August 23-24 August 2001.
  • Aug 23, 2001

Abstract

Close
This paper reports our design, and implementation of an automatic lecture-room camera-management system. The motivation for building this system is to facilitate online lecture access and reduce the expense of producing high quality lecture videos. The goal of this project is a camera-management system that can perform as a human video-production team. To achieve this goal, our system collects audio/video signals available in the lecture room and uses the multimodal information to direct our video cameras to interesting events. Compared to previous work--which has tended to be technology centric--we started with interviews with professional video producers and used their knowledge and expertise to create video production rules. We then targeted technology components that allowed us to implement a substantial portion of these rules, including the design of a virtual video director, a speaker cinematographer, and an audience cinematographer. The complete system is installed in parallel with a human-operated video production system in a middle-sized corporate lecture room, and used for broadcasting lectures through the web. The systemí*s performance was compared to that of a human operator via a user study. Results suggest that our system's quality is close to that of a human-controlled system.

The impact of text browsing on text retrieval performance

Publication Details
  • Information Processing and Management 37 (3) pp. 507-520
  • Aug 21, 2001

Abstract

Close
The results from a series of three experiments that used Text Retrieval Conference (TREC) data and TREC search topics are compared. These experiments each involved three novel user interfaces (one per experiment). User interfaces that made it easier for users to view text were found to improve recall in all three experiments. A distinction was found between a cluster of subjects (a majority of whom were search experts) who tended to read fewer documents more carefully (readers, or exclusives) and subjects who skimmed through more documents without reading them as carefully (skimmers, or inclusives). Skimmers were found to have significantly better recall overall. A major outcome from our experiments at TREC and with the TREC data, is that hypertext interfaces to information retrieval (IR) tasks tend to increase recall. Our interpretation of this pattern of results across the three experiments is that increased interaction with the text (more pages viewed) generally improves recall. Findings from one of the experiments indicated that viewing a greater diversity of text on a single screen (i.e., not just more text per se, but more articles available at once) may also improve recall. In an experiment where a traditional (type-in) query interface was contrasted with a condition where queries were marked up on the text, the improvement in recall due to viewing more text was more pronounced with search novices. Our results demonstrate that markup and hypertext interfaces to text retrieval systems can benefit recall and can also benefit novices. The challenge now will be to find modified versions of hypertext interfaces that can improve precision, as well as recall and that can work with users who prefer to use different types of search strategy or have different types of training and experience.