Publications

FXPAL publishes in top scientific conferences and journals.

2004
Publication Details
  • Proceedings of 2004 IEEE International Conference on Multimedia and Expo (ICME 2004)
  • Jun 27, 2004

Abstract

Close
Using a machine to assist remote environment management can save people's time, effort, and traveling cost. This paper proposes a trainable mobile robot system, which allows people to watch a remote site through a set of cameras installed on the robot, drive the platform around, and control remote devices using mouse or pen based gestures performed in video windows. Furthermore, the robot can learn device operations when it is being used by humans. After being used for a while, the robot can automatically select device control interfaces, or launch a pre-defined operation sequence based on its sensory inputs.
Publication Details
  • Proceedings of 2004 IEEE International Conference on Multimedia and Expo (ICME 2004)
  • Jun 27, 2004

Abstract

Close
Many conference rooms are now equipped with multiple multi-media devices, such as plasma displays and surrounding speakers, to enhance presentation quality. However, most existing presentation authoring tools are based on the one-display-and-one-speaker assumption, which makes it difficult to organize and playback a presentation dispatched to multiple devices, thus hinders users from taking full advantage of additional multimedia devices. In this paper, we propose and implement a tool to facilitate authoring and playback of a multi-channel presentation in a media devices distributed environment. The tool, named PreAuthor, provides an intuitive and visual way to author a multi-channel presentation by dragging and dropping "hyper-slides" on corresponding visual representations of various devices. PreAuthor supports "hyper-slide" synchronization among various output devices during preview and playback. It also offers multiple options for the presenter to view the presentation in a rendered image sequence, live video, 3D VRML model, or real environment.
Publication Details
  • JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms
  • Jun 22, 2004

Abstract

Close
For some years, our group at FX Palo Alto Laboratory has been developing technologies to support meeting recording, collaboration, and videoconferencing. This paper presents a few of our more interesting research directions. Many of our systems use a video image as an interface, allowing devices and information to be accessed "through the screen." For example, SPEC enables hybrid collaborative and automatic camera control through an active video window. The NoteLook system allows a user to grab an image from a computer display, annotate it with digital ink, then drag it to that or a different display, while automatically generating timestamps for later video review. The ePIC system allows natural use and control of multi-display and multi-device presentation spaces, and the iLight system allows remote users to "draw" with light on a local object. All our systems serve as platforms for researching more sophisticated algorithms that will hopefully support additional advanced functions and ease of use.
Publication Details
  • ED-Media 2004
  • Jun 21, 2004

Abstract

Close
This paper presents a system designed to support note taking by students on wirelessly connected PDAs in a classroom. The system leverages the devices' wireless connectivity to allow students to share their notes in real time and quickly reuse words from their fellow note takers. In addition, presentation material such as Powerpoint slides is also extracted when presented by the instructor, giving students further means for reusing words. We describe the system and report our findings on an initial user study where the system was used for four months during a graduate level course.
Publication Details
  • Journal of Human Interface Society, 6(2), pp. 51-58
  • Jun 1, 2004

Abstract

Close
FX Palo Alto Laboratory provides multimedia and information technology research for the Fuji Xerox corporation based in Tokyo, Japan. FXPAL's mission is to help Fuji Xerox with a digital information technology infrastructure to support services in Fuji Xerox's Open Office Frontier. Our research spans interactive media, immersive conferencing, social computing, mobile and adaptive computing, natural language inquiry, and emerging technologies such as quantum computing and bioinformatics. Our research methods combine determining user needs, inventing new technologies, building prototype systems, informing professional communities, and transferring technology to Fuji Xerox. The physical distance between our laboratory and our parent company makes it natural for us to research problems with collaborations across time zones and cultures. To address these problems, to test our ideas, and to prepare for technology transfers, we actively create prototype systems for interactive media, immersive conferencing, and social and mobile computing. We also foster collaboration with our Japanese colleagues through a combination of face-to-face visits and both synchronous and asynchronous remote communication.
Publication Details
  • Proceedings of the Working Conference on Advanced Visual Interfaces, AVI 2004, pp. 290-297
  • May 25, 2004

Abstract

Close
We introduced detail-on-demand video as a simple type of hypervideo that allows users to watch short video segments and to follow hyperlinks to see additional detail. Such video lets users quickly access desired information without having to view the entire contents linearly. A challenge for presenting this type of video is to provide users with the appropriate affordances to understand the hypervideo structure and to navigate it effectively. Another challenge is to give authors tools that allow them to create good detail-on-demand video. Guided by user feedback, we iterated designs for a detail-on-demand video player. We also conducted two user studies to gain insight into people's understanding of hypervideo and to improve the user interface. We found that the interface design was tightly coupled to understanding hypervideo structure and that different designs greatly affected what parts of the video people accessed. The studies also suggested new guidelines for hypervideo authoring.
Publication Details
  • ACM Interactions Magazine
  • May 1, 2004

Abstract

Close
This article describes two years of experience with a research prototype for personalizing shared workplace devices such as projectors, public displays, and multi-function copiers. The system combines users' networked resources-or "personal information clouds"—with device-specific user interfaces for performing common device tasks. We developed and compared personal interfaces that are embedded (i.e., integrated or co-located with the shared device) and portable (i.e., accessible via personal devices such as mobile phones and PDAs). Our experience indicates that a little personalization can go a long way toward improving user friendliness, efficiency, and capabilities of shared document devices, helping them "weave themselves into the fabric of everyday life". We also gained important insights into subtle differences between embedded and portable approaches to ubiquitous computing systems.

A Rule Based Approach to Discourse Parsing

Publication Details
  • Proceedings of the 5th SIGdial Workshop in Discourse And Dialogue. Cambridge, MA USA pp. 108-117.
  • May 1, 2004

Abstract

Close
In this paper we present recent developments in discourse theory and parsing under the Linguistic Discourse Model framework, a semantic theory of discourse structure. We give a novel approach to the problems of discourse segmentation based on discourse semantics and sketch a limited but robust approach to symbolic discourse parsing based on syntactic, semantic and lexical rules. To demonstrate the utilioty of the system in a real application, we briefly describe the architecture of the PALSUMM System, a symbolic smmarization system being developed at FX Palo Alto Laboratory that uses discourse structures constructed usding the theory otlined to summarize written English texts.

MiniMedia Surfer: Browsing Video Segments on Small Displays

Publication Details
  • CHI 2004 short paper
  • Apr 27, 2004

Abstract

Close
It is challenging to browse multimedia on mobile devices with small displays. We present MiniMedia Surfer, a prototype application for interactively searching a multimedia collection for video segments of interest. Transparent layers are used to support browsing subtasks: keyword query, exploration of results through keyframes, and playback of video. This layered interface smoothly blends the key tasks of the browsing process and deals with the small screen size. During exploration, the user can adjust the transparency levels of the layers using pen gestures. Details of the video segments are displayed in an expandable timeline that supports gestural interaction.

Sharing Multimedia Content with Interactive Displays: A Case Study

Publication Details
  • ACM DIS 2004, Cambridge, August 1-4, 2004. New York: ACM
  • Apr 18, 2004

Abstract

Close
The Plasma Posters are large screen, digital, interactive posterboards designed for informal content sharing within teams, groups, organizations and communities. Leveraging the fact that the physical world is often used as a canvas for asynchronous information exchange via paper fliers and posters, the Plasma Posters provide a platform for sharing interactive digital content in public places. People serendipitously encounter multimedia content usually only encountered from the desktop, while going about their daily business. In this paper we describe the Plasma Poster interface in detail, and offer and overview of the underlying information authoring, parsing, storage, distribution and publishing infrastructure, the Plasma Poster Network. We report qualitative and quantitative data collected over 14 months of use that demonstrate the Plasma Posters have become an integral part of information sharing within our organization. We conclude the paper by reflecting on the patterns of adoption, and speculate on factors that have contributed to the system's success. Finally we briefly describe three other installations of the Plasma Poster Network and new interfaces that have been designed.

Collaborative Note Taking

Publication Details
  • WMTE 2004
  • Mar 22, 2004

Abstract

Close
Collaborative note taking enables students in a class to take notes on their PDAs and share them with their "study group" in real-time. Students receive instructor's slides on their PDAs as they are displayed by the instructor. As the individual members of the group take notes pertaining to the slide being presented, their notes are automatically sent to all members of the group. In addition, to reduce their typing, students can use text they receive from other students and from the instructors slides to construct their notes. This system has been used in actual practice for a graduate level course on wireless mobile computing. In developing this system, special attention has been paid to the task of inputting text on PDAs, efficient use of the screen real estate, dynamics among students, privacy and ease of use issues.

Shot boundary detection via similarity analysis

Publication Details
  • Proceedings TRECVID 2003
  • Mar 1, 2004

Abstract

Close
In this paper, we present a framework for analyzing video using self-similarity. Video scenes are located by analyzing inter-frame similarity matrices. The approach is flexible to the choice of both feature parametrization and similarity measure and it is robust because the data is used to model itself. We present the approach and its application to shot boundary detection.

Digital Graffiti: Public Annotation of Multimedia Content

Publication Details
  • CHI 2004, Vienna, Austria, April 24-29, 2004. New York: ACM Publications.
  • Feb 26, 2004

Abstract

Close
Our physical environment is increasingly filled with multimedia content on situated, community public displays. We are designing methods for people to post and acquire digital information to and from public digital displays, and to modify and annotate previously posted content to create publicly observable threads. We support in-the-moment and on-site "person-to-place-to-people-to-persons" content interaction, annotation, augmentation and publication. We draw design inspiration from field work observations of how people remove, modify and mark up paper postings. We present our initial designs in this arena, and some initial user reactions.

Gooey Interfaces: An Approach for Rapidly Repurposing Digital Content

Publication Details
  • CHI 2004, Vienna, Austria, April 24-29, 2004. New York: ACM Publications
  • Feb 20, 2004

Abstract

Close
With the acceleration of technological development we are reaching the point where our systems and their user interfaces become to some degree outdated 'legacy systems' as soon as they are released. This raises the question of how can we maintain, extend, override, and adapt these systems while preserving what people depend on in them? In this paper we describe an approach for dynamically restructuring user interfaces into a set of communicating processes that 1) provide methods for changing their appearance, behavior, and state; and 2) report their proposed state changes so that other processes may override their actions in updating themselves to a new state. We do this for both new and wrapped legacy user interface components, thereby allowing us to repurpose user interfaces for our evolving needs. We describe how this approach has been successfully used in rapidly creating and deploying interfaces that repurpose content for new appearances and behaviors.
Publication Details
  • Communications of the ACM, February 2004, Vol. 47, No. 2, pp. 38-44
  • Feb 1, 2004

Abstract

Close
Blurring the boundary between the digital and physical in social activity spaces helps blend - and motivate - online and face-to-face community participation. This paper discusses two experimental installations of large screen displays at conferences - CHI 2002 and CSCW 2002. The displays offered a window in the conference arena onto online community information.

Contextual Contact Retrieval

Publication Details
  • International Conference on Intelligent User Interfaces (IUI 2004)
  • Jan 13, 2004

Abstract

Close
People routinely rely on physical and electronic systems to remind themselves of details regarding personal and organizational contacts. These systems include rolodexes, directories and contact databases. In order to access details regarding contacts, users must typically shift their attention from tasks they are performing to the contact system itself in order to manually look-up contacts. This paper presents an approach for automatically retrieving contacts based on users' current context. Results are presented to users in a manner that does not disrupt their tasks, but which allows them to access contact details with a single interaction. The approach promotes the discovery of new contacts that users may not have found otherwise and supports serendipity.

Inhabited Information Spaces: Living with Your Data

Publication Details
  • London: Springer-Verlag, 2003
  • Jan 5, 2004

Abstract

Close
This edited volume brings together projects funded by, or related to, the European i3 initiative. Projects address the design, development and use of rich, digital information spaces for collocated and distributed collaboration.
2003

Public and Situated Displays. Social and Interactional Aspects of Shared Display Technologies

Publication Details
  • Lond: Kluwer Academic Publishers
  • Dec 31, 2003

Abstract

Close
Public and situated display technologies have an important impact on individual and social behaviour and present us with particular interesting new design considerations and challenges. While there is a growing body of research exploring these design considerations and social impacts this work remains somewhat disparate, making it difficult to assimilate in a coherent manner. This book brings together the perspective of key researchers in the area of public and situated display technology. The chapters detail research representing the social, technical and interactional aspects of public and sitauted display technologies. The underlying concern common to these chapters is how these displays can be best designed for collaboration, coordination, community building and mobility.

THE PLASMA POSTER NETWORK Social Hypermedia on Public Display

Publication Details
  • In Public and Situated Displays. Social and Interactional Aspects of Shared Display Technologies. K. O'Hara, M.Perry, E. Churchill and D. Russell (Eds) London: Kluwer Acamdemic Publishers
  • Dec 31, 2003

Abstract

Close
The sharing of digital materials within online communities has increased significantly in recent years. Our work focuses on promoting community information sharing in public spaces using large screen, interactive, digital poster boards called the Plasma Posters. In this chapter we first describe our fieldwork-led, iterative design process, and elaborate a number of design guidelines that resulted. Following this, the design and development of the Plasma Posters themselves and the underlying network infrastructure is discussed. Finally, we present results from qualitative and quantitative evaluations over the course of a ten-month deployment of three Plasma Posters within our own organization, a software research community made up of technologists and designers. We conclude with observations regarding ergonomic, social and other factors that were raised during the design and deployment and offer reflections on factors in the success of this deployment.

A fast, interactive 3D paper-flier metaphor for digital bulletin boards

Publication Details
  • UIST 2003
  • Nov 1, 2003

Abstract

Close
We describe a novel interface for presenting interactive content on public digital bulletin boards. Inspired by paper fliers on physical bulletin boards, posted content is displayed using 3D virtual fliers attached to a virtual corkboard by virtual pushpins. Fliers appear in different orientations, creating an attractive, informal look, and have autonomous behaviors like fluttering in the wind. Passers-by can rotate, move and fold fliers; they can also interact with fliers' live content. Flier content is streamed from a server and represented by the system on large screen displays using a real-time cloth simulation algorithm. We describe our prototype, and offer the results of an initial evaluative user study.
Publication Details
  • Proc. ACM Multimedia 2003. pp. 364-373
  • Nov 1, 2003

Abstract

Close
We present similarity-based methods to cluster digital photos by time and image content. The approach is general, unsupervised, and makes minimal assumptions regarding the structure or statistics of the photo collection. We present results for the algorithm based solely on temporal similarity, and jointly on temporal and content-based similarity. We also describe a supervised algorithm based on learning vector quantization. Finally, we include experimental results for the proposed algorithms and several competing approaches on two test collections.
Publication Details
  • Proc. ACM Multimedia 2003, pp. 546-554
  • Nov 1, 2003

Abstract

Close
We present a system that allows remote and local participants to control devices in a meeting environment using mouse or pen based gestures "through" video windows. Unlike state-of-the-art device control interfaces that require interaction with text commands, buttons, or other artificial symbols, our approach allows users to interact with devices through live video of the environment. This naturally extends our video supported pan/tilt/zoom (PTZ) camera control system, by allowing gestures in video windows to control not only PTZ cameras, but also other devices visible in video images. For example, an authorized meeting participant can show a presentation on a screen by dragging the file on a personal laptop and dropping it on the video image of the presentation screen. This paper presents the system architecture, implementation tradeoffs, and various meeting control scenarios.
Publication Details
  • Proc. ACM Multimedia 2003. pp. 92-93
  • Nov 1, 2003

Abstract

Close
To simplify the process of editing interactive video, we developed the concept of "detail-on-demand" video as a subset of general hypervideo. Detail-on-demand video keeps the authoring and viewing interfaces relatively simple while supporting a wide range of interactive video applications. Our editor, Hyper-Hitchcock, provides a direct manipulation environment in which authors can combine video clips and place hyperlinks between them. To summarize a video, Hyper-Hitchcock can also automatically generate a hypervideo composed of multiple video summary levels and navigational links between these summaries and the original video. Viewers may interactively select the amount of detail they see, access more detailed summaries, and navigate to the source video through the summary.
Publication Details
  • Proc. ACM Multimedia 2003. pp. 392-401
  • Nov 1, 2003

Abstract

Close
In this paper, we describe how a detail-on-demand representation for interactive video is used in video summarization. Our approach automatically generates a hypervideo composed of multiple video summary levels and navigational links between these summaries and the original video. Viewers may interactively select the amount of detail they see, access more detailed summaries, and navigate to the source video through the summary. We created a representation for interactive video that supports a wide range of interactive video applications and Hyper-Hitchcock, an editor and player for this type of interactive video. Hyper-Hitchcock employs methods to determine (1) the number and length of levels in the hypervideo summary, (2) the video clips for each level in the hypervideo, (3) the grouping of clips into composites, and (4) the links between elements in the summary. These decisions are based on an inferred quality of video segments and temporal relations those segments.