|
ABSTRACT
The human visual system makes a great deal more of images than the elemental marks on a surface. In the course of viewing, creating, or editing a picture, we actively construct a host of visual structures and relationships as components of sensible interpretations. This paper shows how some of these computational processes can be incorporated into perceptually-supported image editing tools, enabling machines to better engage users at the level of their own percepts. We focus on the domain of freehand sketch editors, such as an electronic whiteboard application for a pen-based computer. By using computer vision techniques to perform covert recognition of visual structure as it emerges during the course of a drawing/editing session, a perceptually supported image editor gives users access to visual objects as they are perceived by the human visual system. We present a flexible image interpretation architecture based on token grouping in a multiscale blackboard data structure. This organization supports multiple perceptual interpretations of line drawing data, domain-specific knowledge bases for interpretable visual structures, and gesture-based selection of visual objects. A system implementing these ideas, called PerSketch, begins to explore a new space of WYPIWYG (What You Perceive Is What You Get) image editing tools.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Lee, S. Recognizing Hand-Drawn Electrical Circuit Symbols with Attributed Graph Matching. In H. S. Baird, H. Bunke, and K. Yamamoto (eds.), Structured Document Image Analysis. Springer-Verlag, New York, 1992.
|
| |
3
|
Mart, D. Early Processing of Visual Information. Phil. Trans. R. Soc. Lond., B 275 (1976), 483-519.
|
| |
4
|
|
| |
5
|
Montalvo, F. Diagram Understanding: The Symbolic Descriptions Behind the Scenes. In T. ichikawa, E. Jungert, and R. Korfhage (eds.), Visual Languages and Applications. Plenum Press, New York, 1990.
|
| |
6
|
Moran, T. Deformalizing Computer and Communication Systems. Position Paper for the InterCHI93 Research Symposium. 1993.
|
| |
7
|
Okazaki, S., and Tsuji, Y. An Adaptive Recognition Method for Line Drawings Using Construction Rules. NEC Research and Development Journal, 92 (1989).
|
 |
8
|
Elin Rønby Pedersen , Kim McCall , Thomas P. Moran , Frank G. Halasz, Tivoli: an electronic whiteboard for informal workgroup meetings, Proceedings of the SIGCHI conference on Human factors in computing systems, p.391-398, April 24-29, 1993, Amsterdam, The Netherlands
[doi> 10.1145/169059.169309]
|
| |
9
|
|
| |
10
|
Sato, T., and Tojo, A. Recognition and Understanding of Hand-Drawn Diagrams. Proc. 6th International Conference on Pattern Recognition. IEEE Computer Society Press, New Jersey, 1982.
|
| |
11
|
|
| |
12
|
|
CITED BY 26
|
Thomas P. Moran , Patrick Chiu , William van Melle, Pen-based interaction techniques for organizing material on an electronic whiteboard, Proceedings of the 10th annual ACM symposium on User interface software and technology, p.45-54, October 14-17, 1997, Banff, Alberta, Canada
|
|
|
Thomas P. Moran , Patrick Chiu , William van Melle , Gordon Kurtenbach, Implicit structure for pen-based systems within a freeform interaction paradigm, Proceedings of the SIGCHI conference on Human factors in computing systems, p.487-494, May 07-11, 1995, Denver, Colorado, United States
|
|
Thomas P. Moran , Patrick Chiu , William van Melle, Pen-based interaction techniques for organizing material on an electronic whiteboard, Proceedings of the 10th annual ACM symposium on User interface software and technology, p.45-54, October 14-17, 1997, Banff, Alberta, Canada
|
|
|
Frank M. Shipman, III , Catherine C. Marshall , Thomas P. Moran, Finding and using implicit structure in human-organized spatial layouts of information, Proceedings of the SIGCHI conference on Human factors in computing systems, p.346-353, May 07-11, 1995, Denver, Colorado, United States
|
|
|
|
|
|
Thomas P. Moran , William van Melle , Patrick Chiu, Spatial interpretation of domain objects integrated into a freeform electronic whiteboard, Proceedings of the 11th annual ACM symposium on User interface software and technology, p.175-184, November 01-04, 1998, San Francisco, California, United States
|
|
Eric Saund , David Fleet , Daniel Larner , James Mahoney, Perceptually-supported image editing of text and graphics, Proceedings of the 16th annual ACM symposium on User interface software and technology, p.183-192, November 02-05, 2003, Vancouver, Canada
|
|
|
|
Yang Li , James A. Landay , Zhiwei Guan , Xiangshi Ren , Guozhong Dai, Sketching informal presentations, Proceedings of the 5th international conference on Multimodal interfaces, November 05-07, 2003, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
Takeo Igarashi , Satoshi Matsuoka , Sachiko Kawachiya , Hidehiko Tanaka, Interactive beautification: a technique for rapid geometric design, Proceedings of the 10th annual ACM symposium on User interface software and technology, p.105-114, October 14-17, 1997, Banff, Alberta, Canada
|
|
|
|
|
|
James Lin , Mark W. Newman , Jason I. Hong , James A. Landay, DENIM: finding a tighter fit between tools and practice for Web site design, Proceedings of the SIGCHI conference on Human factors in computing systems, p.510-517, April 01-06, 2000, The Hague, The Netherlands
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
I.
Computing Methodologies
I.3
COMPUTER GRAPHICS
I.3.4
Graphics Utilities
Subjects:
Graphics editors
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Interaction styles (e.g., commands, menus, forms, direct manipulation)
I.
Computing Methodologies
I.5
PATTERN RECOGNITION
I.5.4
Applications
Subjects:
Computer vision
General Terms:
Algorithms,
Design
Keywords:
PerSketch,
computer vision,
drawing tools,
gestures,
graphics editing,
image editing,
interactive graphics,
machine vision,
pen computing,
perceptual grouping,
perceptual organization,
scale space blackboard WYSIWYG,
sketch tools,
token grouping
|