|
ABSTRACT
We present SketchREAD, a multi-domain sketch recognition engine capable of recognizing freely hand-drawn diagrammatic sketches. Current computer sketch recognition systems are difficult to construct, and either are fragile or accomplish robustness by severely limiting the designer's drawing freedom. Our system can be applied to a variety of domains by providing structural descriptions of the shapes in that domain; no training data or programming is necessary. Robustness to the ambiguity and uncertainty inherent in complex, freely-drawn sketches is achieved through the use of context. The system uses context to guide the search for possible interpretations and uses a novel form of dynamically constructed Bayesian networks to evaluate these interpretations. This process allows the system to recover from low-level recognition errors (e.g., a line misclassified as an arc) that would otherwise result in domain level recognition errors. We evaluated Sketch-READ on real sketches in two domains--family trees and circuit diagrams--and found that in both domains the use of context to reclassify low-level shapes significantly reduced recognition error over a baseline system that did not reinterpret low-level classifications. We also discuss the system's potential role in sketch based user interfaces.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
C. Alvarado. Sketch recognition and usability: Guidelines for design and development. In AAAI Fall Symposium on Pen-Based Interaction, 2004.
|
| |
3
|
C.Alvarado and R. Davis. Resolving ambiguities to create a natural sketch based interface. In Proc. of IJCAI, 2001.
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
T. Hammond and R. Davis. LADDER: A language to describe drawing, display, and editing in sketch recognition. In Proc. of IJCAI, 2003.
|
| |
8
|
D. Koller and A. Pfeffer. Object-oriented bayesian networks. In Proc. of UAI, 1997.
|
| |
9
|
|
 |
10
|
A. Chris Long, Jr. , James A. Landay , Lawrence A. Rowe , Joseph Michiels, Visual similarity of pen gestures, Proceedings of the SIGCHI conference on Human factors in computing systems, p.360-367, April 01-06, 2000, The Hague, The Netherlands
[doi> 10.1145/332040.332458]
|
| |
11
|
J. V. Mahoney and M. P. J. Fromherz. Three main concerns in sketch recognition and an approach to addressing them. In AAAI Spring Symposium on Sketch Understanding, 2002.
|
| |
12
|
|
| |
13
|
M. W. Newman, J. Lin, J. I. Hong, and J. A. Landay. DENIM: An informal web site design tool inspired by observations of practice. Human-Computer Interaction, 18(3):259--324, 2003.
|
 |
14
|
Eric Saund , David Fleet , Daniel Larner , James Mahoney, Perceptually-supported image editing of text and graphics, Proceedings of the 16th annual ACM symposium on User interface software and technology, p.183-192, November 02-05, 2003, Vancouver, Canada
[doi> 10.1145/964696.964717]
|
 |
15
|
|
| |
16
|
M. Shilman, H. Pasula, S. Russell, and R. Newton. Statistical visual language models for ink parsing. In AAAI Spring Symposium on Sketch Understanding, 2002.
|
| |
17
|
|
| |
18
|
|
| |
19
|
A. Torralba and P. Sinha. Statistical context priming for object detection. In Proc. of ICCV, 2001.
|
| |
20
|
L. Wu, S. L. Oviatt, and P. R. Cohen. Multimodal integration--a statistical view. IEEE Trans. on Multimedia, 1(4):334--341, 1999.
|
CITED BY 12
|
|
|
|
|
Tracy Hammond , Brian Eoff , Brandon Paulson , Aaron Wolin , Katie Dahmen , Joshua Johnston , Pankaj Rajan, Free-sketch recognition: putting the chi in sketching, CHI '08 extended abstracts on Human factors in computing systems, April 05-10, 2008, Florence, Italy
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Giovanni Casella , Vincenzo Deufemia , Viviana Mascardi , Gennaro Costagliola , Maurizio Martelli, An agent-based framework for sketched symbol interpretation, Journal of Visual Languages and Computing, v.19 n.2, p.225-257, April, 2008
|
|
|
|
|
|