ACM Home Page
Please provide us with feedback. Feedback
SketchREAD: a multi-domain sketch recognition engine
Full text PdfPdf (263 KB)
Source Symposium on User Interface Software and Technology archive
Proceedings of the 17th annual ACM symposium on User interface software and technology table of contents
Santa Fe, NM, USA
SESSION: Pens & sketching table of contents
Pages: 23 - 32  
Year of Publication: 2004
ISBN:1-58113-957-8
Authors
Christine Alvarado  MIT CSAIL, Cambridge, MA
Randall Davis  MIT CSAIL, Cambridge, MA
Sponsors
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 119,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1029632.1029637
What is a DOI?

ABSTRACT

We present SketchREAD, a multi-domain sketch recognition engine capable of recognizing freely hand-drawn diagrammatic sketches. Current computer sketch recognition systems are difficult to construct, and either are fragile or accomplish robustness by severely limiting the designer's drawing freedom. Our system can be applied to a variety of domains by providing structural descriptions of the shapes in that domain; no training data or programming is necessary. Robustness to the ambiguity and uncertainty inherent in complex, freely-drawn sketches is achieved through the use of context. The system uses context to guide the search for possible interpretations and uses a novel form of dynamically constructed Bayesian networks to evaluate these interpretations. This process allows the system to recover from low-level recognition errors (e.g., a line misclassified as an arc) that would otherwise result in domain level recognition errors. We evaluated Sketch-READ on real sketches in two domains--family trees and circuit diagrams--and found that in both domains the use of context to reclassify low-level shapes significantly reduced recognition error over a baseline system that did not reinterpret low-level classifications. We also discuss the system's potential role in sketch based user interfaces.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
C. Alvarado. Sketch recognition and usability: Guidelines for design and development. In AAAI Fall Symposium on Pen-Based Interaction, 2004.
 
3
C.Alvarado and R. Davis. Resolving ambiguities to create a natural sketch based interface. In Proc. of IJCAI, 2001.
 
4
5
6
 
7
T. Hammond and R. Davis. LADDER: A language to describe drawing, display, and editing in sketch recognition. In Proc. of IJCAI, 2003.
 
8
D. Koller and A. Pfeffer. Object-oriented bayesian networks. In Proc. of UAI, 1997.
 
9
10
 
11
J. V. Mahoney and M. P. J. Fromherz. Three main concerns in sketch recognition and an approach to addressing them. In AAAI Spring Symposium on Sketch Understanding, 2002.
 
12
 
13
M. W. Newman, J. Lin, J. I. Hong, and J. A. Landay. DENIM: An informal web site design tool inspired by observations of practice. Human-Computer Interaction, 18(3):259--324, 2003.
14
15
 
16
M. Shilman, H. Pasula, S. Russell, and R. Newton. Statistical visual language models for ink parsing. In AAAI Spring Symposium on Sketch Understanding, 2002.
 
17
 
18
 
19
A. Torralba and P. Sinha. Statistical context priming for object detection. In Proc. of ICCV, 2001.
 
20
L. Wu, S. L. Oviatt, and P. R. Cohen. Multimodal integration--a statistical view. IEEE Trans. on Multimedia, 1(4):334--341, 1999.

CITED BY  12
 

Collaborative Colleagues:
Christine Alvarado: colleagues
Randall Davis: colleagues