research-article

Thematic organization of web content for distraction-free text-to-speech narration

Authors:
Muhammad Asiful Islam

Stony Brook University, Stony Brook, NY, USA

Stony Brook University, Stony Brook, NY, USA
View Profile

,
Faisal Ahmed

Stony Brook University, Stony Brook, NY, USA

Stony Brook University, Stony Brook, NY, USA
View Profile

,
Yevgen Borodin

Stony Brook University, Stony Brook, NY, USA

Stony Brook University, Stony Brook, NY, USA
View Profile

,
I.V. Ramakrishnan

Stony Brook University, Stony Brook, NY, USA

Stony Brook University, Stony Brook, NY, USA
View Profile

ASSETS '12: Proceedings of the 14th international ACM SIGACCESS conference on Computers and accessibilityOctober 2012Pages 17–24https://doi.org/10.1145/2384916.2384920

Published:22 October 2012Publication History

ASSETS '12: Proceedings of the 14th international ACM SIGACCESS conference on Computers and accessibility

Pages 17–24

ABSTRACT

People with visual disabilities, especially those who are blind, have digital content narrated to them by text-to-speech (TTS) engines (e.g., with the help of screen readers). Naively narrating web pages, particularly the ones consisting of several diverse pieces (e.g., news summaries, opinion pieces, taxonomy, ads), with TTS engines without organizing them into thematic segments will make it very difficult for the blind user to mentally separate out and comprehend the essential elements in a segment, and the effort to do so can cause significant cognitive stress. One can alleviate this difficulty by segmenting web pages into thematic pieces and then narrating each of them separately. Extant segmentation methods typically segment web pages using visual and structural cues. The use of such cues without taking into account the semantics of the content, tends to produce "impure" segments containing extraneous material interspersed with the essential elements. In this paper, we describe a new technique for identifying thematic segments by tightly coupling visual, structural, and linguistic features present in the content. A notable aspect of the technique is that it produces segments with very little irrelevant content. Another interesting aspect is that the clutter-free main content of a web page, that is produced by the Readability tool and the "Reader" feature of the Safari browser, emerges as a special case of the thematic segments created by our technique. We provide experimental evidence of the effectiveness of our technique in reducing clutter. We also describe a user study with 23 blind subjects of its impact on web accessibility.

References

Document object model (DOM) technical reports (http://www.w3.org/DOM/DOMTR). 2010.Google Scholar
x-path (http://www.w3.org/tr/xpath/). 2010.Google Scholar
Apple. Voiceover, screen reader from apple (http://www.apple.com/accessibility/voiceover). 2010.Google Scholar
Y. Borodin, F. Ahmed, M. A. Islam, Y. Puzis, V. Melnyk, S. Feng, I. V. Ramakrishnan, and G. Dausch. Hearsay: a new generation context-driven multi-modal assistive web browser. In WWW, 2010. Google ScholarDigital Library
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. VIPS: a vision-based page segmentation algorithm. Microsoft Technical Report, (MSR-TR-2003-79), 2003.Google Scholar
D. Chakrabarti, R. Kumar, and K. Punera. A graph-theoretic approach to webpage segmentation. In WWW, pages 377--386, 2008. Google ScholarDigital Library
D. Egnor. Document segmentation based on visual gaps. L.L.P. HARRITY and SNYDER, 2006.Google Scholar
G. H. Golub and W. Kahan. Calculating the singular values and pseudo-inverse of a matrix. Journal of the Society for Industrial and Applied Mathematics, pages 205--224, 1965.Google ScholarCross Ref
H.-F. Guo, J. Mahmud, Y. Borodin, A. Stent, and I. V. Ramakrishnan. A general approach for partitioning web page content based on geometric and style information. In ICDAR, pages 929--933, 2007.. Google ScholarDigital Library
G. Hattori, K. Hoashi, K. Matsumoto, and F. Sugaya. Robust web page segmentation for mobile terminal using content-distances and page layout information. In WWW, pages 361--370, 2007. Google ScholarDigital Library
M. A. Islam, F. Ahmed, Y. Borodin, and I. V. Ramakrishnan. Tightly coupling visual and linguistic features for enriching audio-based web browsing experience. In CIKM, 2011. Google ScholarDigital Library
JAWS. (http://www.freedomscientific.com). 2010.Google Scholar
T. K. Landauer and S. T. Dumais. Latent semantic analysis. Scholarpedia, 3(11):43--56, 2008.Google ScholarCross Ref
J. Mahmud, Y. Borodin, and I. V. Ramakrishnan. Csurf: a context driven non-visual web-browser. In WWW, pages 31--40, 2007. Google ScholarDigital Library
C. D. Manning, P. Raghavan, and H. Schutze. Introduction to information retrieval. Cambridge University Press, 2008. Google ScholarDigital Library
A. Pnueli, R. Bergman, S. Schein, and O. Barko. Web page layout via visual segmentation. (HPL-2009-160).Google Scholar
Readability. (https://www.readability.com). 2010.Google Scholar
G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Commun. ACM, 18(11):613--620, 1975. Google ScholarDigital Library
A. Strehl. Relationship-based clustering and cluster ensembles for high-dimensional data mining. PhD thesis, The University of Texas at Austin, May 2002. Google ScholarDigital Library

Index Terms

Thematic organization of web content for distraction-free text-to-speech narration
1. Human-centered computing
  1. Human computer interaction (HCI)
2. Information systems
  1. Information retrieval

Recommendations

WAI-ARIA live regions: eBuddy IM as a case example
W4A '10: Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)

Rich Internet Applications (RIAs) offer new levels of user interactivity through a Web browser. By combining semantics, style and behavior it is possible to create a RIA that can rival a traditional desktop application. Unfortunately, much of the ...
Read More
Guidelines for an accessible web automation interface
ASSETS '11: The proceedings of the 13th international ACM SIGACCESS conference on Computers and accessibility

In recent years, the Web has become an ever more sophisticated and irreplaceable tool in our daily lives. While the visual Web has been advancing at a rapid pace, assistive technology has not been able to keep up, increasingly putting visually impaired ...
Read More
Exploring the relationship between web accessibility and user experience

Understanding the interplay between the user experience (UX) and Web accessibility is key to design Web sites that, beyond access, could provide a better UX for people with disabilities. In this paper we examine the relationship between UX attributes ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ASSETS '12: Proceedings of the 14th international ACM SIGACCESS conference on Computers and accessibility
October 2012
321 pages
ISBN:9781450313216
DOI:10.1145/2384916
General Chair:
Matt Huenerfauth
City University of New York, USA
,
Program Chair:
Sri Kurniawan
University of California Santa Cruz, USA
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 22 October 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
blind users
clustering
screen readers
segmentation
singular value decomposition
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate436of1,556submissions,28%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 238
  Total Downloads
- Downloads (Last 12 months)4
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Thematic organization of web content for distraction-free text-to-speech narration

ASSETS '12: Proceedings of the 14th international ACM SIGACCESS conference on Computers and accessibility

ABSTRACT

References

Cited By

Index Terms

Recommendations

WAI-ARIA live regions: eBuddy IM as a case example

Guidelines for an accessible web automation interface

Exploring the relationship between web accessibility and user experience