DOI: 10.1145/1065385.1065501
Article

What type of page is this?: genre as web descriptor

Published: 07 June 2005

Abstract

Many have suggested the use of genres to ameliorate the problem of web search, e.g., [1,3,4,5,6,7]. A central issue in the implementation of this idea is the choice of genres to be used as web page descriptors. Several studies have explored user terminology for and recognition of several types of digital documents, e.g., various types of office documents [8], personal homepages [2], and pages returned by user web searches [4,6]. This poster reports on a series of three user studies with the purpose of developing a genre "palette" for use in web retrieval. Pages viewed by participants in these studies were limited to the edu domain, as in [5].

In the first study, three participants (an information technology professional, an oncology social worker, and a computer science professor) were each given, in separate sessions, a stack of 102 web page printouts and asked to separate the pages into piles according to genre. They were also asked to name the genres by writing the names on sticky notes and placing them on the piles. After the piles were complete, participants were asked to provide a short description, of one or two sentences, for each genre, and then to describe the page characteristics that led them to place a page in that genre.

A list of 49 genre names and definitions was developed from the work of the three participants, keeping the terminology as similar as possible to the original, while combining definitions which were nearly identical in wording. In a second user study, each of ten participants was given this list of genre name/definition pairs, the same stack of 102 printed web pages (arranged in a different random order for each participant), and a data collection form on which he/she recorded a genre for each web page. For each of the 102 web pages, the participant was given the option either to write the number from the list corresponding to the genre/definition pair which best described the page, or to provide his/her own suggestion for a genre name and definition, if none of those in the list seemed adequate. The participants were drawn from a convenience sample of approximately 10 college graduates of various occupations. Given that participants chose genres from a list of 48, many of which were extremely similar in nature, the resulting level of agreement (half or more of the participants agreeing on one genre for a given page in 60% of the instances) is quite acceptable. A set of five principles for creating a genre palette from individuals' sortings was developed. Based on those principles, the original list was trimmed down to 18 genres.

The third study was an online experiment in which 257 college faculty, students, and staff from two schools categorized a new set of 55 pages using the 18 genres. On average, over 70% agreed on the genre of each page. No study of this scale is known to have reported user recognition of web genres. This user validation is necessary to set upper bounds for machine categorization efforts. Also, because genre is usually considered to be "socially defined", genre studies using researcher-defined a priori categories (e.g., [5]) may not be able to show genres' usefulness for web search.

Interestingly, the genres in this palette, although developed independently, are similar to 7 of 8 Internet-wide genres based on user input reported in [7], and similar to 8 of 11 Internet-wide genres as reported in [3]. Based on these observations, one might infer that some substantial amount of genre knowledge exists among users, even from different cultures (in this case, the United States, Germany, and Sweden).
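The agreement figures quoted in the abstract (half or more of the participants choosing one genre for a page, and the average share of participants agreeing on each page's genre) can be illustrated with a small calculation. The following is a minimal sketch, not the authors' procedure: the abstract does not say how agreement was computed, so the function names, the 0.5 threshold parameter, and the toy genre labels below are all assumptions made for illustration.

```python
from collections import Counter


def modal_share(labels):
    """Return (most frequently chosen genre, fraction of participants who chose it)."""
    counts = Counter(labels)
    genre, n = counts.most_common(1)[0]
    return genre, n / len(labels)


def majority_agreement_rate(assignments, threshold=0.5):
    """Fraction of pages on which at least `threshold` of participants chose the same genre."""
    agreed = sum(
        1 for labels in assignments.values()
        if modal_share(labels)[1] >= threshold
    )
    return agreed / len(assignments)


def mean_modal_share(assignments):
    """Average, over pages, of the fraction of participants who chose the modal genre."""
    shares = [modal_share(labels)[1] for labels in assignments.values()]
    return sum(shares) / len(shares)


# Hypothetical toy data: page id -> genre chosen by each of four participants.
# These pages and genre names are invented and do not come from the study.
assignments = {
    "page_01": ["syllabus", "syllabus", "course page", "syllabus"],
    "page_02": ["homepage", "faculty page", "directory", "news"],
    "page_03": ["faq", "faq", "help", "index"],
}

print(majority_agreement_rate(assignments))  # share of pages with a 50%+ modal genre
print(mean_modal_share(assignments))         # average agreement per page
```

Under these assumptions, the first measure corresponds to the kind of figure reported for the second study, and the second to the kind of per-page average reported for the third.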

References

[1] Crowston, K. & Kwasnik, B. (2003). Can document-genre metadata improve information access to large digital collections? Library Trends, 52(2), 345--361.
[2] Dillon, A. & Gushrowski, B. (2000). Genres and the Web: Is the personal home page the first uniquely digital genre? Journal of the American Society for Information Science, 5, 202--205.
[3] Karlgren, J., Bretan, I., Dewe, J., Hallberg, A., & Wolkert, N. (1998). Iterative information retrieval using fast clustering and usage-specific genres. Eighth DELOS Workshop - User Interface in Digital Libraries, 85--92.
[4] Nilan, M., Pomerantz, J. & Paling, S. (2001). Genres from the bottom up: What has the Web brought us? Proceedings of the American Society for Information Science and Technology Annual Meeting, 330--339.
[5] Rehm, G. (2002). Towards automatic Web genre identification. Proceedings of the 35th Annual Hawaii International Conference on System Sciences.
[6] Roussinov, D., Crowston, K., Nilan, M., Kwasnik, B., Cai, J. & Liu, X. (2001). Genre based navigation on the web. Proceedings of the 34th Annual Hawaii International Conference on System Sciences, Digital Documents Track, IEEE Computer Society Press.
[7] Stein, B. & Meyer zu Eissen, S. (2004). Genre classification of web pages. Proceedings of the 27th German Conference on Artificial Intelligence (KI-2004).
[8] Toms, E., Campbell, D., & Blades, R. (1999). Does genre define the shape of information: the role of form and function in user interaction with digital documents. Proceedings of the 62nd American Society for Information Science Annual Meeting, 693--704.

Cited By

  • (2022) Genre containers. Journal of the Association for Information Science and Technology, 73:4 (609-624). DOI: 10.1002/asi.24600. Online publication date: 1-Mar-2022
  • (2009) Internet Genres. Encyclopedia of Library and Information Sciences, Third Edition (2983-2995). DOI: 10.1081/E-ELIS3-120043520. Online publication date: 7-Dec-2009
  • (2008) Towards the use of genre to improve search in digital libraries: Where do we go from here? Proceedings of the American Society for Information Science and Technology, 44:1 (1-5). DOI: 10.1002/meet.1450440118. Online publication date: 24-Oct-2008

Published In

JCDL '05: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
June 2005
450 pages
ISBN:1581138768
DOI:10.1145/1065385
General Chair: Mary Marlino
Program Chairs: Tamara Sumner, Frank Shipman
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. classification
  2. genre
  3. metadata
  4. web search

Qualifiers

  • Article

Conference

JCDL05

Acceptance Rates

Overall Acceptance Rate 415 of 1,482 submissions, 28%
