skip to main content
10.1145/1378889.1378921acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

Math information retrieval: user requirements and prototype implementation

Authors Info & Claims
Published:16 June 2008Publication History

ABSTRACT

We report on the user requirements study and preliminary implementation phases in creating a digital library that indexes and retrieves educational materials on math. We first review the current approaches and resources for math retrieval, then report on the interviews of a small group of potential users to properly ascertain their needs. While preliminary, the results suggest that meta-search and resource categorization are two basic requirements for a math search engine. In addition, we implement a prototype categorization system and show that the generic features work well in identifying the math contents from the webpage but perform less well at categorizing them. We discuss our long term goals, where we plan to investigate how math expressions and text search may be best integrated.

References

  1. G. Attardi, A. Gullí, and F. Sebastiani. Automatic Web page categorization by link and context analysis. In C. Hutchison and G. Lanzarone, editors, Proceedings of THAI-99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, pages 105--119, Varese, IT, 1999.Google ScholarGoogle Scholar
  2. N. J. Belkin, R. N. Oddy, and H. M. Brooks. Ask for information retrieval: Part i.: Background and theory. pages 299--304, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. P. Bishop. Digital libraries and knowledge disaggregation: the use of journal article components. In DL '98: Proceedings of the third ACM conference on Digital Libraries, pages 29--39, New York, NY, USA, 1998. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3--10, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. M. Brown. Information seeking behavior of scientists in the electronic information age: Astronomers, chemists, mathematicians, and physicists. Journal of the American Society for Information Science and Technology, 50(10):929--943, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. Buchanan, S. J. Cunningham, A. Blandford, J. Rimmer, and C. Warwick. Information seeking by humanities scholars. In ECDL, pages 218--229, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Extracting content structure for web pages based on visual representation. In Fifth Asia Pacific Web Conference (APWeb2003), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. O. Case. Looking for Information, Second Edition: A Survey of Research on Information Seeking, Needs, and Behavior (Library and Information Science). Academic Press, 2006.Google ScholarGoogle Scholar
  9. M. B. Eisenberg and R. E. Berkowitz. Information problem-solving: the Big Six Skills approach to library and information skills instruction. Norwood, NJ: Albex Publishing, 1990.Google ScholarGoogle Scholar
  10. X.-D. Gu, J. Chen, W.-Y. Ma, and G.-L. Chen. Visual based content understanding towards web adaptation. In AH '02: Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, pages 164--173, London, UK, 2002. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Hearst. Design recommendations for hierarchical faceted search interfaces. In ACM SIGIR Workshop on Faceted Search, 2006.Google ScholarGoogle Scholar
  12. P. Jipsen. Text-based input formats for mathematical formulas. In The Evolution of Mathematical Communication in the Age of Digital Libraries, IMA "Hot Topics" Workshop, U.S.A, 2006.Google ScholarGoogle Scholar
  13. F. Kamareddine, R. Lamar, M. Maarek, and J. B. Wells. Restoring natural language as a computerised mathematics input method. In Towards Mechanized Mathematical Assistants, MKM 2007, pages 280--295, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Kan, J. Klavans, and K. McKeown. Linear segmentation and segment significance. 1998.Google ScholarGoogle Scholar
  15. M. Kohlhase and A. Franke. MBase: Representing knowledge and context for the integration of mathematical software systems. Journal of Symbolic Computation, 32(4):365--402, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Kohlhase and I. Sucan. A search engine for mathematical formulae. In Proceedings of Artificial Intelligence and Symbolic Computation, AISC 2006, number 4120 in LNAI, pages 241--253. Springer Verlag, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Kruger. Searching mathematics with zentralblatt math: Overview and outlook. In Enhancing the Searching of Mathematics, IMA "Hot Topics" Workshop, U.S.A, 2004.Google ScholarGoogle Scholar
  18. A. M. Lau. Advancing PARCELS: PARser for content extraction and logical structure using inter- and intra-similarity features. Technical report, National University of Singapore, 2005.Google ScholarGoogle Scholar
  19. C. H. Lee, M.--Y. Kan, and S. Lai. Stylistic and lexical co-training for web block classification. In Proceedings of WIDM 04, pages 136--143, Washington, D.C., USA, 2004. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y.-B. Lee and S.-H. Myaeng. Text genre classification with genre-revealing and subject-revealing features. In SIGIR, pages 145--150, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. P. Libbrecht and E. Melis. Methods to access and retrieve mathematical content in activemath. In ICMS, volume 4151 of Lecture Notes in Computer Science, pages 331--342. Springer, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Miner and R. Munavalli. An approach to mathematical search through query formulation and data normalization. In Towards Mechanized Mathematical Assistants, MKM 2007, pages 342--355, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. G. Newby. Information space based on HTML structure. In The Ninth Text REtrieval Conference (TREC 9), pages 601---610, 2000.Google ScholarGoogle Scholar
  24. H. R. Tibbo. Primarily history: Historians and the search for primary source materials. In JCDL '02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital libraries, pages 1--10, New York, NY, USA, 2002. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Wiberley and W. G. Jones. Time and technology: A decade-long look at humanists' use of electronic information technology. College and Research Libraries, 61, September, pages 421--431, 2000.Google ScholarGoogle Scholar

Index Terms

  1. Math information retrieval: user requirements and prototype implementation

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          JCDL '08: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
          June 2008
          490 pages
          ISBN:9781595939982
          DOI:10.1145/1378889

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 16 June 2008

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          JCDL '08 Paper Acceptance Rate33of117submissions,28%Overall Acceptance Rate415of1,482submissions,28%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader