skip to main content
research-article

Non-profit geographically constrained locator

Published:30 July 2015Publication History
Skip Abstract Section

Abstract

There exist numerous non-profit entities and organizations that provide social services to those in need. These organizations may be governmental, secular, parochial, or religious. The services provided may be long term institutionalized support, seasonal or limited time benefits. Beneficiaries of social services may be restricted by age, sex, family status, citizenship, and/or income level. Given this variability, we envisioned the need to create an interactive geographically constrained web based social services client matching system. In this system, a client would enter basic demographic information and the system would return a set of available social services within a specific geographic area.

As a first step in trying to create this system, undergraduate student Dan Myers investigated the viability of using Scrapy to crawl and scrape data from the web based on geographic location. Scrapy is an open-source web crawling framework written in Python. Scrapy was chosen because it is open source, widely used, and provides tools for both scraping and crawling [2]. The main focus of Dan Myers' project was to write a web crawler that utilizes regular expressions to extract relevant data from geographically targeted websites. Specifically, Dan's project used Scrapy to find all web mentions of physical addresses in the zip codes of the three most northern counties in Kentucky. The two main tasks for this project were (1) to use Scrapy's spider class to extract street addresses from the chosen zip codes and (2) to create a Scrapy item object to manage the data. This project demonstrated that by using the open source tool Scrapy, the problem of extracting geographically relevant information is faster and easier to manage than previously proposed systems such as the gazetteer approach used in the web-a-where project [1].

The next phase of the project will be undertaken by a team of UR-STEM students in the summer of 2015. At NKU, UR-STEM (which stands for undergraduate research in STEM) is a paid summer research opportunity for students early in their college careers. A goal for this summer's research will be the creation of a fully working prototype of a web based non-profit geographically constrained locator. Depending on the quality of the code produced, the project may be released as the first version of an ongoing open source software project which could, potentially, lead to the creation of an HFOSS (Humanitarian Free and Open Source Software) project whose original programmers are all students [3].

References

  1. Einat Amitay, Nadav Har'El, Ron Sivan, and Aya Soffer. Web-a-where: geotagging web content. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '04). ACM, New York, NY, USA, 273--280. DOI=10.1145/1008992.1009040 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Kate Matsudaira. Capturing and structuring data mined from the web. Commun. ACM 57, 3 (March 2014), 10--11. DOI=10.1145/2567664 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Annie Murphy Paul. Build something. Slate (September 24, 2014), <http://www.slate.com/articles/technology/future_tense/2014/09/hfoss_software_for_humanity_computer_science_students_solve_real_world_problems.html>, accessed 31 January 2015Google ScholarGoogle Scholar

Index Terms

  1. Non-profit geographically constrained locator

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGCAS Computers and Society
      ACM SIGCAS Computers and Society  Volume 45, Issue 2
      June 2015
      40 pages
      ISSN:0095-2737
      DOI:10.1145/2809957
      Issue’s Table of Contents

      Copyright © 2015 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 30 July 2015

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader