ABSTRACT
Efficient health systems require reliable data. In developing countries the need for accurate data is particularly acute, as organizations are often forced to make decisions on a tight budget with limited capacity for data collection. In this note, we describe recent progress toward developing a set of algorithms that can help detect and classify anomalies in health worker data. Building on recent efforts to use unsupervised multinomial techniques for outlier detection, we outline the steps required to turn a set of statistical tests into a framework that can be implemented by health organizations, and calibrate these algorithms on a large dataset from a partner health organization. Here, we describe the core methods, present results from ongoing analyses, and outline our plan for future work, including plans to obtain labeled training data that will allow us to detect and classify different types of outlier in community health worker data.
- Birnbaum, B., DeRenzi, B., Flaxman, A. D., & Lesh, N. (2012, March). Automated quality control for mobile data collection. In Proceedings of the 2nd ACM Symposium on Computing for Development (p. 1). ACM. Google ScholarDigital Library
- Birnbaum, B., Borriello, G., Flaxman, A. D., DeRenzi, B., & Karlin, A. R. (2013, April). Using behavioral data to identify interviewer fabrication in surveys. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 2911--2920). ACM. Google ScholarDigital Library
- I. Schreiner et al. Interviewer falsification in census bureau surveys. ASA Section on Survey Research Methods, pages 491--496, 1988.Google Scholar
- Dell, N., Breit, N., Wobbrock, J. O., Borriello, G. 2013. Improving form-based data entry with image snippets. Proceedings of Graphics Interface (GI '13). Google ScholarDigital Library
- Patnaik, S., Brunskill, E., and Thies, W. 2009. Evaluating the accuracy of data collection on mobile phones: a study of forms, SMS, and voice. ICTD '09. Google ScholarDigital Library
- Chen, K., Chen, H., Conway, N., Hellerstein, J. M., & Parikh, T. S. (2011). Usher: Improving data quality with dynamic forms. Knowledge and Data Engineering, IEEE Transactions on, 23(8), 1138--1153. Google ScholarDigital Library
- Chen, K., Kannan, A., Yano, Y., Hellerstein, J. M., & Parikh, T. S. (2012, March). Shreddr: pipelined paper digitization for low-resource organizations. In Proceedings of the 2nd ACM Symposium on Computing for Development (p. 3). ACM Google ScholarDigital Library
Recommendations
“It cannot do all of my work”: Community Health Worker Perceptions of AI-Enabled Mobile Health Applications in Rural India
CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing SystemsRecent advances in Artificial Intelligence (AI) suggest that AI applications could transform healthcare delivery in the Global South. However, as researchers and technology companies rush to develop AI applications that aid the health of marginalized ...
Illustrating the Gaps and Needs in the Training Support of Community Health Workers in India
CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing SystemsIn India and other developing countries, Community Health Workers (CHWs) provide the first line of care in delivering necessary maternal and child health services. In this work, we assess the training and skill-building needs of CHWs, through a mobile-...
Supporting Community Health Workers in India through Voice- and Web-Based Feedback
CHI '17: Proceedings of the 2017 CHI Conference on Human Factors in Computing SystemsOur research aims to support community health workers (CHWs) in low-resource settings by providing them with personalized information regarding their work. This information is delivered through a combination of voice- and web-based feedback that is ...
Comments