research-article

Mining Free-Text Medical Notes for Suicide Risk Assessment

Authors:

Grigoris Antoniou,

Elissavet Greasidou,

Vincenzo Lagani,

Paulos Charonyktakis,

Ioannis TsamardinosAuthors Info & Claims

SETN '18: Proceedings of the 10th Hellenic Conference on Artificial Intelligence

Article No.: 47, Pages 1 - 8

https://doi.org/10.1145/3200947.3201020

Published: 09 July 2018 Publication History

Abstract

Suicide has been considered as an important public health issue for a very long time, and is one of the main causes of death worldwide. Despite suicide prevention strategies being applied, the rate of suicide has not changed substantially over the past decades. Advances in machine learning make it possible to attempt to predict suicide based on the analysis of relevant data to inform clinical practice. This paper reports on findings from the analysis of data of patients who died by suicide in the period 2013-2016 and made use of both structured data and free-text medical notes. We focus on examining various text-mining approaches to support risk assessment. The results show that using advance machine learning and text-mining techniques, it is possible to predict within a specified period which people are most at risk of taking their own life at the time of referral to a mental health service.

References

[1]

Marios Adamou, Grigoris Antoniou, Elissavet Greasidou, Vincenzo Lagani, Paulos Charonyktakis, Ioannis Tsamardinos, and Michael Doyle. {n. d.}. Towards Automatic Risk Assessment to Support Suicide Prevention. ({n. d.}). (to appear).

[2]

Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural language processing with Python: analyzing text with the natural language toolkit. " O'Reilly Media, Inc.".

Digital Library

[3]

David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022.

Digital Library

[4]

Giorgos Borboudakis, Taxiarchis Stergiannakos, Maria Frysali, Emmanuel Klontzas, Ioannis Tsamardinos, and George E Froudakis. 2017. Chemically intuited, large-scale screening of MOFs by machine learning techniques. npj Computational Materials 3, 1 (2017), 40.

[5]

Bernhard E Boser, Isabelle M Guyon, and Vladimir N Vapnik. 1992. A training algorithm for optimal margin classifiers. In Proceedings of the fifth annual workshop on Computational learning theory. ACM, 144--152.

Digital Library

[6]

Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32.

Digital Library

[7]

Leo Breiman, JH Friedman, Richard A Olshen, and Charles J Stone. 1984. Classification and Regression Trees. Wadsworth (1984).

[8]

Gregory Carter, Allison Milner, Katie McGill, Jane Pirkis, Navneet Kapur, and Matthew J Spittal. 2017. Predicting suicidal behaviours using clinical instruments: systematic review and meta-analysis of positive predictive values for risk scales. The British Journal of Psychiatry (2017), bjp-bp.

[9]

Andrea Fagiolini, Paola Rocca, Serafino De Giorgi, Edoardo Spina, Giovanni Amodeo, and Mario Amore. 2017. Clinical trial methodology to assess the efficacy/effectiveness of long-acting antipsychotics: Randomized controlled trials vs naturalistic studies. Psychiatry research 247 (2017), 257--264.

[10]

National Center for Health Statistics (US et al. 2017. Health, United States, 2016: with chartbook on long-term trends in health. (2017).

[11]

American Foundation for Suicide Prevention. 2017. Suicide Statistics. https://afsp.org/about-suicide/suicide-statistics/. (2017).

[12]

Beth Han, WilsonMCompton, Joseph Gfroerer, and Richard McKeon. 2014. Mental health treatment patterns among adults with recent suicide attempts in the United States. American journal of public health 104, 12 (2014), 2359--2368.

[13]

Arthur E Hoerl and Robert W Kennard. 1970. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 12, 1 (1970), 55--67.

[14]

Jeffrey Hyman, Robert Ireland, Lucinda Frost, and Linda Cottrell. 2012. Suicide incidence and risk factors in an active duty US military population. American journal of public health 102, S1 (2012), S138-S146.

[15]

Ronald C Kessler, LTC Christopher H Warner, LTC Christopher Ivany, Maria V Petukhova, Sherri Rose, Evelyn J Bromet, LTC Millard Brown III, Tianxi Cai, Lisa J Colpe, Kenneth L Cox, et al. 2015. Predicting US Army suicides after hospitalizations with psychiatric diagnoses in the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). JAMA psychiatry 72, 1 (2015), 49.

[16]

Ron Kohavi et al. 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai, Vol. 14. Montreal, Canada, 1137--1145.

Digital Library

[17]

Vincenzo Lagani, Giorgos Athineou, Alessio Farcomeni, Michail Tsagris, and Ioannis Tsamardinos. 2016. Feature selection with the r package mxm: Discovering statistically-equivalent feature subsets. arXiv preprint arXiv: 1611.03227 (2016).

[18]

Vincenzo Lagani, Giorgos Athineou, Alessio Farcomeni, Michail Tsagris, Ioannis Tsamardinos, et al. 2017. Feature Selection with the R Package MXM: Discovering Statistically Equivalent Feature Subsets. Journal of Statistical Software 80, i07 (2017).

[19]

Andrew Kachites McCallum. 2002. Mallet: A machine learning for language toolkit. (2002).

[20]

House of Commons Health Committee. 2017. Suicide prevention: Sixth Report. (2017).

[21]

Georgia Orfanoudaki, Maria Markaki, Katerina Chatzi, Ioannis Tsamardinos, and Anastassios Economou. 2017. MatureP: prediction of secreted proteins with exclusive information from their mature regions. Scientific reports 7, 1 (2017), 3263.

[22]

Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. Journal of machine learning research 12, Oct (2011), 2825--2830.

Digital Library

[23]

Chris Poulin, Brian Shiner, Paul Thompson, Linas Vepstas, Yinong Young-Xu, Benjamin Goertzel, Bradley Watts, Laura Flashman, and Thomas McAllister. 2014. Predicting the risk of suicide by analyzing the text of clinical notes. PloS one 9, 1 (2014), e85733.

[24]

Pooja Saini, David While, Khatidja Chantler, Kirsten Windfuhr, and Navneet Kapur. 2014. Assessment and management of suicide risk in primary care. Crisis: The Journal of Crisis Intervention and Suicide Prevention 35, 6 (2014), 415.

[25]

G Salton andMJ McGill. 1983. Introduction to modern information Philadelphia, PA. American Association for Artificial Intelligence retrieval. (1983).

Digital Library

[26]

Olympia Simantiraki, Paulos Charonyktakis, Anastasia Pampouchidou, Manolis Tsiknakis, and Martin Cooke. 2017. Glottal Source Features for Automatic Speech-based Depression Assessment. Proc. Interspeech 2017 (2017), 2700--2704.

[27]

Karen Sparck Jones. 1972. A statistical interpretation of term specificity and its application in retrieval. Journal of documentation 28, 1 (1972), 11--21.

[28]

Ioannis Tsamardinos, Elissavet Greasidou, and Giorgos Borboudakis. {n. d.}. Bootstrapping the Out-of-sample Predictions for Efficient and Accurate Cross-Validation. Machine Learning ({n. d.}). to appear.

[29]

Ioannis Tsamardinos, Amin Rakhshani, and Vincenzo Lagani. 2015. Performance-estimation properties of cross-validation-based protocols with simultaneous hyper-parameter optimization. International Journal on Artificial Intelligence Tools 24, 05 (2015), 1540023.

[30]

Sudhir Varma and Richard Simon. 2006. Bias in error estimation when using cross-validation for model selection. BMC bioinformatics 7, 1 (2006), 91.

[31]

Colin G Walsh, Jessica D Ribeiro, and Joseph C Franklin. 2017. Predicting risk of suicide attempts over time through machine learning. Clinical Psychological Science 5, 3 (2017), 457--469.

[32]

Eric Youngstrom, Oren Meyers, Jennifer Kogos Youngstrom, Joseph R Calabrese, and Robert L Findling. 2006. Comparing the effects of sampling designs on the diagnostic accuracy of eight promising screening algorithms for pediatric bipolar disorder. Biological Psychiatry 60, 9 (2006), 1013--1019.

[33]

Eric A Youngstrom. 2013. A primer on receiver operating characteristic analysis and diagnostic efficiency statistics for pediatric psychology: we are ready to ROC. Journal of pediatric psychology 39, 2 (2013), 204--221.

Cited By

Yoo DWoo HPendse SLu NBirnbaum MAbowd GDe Choudhury M(2024)Missed Opportunities for Human-Centered AI Research: Understanding Stakeholder Collaboration in Mental Health AI ResearchProceedings of the ACM on Human-Computer Interaction10.1145/36373728:CSCW1(1-24)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637372
Petsolari MIbrahim SSlovak P(2024)Socio-technical Imaginaries: Envisioning and Understanding AI Parenting Supports through Design FictionProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642619(1-27)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642619
Adekkanattu PFurmanchuk AWu YPathak APatra BBost SMorrow DWang GYang YForrest NLuo YWalunas TLo-Ciganic WGelad WBian JBao YWeiner MOslin DPathak J(2024)Deep learning for identifying personal and family history of suicidal thoughts and behaviors from EHRsnpj Digital Medicine10.1038/s41746-024-01266-77:1Online publication date: 28-Sep-2024
https://doi.org/10.1038/s41746-024-01266-7
Show More Cited By

Index Terms

Mining Free-Text Medical Notes for Suicide Risk Assessment
1. Applied computing
  1. Life and medical sciences
2. Computing methodologies
  1. Machine learning

Recommendations

Identification of Imminent Suicide Risk Among Young Adults using Text Messages
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems

Suicide is the second leading cause of death among young adults but the challenges of preventing suicide are significant because the signs often seem invisible. Research has shown that clinicians are not able to reliably predict when someone is at ...
Automatically estimating the incidence of symptoms recorded in GP free text notes
MIXHS '11: Proceedings of the first international workshop on Managing interoperability and complexity in health systems

The UK General Practice Research Database (GPRD) is a valuable source of information for health services research. It contains coded data supplemented by free text (physicians' notes and letters). However, due to the difficulty of extracting useful ...
Text Mining Applied to Electronic Medical Records: A Literature Review

The analysis of medical records is a major challenge, considering they are generally presented in plain text, have a very specific technical vocabulary and are nearly always unstructured. It is an interdisciplinary work that requires knowledge from ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

SETN '18: Proceedings of the 10th Hellenic Conference on Artificial Intelligence

July 2018

339 pages

ISBN:9781450364331

DOI:10.1145/3200947

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

EETN: Hellenic Artificial Intelligence Society
UOP: University of Patras
University of Thessaly: University of Thessaly, Volos, Greece

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 July 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

SETN '18

SETN '18: 10th Hellenic Conference on Artificial Intelligence

July 9 - 12, 2018

Patras, Greece

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
307
Total Downloads

Downloads (Last 12 months)20
Downloads (Last 6 weeks)1

Reflects downloads up to 06 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Yoo DWoo HPendse SLu NBirnbaum MAbowd GDe Choudhury M(2024)Missed Opportunities for Human-Centered AI Research: Understanding Stakeholder Collaboration in Mental Health AI ResearchProceedings of the ACM on Human-Computer Interaction10.1145/36373728:CSCW1(1-24)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637372
Petsolari MIbrahim SSlovak P(2024)Socio-technical Imaginaries: Envisioning and Understanding AI Parenting Supports through Design FictionProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642619(1-27)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642619
Adekkanattu PFurmanchuk AWu YPathak APatra BBost SMorrow DWang GYang YForrest NLuo YWalunas TLo-Ciganic WGelad WBian JBao YWeiner MOslin DPathak J(2024)Deep learning for identifying personal and family history of suicidal thoughts and behaviors from EHRsnpj Digital Medicine10.1038/s41746-024-01266-77:1Online publication date: 28-Sep-2024
https://doi.org/10.1038/s41746-024-01266-7
Ahmed TIvan SMunir AAhmed S(2024)Decoding depression: Analyzing social network insights for depression severity assessment with transformers and explainable AINatural Language Processing Journal10.1016/j.nlp.2024.1000797(100079)Online publication date: Jun-2024
https://doi.org/10.1016/j.nlp.2024.100079
Barajas Aranda DTorres Soto ATorres Soto MOchoa Ortiz Zezzatti C(2024)Mood-Based Prioritization Model in People with Suicidal Tendencies Using TopsisIntegrated Science for Sustainable Development Goal 310.1007/978-3-031-64288-3_7(133-152)Online publication date: 12-Dec-2024
https://doi.org/10.1007/978-3-031-64288-3_7
Thieme AHanratty MLyons MPalacios JMarques RMorrison CDoherty G(2023)Designing Human-centered AI for Mental Health: Developing Clinically Relevant Applications for Online CBT TreatmentACM Transactions on Computer-Human Interaction10.1145/356475230:2(1-50)Online publication date: 17-Mar-2023
https://dl.acm.org/doi/10.1145/3564752
Thomaidis GPapadimitriou KMichos SChartampilas ETsamardinos I(2023)A characteristic cerebellar biosignature for bipolar disorder, identified with fully automatic machine learningIBRO Neuroscience Reports10.1016/j.ibneur.2023.06.00815(77-89)Online publication date: Dec-2023
https://doi.org/10.1016/j.ibneur.2023.06.008
Aleem SHuda NAmin RKhalid SAlshamrani SAlshehri A(2022)Machine Learning Algorithms for Depression: Diagnosis, Insights, and Research DirectionsElectronics10.3390/electronics1107111111:7(1111)Online publication date: 31-Mar-2022
https://doi.org/10.3390/electronics11071111
Zhang TSchoene AJi SAnaniadou S(2022)Natural language processing applied to mental illness detection: a narrative reviewnpj Digital Medicine10.1038/s41746-022-00589-75:1Online publication date: 8-Apr-2022
https://doi.org/10.1038/s41746-022-00589-7
Tsamardinos ICharonyktakis PPapoutsoglou GBorboudakis GLakiotaki KZenklusen JJuhl HChatzaki ELagani V(2022)Just Add Data: automated predictive modeling for knowledge discovery and feature selectionnpj Precision Oncology10.1038/s41698-022-00274-86:1Online publication date: 16-Jun-2022
https://doi.org/10.1038/s41698-022-00274-8
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents