research-article

Open Access

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data

Authors:
Zhaobin Kuang

University of Wisconsin-Madison, Madison, WI, USA

University of Wisconsin-Madison, Madison, WI, USA
View Profile

,
Peggy Peissig

Marshfield Clinic, Marshfield, WI, USA

Marshfield Clinic, Marshfield, WI, USA
View Profile

,
Vitor Santos Costa

Universidade do Porto, Porto, Portugal

Universidade do Porto, Porto, Portugal
View Profile

,
Richard Maclin

University of Minnesota-Duluth, Duluth, MN, USA

University of Minnesota-Duluth, Duluth, MN, USA
View Profile

,
David Page

University of Wisconsin-Madison, Madison, USA

University of Wisconsin-Madison, Madison, USA
View Profile

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data MiningAugust 2017Pages 1537–1546https://doi.org/10.1145/3097983.3097998

Published:13 August 2017Publication History

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 1537–1546

ABSTRACT

Several prominent public health incidents that occurred at the beginning of this century due to adverse drug events (ADEs) have raised international awareness of governments and industries about pharmacovigilance (PhV), the science and activities to monitor and prevent adverse events caused by pharmaceutical products after they are introduced to the market. A major data source for PhV is large-scale longitudinal observational databases (LODs) such as electronic health records (EHRs) and medical insurance claim databases. Inspired by the Multiple Self-Controlled Case Series (MSCCS) model, arguably the leading method for ADE discovery from LODs, we propose baseline regularization, a regularized generalized linear model that leverages the diverse health profiles available in LODs across different individuals at different times. We apply the proposed method as well as MSCCS to the Marshfield Clinic EHR. Experimental results suggest that incorporating the heterogeneity among different patients and different times help to improve the performance in identifying benchmark ADEs from the Observational Medical Outcomes Partnership ground truth

Supplemental Material

kuang_baseline_regularization.mp4

mp4

415.8 MB

Download

References

Laurent Condat. 2013. A Direct Algorithm for 1D Total Variation Denoising. IEEE Signal Processing Letters (2013).Google Scholar
P Laurie Davies and Arne Kovac 2001. Local Extremes, Runs, Strings and Multiresolution. Annals of Statistics (2001).Google Scholar
Steven Findlay. 2015. Health policy briefs: The FDA's Sentinel Initiative. Health Affiaris (2015).Google Scholar
Jerome Friedman, Trevor Hastie, and Rob Tibshirani. 2010. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software (2010).Google Scholar
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press. shownotehttp://www.deeplearningbook.org.Google ScholarDigital Library
Rave Harpaz, William DuMochel, and Nigam H Shah. 2015. Big Data and Adverse Drug Reaction Detection. Clinical Pharmacology & Therapeutics (2015).Google Scholar
Rave Harpaz, William DuMouchel, Nigam H Shah, David Madigan, Patrick Ryan, and Carol Friedman. 2012. Novel Data-Mining Methodologies for Adverse Drug Event Discovery and Analysis. Clinical Pharmacology & Therapeutics (2012).Google Scholar
George Hripcsak, Jon D Duke, Nigam H Shah, Christian G Reich, Vojtech Huser, Martijn J Schuemie, Marc A Suchard, Rae Woong Park, Ian Chi Kei Wong, Peter R Rijnbeek, and others 2015. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers. Studies in Health Technology and Informatics (2015).Google Scholar
Nicholas A Johnson. 2013. A Dynamic Programming Algorithm for the Fused Lasso and l0-Segmentation. Journal of Computational and Graphical Statistics (2013).Google Scholar
Zhaobin Kuang, James Thomson, Michael Caldwell, Peggy Peissig, Ron Stewart, and David Page. 2016. Baseline Regularization for Computational Drug Repositioning with Longitudinal Observational Data. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16).Google ScholarDigital Library
David Madigan, Nandini Raghavan, William Dumouchel, Martha Nason, Christian Posse, and Greg Ridgeway 2002. Likelihood-Based Data Squashing: A Modeling Approach to Instance Construction. Data Mining and Knowledge Discovery (2002).Google Scholar
David Madigan, Martijn J Schuemie, and Patrick B Ryan. 2013. Empirical Performance of the Case--Control Method: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013). Google ScholarCross Ref
Tom M Mitchell. 1997. Machine Learning (bibinfoedition1 ed.). MGH.Google Scholar
Kevin P Murphy. 2012. Machine Learning: a Probabilistic Perspective. MIT Press.Google ScholarDigital Library
Yu Nesterov. 2012. Efficiency of Coordinate Descent Methods on Huge-Scale Optimization Problems. SIAM Journal on Optimization (2012).Google Scholar
G Niklas Norén, Tomas Bergvall, Patrick B Ryan, Kristina Juhlin, Martijn J Schuemie, and David Madigan 2013. Empirical Performance of the Calibrated Self-Controlled Cohort Analysis within Temporal Pattern Discovery: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013).Google Scholar
Javier Pena and Ryan Tibshirani 2016. Lecture Notes in Machine Learning 10--725/Statistics 36--725-Convex Optimization (Fall 2016). (2016).Google Scholar
Valerie Powell, Franklin M Din, Amit Acharya, and Miguel Humberto Torres-Urquidy 2012. Integration of Medical and Dental Care and Patient Data. Springer Science & Business Media.Google Scholar
Aaditya Ramdas and Ryan J Tibshirani 2015. Fast and Flexible ADMM Algorithms for Trend Filtering. Journal of Computational and Graphical Statistics (2015).Google Scholar
Melissa A Robb, Judith A Racoosin, Rachel E Sherman, Thomas P Gross, Robert Ball, Marsha E Reichman, Karen Midthun, and Janet Woodcock. 2012. The US Food and Drug Administration's Sentinel Initiative: Expanding the Horizons of Medical Product Safety. Pharmacoepidemiology and Drug Safety (2012).Google Scholar
Patrick B Ryan, David Madigan, Paul E Stang, J Marc Overhage, Judith A Racoosin, and Abraham G Hartzema 2012. Empirical Assessment of Methods for Risk Identification in Healthcare Data: Results from the Experiments of the Observational Medical Outcomes Partnership. Statistics in Medicine (2012).Google Scholar
Patrick B Ryan, Martijn J Schuemie, Susan Gruber, Ivan Zorych, and David Madigan 2013. Empirical Performance of a New User Cohort Method: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013).Google Scholar
Patrick B Ryan, Martijn J Schuemie, and David Madigan. 2013. Empirical Performance of a Self-Controlled Cohort Method: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013).Google Scholar
Martijn J Schuemie, David Madigan, and Patrick B Ryan. 2013. Empirical Performance of LGPS and LEOPARD: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013).Google Scholar
Martijn J Schuemie, Gianluca Trifirò, Preciosa M Coloma, Patrick B Ryan, and David Madigan. 2016. Detecting Adverse Drug Reactions Following Long-Term Exposure in Longitudinal Observational Data: The Exposure-Adjusted Self-Controlled Case Series. Statistical Methods in Medical Research Vol. 25, 6 (2016), 2577--2592.Google ScholarCross Ref
Shawn E Simpson. 2011. Self-Controlled Methods for Postmarketing Drug Safety Surveillance in Large-Scale Longitudinal Data. Dissertation. Columbia University.Google Scholar
Shawn E Simpson, David Madigan, Ivan Zorych, Martijn J Schuemie, Patrick B Ryan, and Marc A Suchard 2013. Multiple Self-Controlled Case Series for Large-Scale Longitudinal Observational Databases. Biometrics (2013).Google Scholar
Suvrit Sra, Sebastian Nowozin, and Stephen J Wright. 2012. Optimization for Machine Learning. Mit Press.Google ScholarDigital Library
Marc A Suchard, Shawn E Simpson, Ivan Zorych, Patrick Ryan, and David Madigan 2013natexlaba. Massive Parallelization of Serial Inference Algorithms for a Complex Generalized Linear Model. ACM Transactions on Modeling and Computer Simulation (TOMACS) (2013).Google Scholar
Marc A Suchard, Ivan Zorych, Shawn E Simpson, Martijn J Schuemie, Patrick B Ryan, and David Madigan 2013. Empirical Performance of the Self-Controlled Case Series Design: Lessons for Developing a Risk Identification and Analysis System. Drug Safety (2013).Google Scholar
Robert Tibshirani, Michael Saunders, Saharon Rosset, Ji Zhu, and Keith Knight 2005. Sparsity and Smoothness via the Fused Lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology) (2005).Google Scholar
Paul Tseng. 2001. Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization. Journal of Optimization Theory and Applications (2001).Google Scholar
Stephen J Wright. 2015. Coordinate Descent Algorithms. Mathematical Programming (2015).Google Scholar
Stanley Xu, Chan Zeng, Sophia Newcomer, Jennifer Nelson, and Jason Glanz 2012. Use of Fixed Effects Models to Analyze Self-Controlled Case Series Data in Vaccine Safety Studies. Journal of Biometrics & Biostatistics (2012).Google Scholar
Tuo Zhao, Mo Yu, Yiming Wang, Raman Arora, and Han Liu 2014. Accelerated Mini-Batch Randomized Block Coordinate Descent Method Advances in Neural Information Processing Systems.Google Scholar

Index Terms

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data
1. Applied computing
  1. Life and medical sciences
    1. Health care information systems
    2. Health informatics
2. Mathematics of computing
  1. Probability and statistics
    1. Statistical paradigms
      1. Regression analysis

Recommendations

Data mining methodologies for pharmacovigilance

Medicines are designed to cure, treat, or prevent diseases; however, there are also risks in taking any medicine - particularly short term or long term adverse drug reactions (ADRs) can cause serious harm to patients. Adverse drug events have been ...
Read More
Users' Perception towards the "Safe Medication through Pharmacovigilance and Compliance Monitoring Pharmacov" Service

A feasibility study was conducted to evaluate the acceptability and effectiveness of the "Safe medication through pharmacovigilance and compliance monitoring PharmacoV" service, an Internet-based interactive information tool that assists physicians in ...
Read More
A research framework for pharmacovigilance in health social media

Display Omitted A research framework for patient reported adverse drug event extraction.Experiments conducted on posts from major diabetes and heart disease forums in US.Each component significantly contributes to the frameworks overall effectiveness. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
2240 pages
ISBN:9781450348874
DOI:10.1145/3097983
General Chairs:
Stan Matwin
Dalhousie University
,
Shipeng Yu
LinkedIn
,
Faisal Farooq
IBM
Copyright © 2017 Owner/Author
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike International 4.0 License.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 August 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
adverse drug event discovery
baseline regularization
electronic health records
longitudinal data
pharmacovigilance
Qualifiers
- research-article
Conference

Acceptance Rates
KDD '17 Paper Acceptance Rate64of748submissions,9%Overall Acceptance Rate1,133of8,635submissions,13%
More
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

KDD '24: The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 839
  Total Downloads
- Downloads (Last 12 months)44
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Data mining methodologies for pharmacovigilance

Users' Perception towards the "Safe Medication through Pharmacovigilance and Compliance Monitoring Pharmacov" Service

A research framework for pharmacovigilance in health social media

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Data mining methodologies for pharmacovigilance

Users' Perception towards the "Safe Medication through Pharmacovigilance and Compliance Monitoring Pharmacov" Service

A research framework for pharmacovigilance in health social media

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media