ABSTRACT
Several prominent public health incidents caused by adverse drug events (ADEs) at the beginning of this century have raised awareness among governments and industry worldwide of pharmacovigilance (PhV), the science and activities of monitoring and preventing adverse events caused by pharmaceutical products after they are introduced to the market. A major data source for PhV is large-scale longitudinal observational databases (LODs), such as electronic health records (EHRs) and medical insurance claim databases. Inspired by the Multiple Self-Controlled Case Series (MSCCS) model, arguably the leading method for ADE discovery from LODs, we propose baseline regularization, a regularized generalized linear model that leverages the diverse health profiles available in LODs across different individuals at different times. We apply both the proposed method and MSCCS to the Marshfield Clinic EHR. Experimental results suggest that accounting for heterogeneity across patients and across time helps to improve performance in identifying benchmark ADEs from the Observational Medical Outcomes Partnership ground truth.
Index Terms
- Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data