MPG.eBooks - Staff View: A pilot study using machine learning and domain knowledge to facilitate comparative effectiveness review updating

Read Now

A pilot study using machine learning and domain knowledge to facilitate comparative effectiveness review updating

GLMnet performed better when identifying AE-relevant articles for the AAP review (sensitivity=0.981) than for the LBD review (0.685). When attempting to maximize sensitivity, GLMnet achieved high sensitivities (0.99 for AAP and 1.0 for LBD) while reducing projected screening by 55.4 percent (1990/35...

Full description

Bibliographic Details
Main Author:	Dalal, Siddhartha R.
Corporate Authors:	United States Agency for Healthcare Research and Quality, Southern California Evidence-Based Practice Center/RAND.
Format:	eBook
Language:	English
Published:	Rockville, MD Agency for Healthcare Research and Quality [2012], 2012
Series:	Methods research reports
Subjects:	Comparative Effectiveness Research Artificial Intelligence Review Literature As Topic
Online Access:	https://www.ncbi.nlm.nih.gov/books/NBK109161
Collection:	National Center for Biotechnology Information - Collection details see MPG.ReNa


LEADER	04132nam a2200337 u 4500
001	EB000943489
003	EBX01000000000000000737079
005	00000000000000.0
007	tu\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|\|
008	150223 r \|\|\| eng
100	1		\|a Dalal, Siddhartha R.
245	0	0	\|a A pilot study using machine learning and domain knowledge to facilitate comparative effectiveness review updating \|h Elektronische Ressource \|c investigators, Siddhartha R. Dalal ... [et al.]
260			\|a Rockville, MD \|b Agency for Healthcare Research and Quality \|c [2012], 2012
300			\|a PDF file (various pagings) \|b ill
505	0		\|a Includes bibliographical references
653			\|a Comparative Effectiveness Research
653			\|a Artificial Intelligence
653			\|a Review Literature as Topic
710	2		\|a United States \|b Agency for Healthcare Research and Quality
710	2		\|a Southern California Evidence-Based Practice Center/RAND.
041	0	7	\|a eng \|2 ISO 639-2
989			\|b NCBI \|a National Center for Biotechnology Information
490	0		\|a Methods research reports
500			\|a "September 2012."
856	4	0	\|u https://www.ncbi.nlm.nih.gov/books/NBK109161 \|3 Volltext \|n NLM Bookshelf Books \|3 Volltext
082	0		\|a 800
082	0		\|a 700
520			\|a GLMnet performed better when identifying AE-relevant articles for the AAP review (sensitivity=0.981) than for the LBD review (0.685). When attempting to maximize sensitivity, GLMnet achieved high sensitivities (0.99 for AAP and 1.0 for LBD) while reducing projected screening by 55.4 percent (1990/3591 articles for AAP) and 63.2 percent (4,454/7,051 for LBD). CONCLUSIONS: In this pilot study, we evaluated statistical classifiers that used previous classification decisions and key explanatory variables derived from MEDLINE indexing terms to predict inclusion decisions on two simulated comparative effectiveness review updates. The system achieved higher sensitivity in evaluating efficacy/effectiveness articles than in evaluating LBD AE articles. In the simulation, this prototype system reduced workload associated with screening updated search results for all relevant efficacy/effectiveness and AE articles by more than 50 percent with minimal or no loss of relevant articles.
520			\|a We then used the data to empirically derive statistical models (based on sparse generalized linear models with convex penalties [GLMnet] and gradient boosting machine [GBM]) that predicted inclusion in the AAP and LBD reviews. Finally, we evaluated performance on the 11,003 PubMed citations retrieved for the LBD and AAP updated reviews. MEASUREMENTS: Sensitivity (percentage of relevant citations corrected identified), positive predictive value (PPV, percentage of predicted relevant citations that were truly relevant), and workload reduction (percentage of screening avoided). RESULTS: GLMnet- and GBM-based models performed similarly, with GLMnet (results shown below) performing slightly better. The GLMnet-based model yielded sensitivities of 0.921 and 0.905 and PPVs of 0.185 and 0.102 when predicting articles relevant to the AAP and LBD efficacy/effectiveness analyses respectively (using a threshold of p ́Æ0.02).
520			\|a BACKGROUND: Comparative effectiveness reviews need to be updated frequently to maintain their relevance. Results of earlier screening efforts should be useful in reducing the screening of thousands of newer citations for articles relevant to efficacy/effectiveness and adverse effects (AEs). METHODS: We collected 14,700 PubMed(r) citation classification decisions from a 2007 comparative effectiveness review of interventions to prevent fractures in persons with low bone density (LBD). We also collected 1,307 PubMed citation classification decisions from a 2006 comparative effectiveness review of off-label uses of atypical anti-psychotic drugs (AAP). We first extracted explanatory variables from the MEDLINE(r) citation related to key concepts, including the intervention, outcome, and study design.
520			\|a After refinement, these document classification algorithms could help researchers maintain up-to-date reviews

A pilot study using machine learning and domain knowledge to facilitate comparative effectiveness review updating

Similar Items