Aaron Jun Yi Yap

and 6 more

Background: Bleeding is an important health outcome of interest in epidemiological studies. We aimed to develop and validate rule-based algorithms to identify major bleeding and all bleeding within real-world electronic healthcare data. Methods: We took a random sample (n=1630) of patient admissions to Singapore public hospitals in 2019 and 2020, stratifying by hospital and year of admission. We adopted the International Society on Thrombosis and Haemostasis definition for major bleeding. Presence of major bleeding and all bleeding was ascertained by two annotators through chart review. A total of 630 and 1,000 records were used for algorithm development and validation, respectively. We formulated two algorithms: sensitivity- and positive predictive value (PPV)-optimized algorithms. A combination of hemoglobin test patterns and diagnosis codes were used in the final algorithms. Results: During validation, diagnosis codes alone yielded low sensitivities for major bleeding (0.14) and all bleeding (0.24), although specificities and PPV were high (>0.97). For major bleeding, the sensitivity-optimized algorithm had much higher sensitivity and negative predictive values (NPV) (sensitivity=0.94, NPV=1.00), however false positive rates were also relatively high (specificity=0.90, PPV=0.34). PPV-optimized algorithm had improved specificity and PPV (specificity=0.96, PPV=0.52), with little reduction in sensitivity and NPV (sensitivity=0.88, NPV=0.99). For all bleeding events, our algorithms had less optimal performances, with lower sensitivities (0.53 to 0.61). Conclusions: The use of diagnosis codes alone misses many genuine major bleeding events. We have developed major bleeding algorithms with high sensitivities which can be used in conjunction with chart reviews to ascertain events within populations of interest.

Hui Xing Tan

and 6 more

Aim: To assess the feasibility of converting electronic medical records (EMR) into the Observational Medical Outcomes Partnership-Common Data Model (OMOP-CDM) schema and potential for subsequent analyses relating to drug safety. Methods: The EMRs belonging to a tertiary care facility from 2013 to 2016 were mapped onto the OMOP-CDM schema. Vocabulary mappings were applied to translate source data values into OMOP-CDM terminologies. Existing analytic codes from a previous study were modified and applied to conduct an illustrative analysis involving oral anticoagulants (OACs) to mimic analyses that may be part of a typical benefit-risk assessment. A novel visualization is proposed to represent comparative efficacy, safety and utilization in one chart. Results: Records of 245,561 unique patients were mapped onto the OMOP-CDM. The CDM and analytic code templates simplified the data analysis for the illustrative example. Of 132 patients identified, a majority were warfarin users (76.5%), followed by rivaroxaban (19.7%) and apixaban (3.8%). Following six months of follow up, differences in cumulative incidence of bleeding and thromboembolic events were observable. The proposed visualization may facilitate collective evaluation of differences relating to utilization, efficacy and safety of drugs of interest. Conclusion: OMOP-CDM conversion of RWD may be useful for gleaning insights on comparative drug utilization, efficacy and safety for risk-benefit assessments in post-market regulatory settings.