Select Publications
Journal articles
, 2021, 'The OpenDeID corpus for patient de-identification', Scientific Reports, 11, http://dx.doi.org/10.1038/s41598-021-99554-9
, 2021, 'A Retrospective Analysis using Deep-Learning Models for Prediction of Survival Outcome and Benefit of Adjuvant Chemotherapy in Stage II/III Colorectal Cancer', , http://dx.doi.org/10.48550/arxiv.2111.03532
, 2021, 'Characteristics and outcomes of over 300,000 patients with COVID-19 and history of cancer in the United States and Spain', Cancer Epidemiology Biomarkers and Prevention, 30, pp. 1884 - 1894, http://dx.doi.org/10.1158/1055-9965.EPI-21-0266
, 2021, 'Predicting length of stay and mortality among hospitalized patients with type 2 diabetes mellitus and hypertension', International Journal of Medical Informatics, 154, http://dx.doi.org/10.1016/j.ijmedinf.2021.104569
, 2021, 'Use of repurposed and adjuvant drugs in hospital patients with covid-19: Multinational network cohort study', BMJ, 373, http://dx.doi.org/10.1136/bmj.n1038
, 2021, 'Implementation of the COVID-19 vulnerability index across an international network of health care data sets: Collaborative external validation study', Jmir Medical Informatics, 9, http://dx.doi.org/10.2196/21547
, 2021, 'Quality assessment of real-world data repositories across the data life cycle: A literature review', Journal of the American Medical Informatics Association, 28, pp. 1591 - 1599, http://dx.doi.org/10.1093/jamia/ocaa340
, 2021, 'Cohort selection for construction of a clinical natural language processing corpus', Computer Methods and Programs in Biomedicine Update, 1, http://dx.doi.org/10.1016/j.cmpbup.2021.100024
, 2021, 'Moving with the Times: The Health Science Alliance (HSA) Biobank, Pathway to Sustainability', Biomarker Insights, 16, pp. 1 - 10, http://dx.doi.org/10.1177/11772719211005745
, 2021, 'Radical collaboration during a global health emergency: Development of the RDA COVID-19 data sharing recommendations and guidelines', Open Research Europe, 1, http://dx.doi.org/10.12688/openreseurope.13369.1
, 2021, 'COVID-19 in patients with autoimmune diseases: characteristics and outcomes in a multinational network of cohorts across three countries', Rheumatology
, 2021, 'From telehealth to virtual primary care in Australia? A Rapid Scoping Review', International Journal of Medical Informatics, pp. 104470 - 104470
, 2021, 'Primary Care Informatics Response to Covid-19 Pandemic: Adaptation, Progress, and Lessons from Four Countries with High ICT Development', Yearbook of Medical Informatics
, 2021, 'Quality assessment of real-world data repositories across the data life cycle: A literature review.', J. Am. Medical Informatics Assoc., 28, pp. 1591 - 1599
, 2020, 'Family history information extraction with neural attention and an enhanced relation-side scheme: Algorithm development and validation', Jmir Medical Informatics, 8, http://dx.doi.org/10.2196/21750
, 2020, 'mHealth for Integrated People-Centred Health Services in the Western Pacific: A Systematic Review', International Journal of Medical Informatics, 142, pp. 104259, http://dx.doi.org/10.1016/j.ijmedinf.2020.104259
, 2020, 'Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks', Medical Image Analysis, 65, http://dx.doi.org/10.1016/j.media.2020.101789
, 2020, 'Adoption of enterprise architecture for healthcare in AeHIN member countries', BMJ Health and Care Informatics, 27, http://dx.doi.org/10.1136/bmjhci-2020-100136
, 2020, 'Cohort Selection for Clinical Trials Using Multiple Instance Learning', Journal of Biomedical Informatics, pp. 103438 - 103438
, 2020, 'Crowdsourcing digital health measures to predict Parkinson’s disease severity: the Parkinson’s Disease Digital Biomarker DREAM Challenge', npj Digital Medicine
, 2020, 'Ethical Use of Electronic Health Record Data and Artificial Intelligence: Recommendations of the Primary Care Informatics Working Group of the International Medical Informatics Association', Yearbook of Medical Informatics
, 2019, 'Causal Relationships Among Pollen Counts, Tweet Numbers, and Patient Numbers for Seasonal Allergic Rhinitis Surveillance: Retrospective Analysis.', J Med Internet Res, 21, pp. e10450, http://dx.doi.org/10.2196/10450
, 2019, 'Artificial Intelligence in Primary Health Care: Perceptions, Issues, and Challenges', Yearbook of medical informatics, 28, pp. 041 - 046, http://dx.doi.org/10.1055/s-0039-1677901
, 2019, 'Comparison of the cohort selection performance of Australian Medicines Terminology to Anatomical Therapeutic Chemical mappings', Journal of the American Medical Informatics Association, 26, pp. 1237 - 1246
, 2019, 'Statistical principle-based approach for recognizing and normalizing microRNAs described in scientific literature', Database, 2019
, 2019, 'Statistical supervised meta-ensemble algorithm for medical record linkage', Journal of biomedical informatics, 95, pp. 103220 - 103220
, 2018, 'Behavior Change for Youth Drivers: Design and Development of a Smartphone-Based App (BackPocketDriver).', JMIR Form Res, 2, pp. e25, http://dx.doi.org/10.2196/formative.9660
, 2018, 'Assessing the severity of positive valence symptoms in initial psychiatric evaluation records: Should we use convolutional neural networks?', PloS one, 13, pp. e0204493 - e0204493
, 2018, 'Enhanced Functionalities for Annotating and Indexing Clinical Text with the NCBO Annotator+', Bioinformatics
, 2017, 'Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill.', JMIR Med Inform, 5, pp. e3, http://dx.doi.org/10.2196/medinform.6690
, 2017, 'Exploring associations of clinical and social parameters with violent behaviors among psychiatric patients', Journal of biomedical informatics, 75, pp. S149 - S159
, 2016, 'BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID', Database, 2016
, 2016, 'Feature engineering for recognizing adverse drug reactions from twitter posts', Information, 7, pp. 27 - 27
, 2016, 'Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion', Database, 2016
, 2016, 'MET network in PubMed: a text-mined network visualization and curation system', Database, 2016
, 2016, 'NTTMUNSW BioC modules for recognizing and normalizing species and gene/protein mentions', Database, 2016
, 2015, 'A context-aware approach for progression tracking of medical concepts in electronic medical records', Journal of biomedical informatics, 58, pp. S150 - S157
, 2015, 'Coronary artery disease risk assessment from unstructured electronic health records using text mining', Journal of biomedical informatics, 58, pp. S203 - S210
, 2015, 'Identification and progression of heart disease risk factors in diabetic patients from longitudinal electronic health records.', BioMed Research International, 2015, pp. 636371, http://dx.doi.org/10.1155/2015/636371
, 2014, 'Assessing video games to improve driving skills: a literature review and observational study', JMIR serious games, 2, pp. e5 - e5
, 2014, 'Data Sharing Challenges and Recommendations for Human Biorepositories: A Systematic Literature Review', The International Technology Management Review, 4, pp. 68 - 77
Conference Papers
, 2025, 'Deidentification and Temporal Normalization of the Electronic Health Record Notes Using Large Language Models: The 2023 SREDH/AI-Cup Competition for Deidentification of Sensitive Health Information', in Communications in Computer and Information Science, pp. 1 - 16, http://dx.doi.org/10.1007/978-981-97-7966-6_1
, 2025, 'Evaluation of OpenDeID Pipeline in the 2023 SREDH/AI-Cup Competition for Deidentification of Sensitive Health Information', in Communications in Computer and Information Science, pp. 107 - 119, http://dx.doi.org/10.1007/978-981-97-7966-6_8
, 2025, 'Utilizing Large Language Models for Privacy Protection and Advancing Medical Digitization', in Communications in Computer and Information Science, pp. 177 - 188, http://dx.doi.org/10.1007/978-981-97-7966-6_13
, 2024, 'Approaches for Evaluating Visit-to-Visit Blood Pressure Variability as a Cardiovascular Disease Risk Factor: A Scoping Review', in Studies in Health Technology and Informatics, pp. 349 - 353, http://dx.doi.org/10.3233/SHTI240417
, 2024, 'Preliminary Evaluation of Fine-Tuning the OpenDeLD Deidentification Pipeline Across Multi-Center Corpora', in Studies in Health Technology and Informatics, pp. 719 - 723, http://dx.doi.org/10.3233/SHTI240515
, 2024, 'Strategies to Address Statin Medication Intolerance Among Patients at Risk of Cardiovascular Disease Identified Through Electronic Health Records: A Literature Review and Pooled Analysis', in Studies in Health Technology and Informatics, pp. 132 - 136, http://dx.doi.org/10.3233/SHTI240362
, 2024, 'Association between Visit to Visit Blood Pressure Variability and Cardiovascular Outcome: A Meta-Analysis', in Studies in Health Technology and Informatics, pp. 262 - 266, http://dx.doi.org/10.3233/SHTI240149
, 2024, 'Strategies to Improve Statin Medication Adherence Among Patients at Risk of Cardiovascular Disease Identified Through Electronic Health Records: A Literature Review', in Studies in Health Technology and Informatics, pp. 986 - 990, http://dx.doi.org/10.3233/SHTI231112
, 2024, 'Visit-to-Visit Blood Pressure Variability in Cardiovascular Disease', in Studies in Health Technology and Informatics, pp. 1358 - 1359, http://dx.doi.org/10.3233/SHTI231193