Tagged: Health Informatics

Projects

  • Data Lakes for Pooling T1D Patients CGM Data to Enable EHR Integration and Prevent Data Silos: A Patient Pathway & Clinical Workflow Analysis — Modern developments in healthcare data collection and management aim to improve patient outcomes. For instance, consider type 1 diabetes (T1D) patients, who benefit from enhanced continuous glucose monitoring (CGM) devices. These devices can provide time-sensitive alerts and facilitate automated data collection for patient-practitioner retrospectives, greatly improving patients’ ability to manage their disease. However, CGM vendors often package their devices with proprietary software and rely on third-party infrastructure. This leads to interoperability challenges across clinical systems, fragmented workflows, and increased cognitive burden for clinicians, who grapple with a heterogeneous device landscape encumbered by data silos. This work aims to begin addressing these concerns by first analysing T1D patient data pathways through process modelling. By translating a traditional swimlane (business process) diagram to a formal graph-based representation, we propose a technique for identifying data-flow bottlenecks in complex multi-actor networks. Following this, we discuss an approach to centralised ingestion of CGM data based on a data lake-like architecture (inspired by the Scottish DataLoch initiative). We hope our investigation provides a refreshing perspective on some of the many issues arising from the fragmented nature of the healthcare industry.
  • Deep Learning, Sequencing Technologies & Polygenic Scores: Alzheimer’s Disease Risk Prediction and Classification Review — This work reviews traditional genome-wide association studies (GWAS) and weighted polygenic risk scores (PRS) as methods for predicting the onset of Alzheimer’s Disease (AD), then examines machine learning (ML) and deep learning (DL) approaches. Reviewed studies include the use of random forests, support vector machines, and various neural network architectures. We identify persistent challenges encountered throughout the survey, including dataset diversity, model explainability, and regulatory compliance. The work concludes by cautiously proposing a multi-phase framework for clinical adoption of selective ML and DL methods into existing NHS genomic testing pipelines over a seven-year timeline, emphasising quality control, SHAP-based interpretability, and robust validation before any scaled deployment.

← All tags