Works

2025

Hierarchical Retrieval with Out-Of-Vocabulary Queries: A Case Study on SNOMED CT

Jon Dilworth, Hui Yang, Jiaoyan Chen, Yongsheng Gao

Submission to The Web Conference 2026 (WWW'26), Dubai, UAE · 2025 · Paper

Abstract

SNOMED CT is a biomedical ontology with a hierarchical representation, modelling terminological concepts at a large scale. Knowledge retrieval in SNOMED CT is critical for its application but often proves challenging due to linguistic ambiguity, synonymy, polysemy, and so on. This problem is exacerbated when the queries are out-of-vocabulary (OOV), i.e., lacking any equivalent matches in the ontology. In this work, we focus on the problem of hierarchical concept retrieval from SNOMED CT with OOV queries, and propose an approach driven by language model-based ontology embeddings. For evaluation, we construct OOV queries annotated against SNOMED CT concepts, testing the retrieval of the most direct subsumers and their less relevant ancestors. We find that our method outperforms the baselines including SBERT and two lexical matching methods. While evaluated against SNOMED CT, the approach is generalisable and can be extended to other ontologies. We release code, tools, and evaluation datasets at https://github.com/jonathondilworth/HR-OOV.

[PDF][arXiv][Code][Dataset]

Msc Thesis: Hierarchical Knowledge Retrieval using Transformer-based Ontology Embeddings in SNOMED CT

Jon Dilworth

Department of Computer Science, University of Manchester · 2025 · Thesis

Abstract

Many modern information systems rely on faithful knowledge retrieval to function effectively. For instance, large language models protect against hallucinations by leveraging techniques such as retrieval augmented generation (RAG). Meanwhile, the healthcare industry relies on structured knowledge bases, such as SNOMED CT, to aid decision support and to enable clinical reporting. Despite the central role that knowledge retrieval plays, LLMs continue to struggle with factual accuracy, and studies on effective retrieval for SNOMED CT remain limited. Motivated by these shortcomings, this work investigates the effectiveness of ontology-aware (hyperbolic) bi-encoders, focusing on the Hierarchy and Ontology Transformer frameworks (HiT and OnT, respectively). Through an investigation of ontology-grounded knowledge retrieval using SNOMED CT, we assess whether OnT-based retrieval improvements transfer to downstream tasks, including RAG-based BioMedical MCQA and web-based search. We construct an out-of-vocabulary (OOV) mention set using the MIRAGE benchmark, annotate gold target reference classes from SNOMED CT, and evaluate retrieval in single-target, multi-target and application-specific settings. We compare our results against strong lexical (TF–IDF, BM25) and contextual (Sentence-BERT) baselines, whilst evaluating potential for exploratory techniques such as mixed model spaces with heterogeneous curvature. For single-target retrieval, we find that the best-performing OnT models provide a 13-point gain in MRR over SBERT and more than double the relative performance when compared to lexical baselines. Similarly, in the multi-target setting, ontology encoders continue to outperform both lexical and contextual baselines, where a depth-biased subsumption score further improves mAP by 1–2 points compared to measures of pure geodesic distance (whilst eliciting minimal nDCG trade-off). Despite these performance improvements, single (top-1) concept retrieval applied to vanilla RAG for biomedical MCQA shows no significant accuracy gains on MIRAGE. Limitations likely stem from language model mismatch, short context length and insufficiencies tied to axiom verbalisation, suggesting that further work is required. We release a modular retrieval toolkit, annotated OOV queries, and a reproducible artefact to support future work, available at https://github.com/jonathondilworth/uom-thesis.

[PDF][Code][Dataset]

2015

BSc Thesis: Improving the Efficiency of Function Mapping Neural Networks using Hybrid Training Methodologies

Jon Dilworth

Department of Computer Science, University of Manchester · 2015 · Thesis

Abstract

Variations of artificial neural networks are prevalent in modern day technology and their applications range from analysing speech and identifying images to predicting stock market fluctuation. As this area of research continues to grow, it can be safely assumed that the capabilities associated with these technologies will become even more extensive. This report investigates the effects of using multiple training algorithms, specifically an evolutionary algorithm and the back-propagation algorithm, in an intermittent fashion in order to assess the training efficiency compared to adopting either training algorithm independently. Through the investigation of numerous mathematical models and by designing and implementing a platform capable of evaluating the efficiency of the training process, given any particular training algorithm, it was shown that the utilisation of a hybrid training methodology can greatly improve the efficiency of the training process.

[PDF]