Available
2025_70
Start date:
October 2025
Project themes:
Main supervisor:
Research Fellow
Co-supervisor:
Dr Thomas Searle
Additional Information:
Background
Electronic Health Records (EHRs) contain a vast amount of valuable clinical information in both structured and unstructured formats. Extracting that information from the records can improve patient care, support clinical decision-making, and advance medical research. The CogStack platform [Noor et al., 2022] is an integrated information retrieval and extraction ecosystem which has been deployed in multiple large National Health Service (NHS) Foundation Trust hospitals in the UK. It enables various natural language processing (NLP) tasks to be built on top of it. Typical NLP tasks include Named-Entity Recognition and Linking (NER+L), Entity Relationship Extraction (ERE), Information Extraction (IE), Summarisation and De-identification.
In recent years, transformers [Lin et al., 2022] have revolutionised NLP by effectively capturing contextual relationships within text. For example, Foresignt [Kraljevic et al., 2024], a generative pretrained transformer, can forecast patient's trajectory using EHRs data. Given the diversity of the clinical NLP tasks, there is a significant opportunity to develop a unified framework that leverages transformer-based models to enhance the processing of EHR data.
Retrieval Augmented Generation (RAG) [Lewis et al., 2020] provides a practical approach to implementing this unified framework. RAG combines the strengths of retrieval-based methods and generative pretrained transformer models. This combination supplies the transformer model with specific context, improving the accuracy and relevance of the generated outputs.
Novelty & Importance
A unified transformer-based NLP framework, through RAG, would facilitate seamless integration of the NLP tasks within a single system. This would enhance efficiency and consistency across clinical applications. Such a framework could also improve the relevance of extracted clinical information. Ultimately, this integration would support better patient outcomes, inform clinical decision-making, and accelerate medical research by providing healthcare professionals with valuable insights derived from EHR data.
Aims & Objectives
The project aims to:
• develop a unified transformer-based NLP framework for EHRs by integrating multiple clinical NLP tasks
• enhance the usability and performance of NLP tasks using retrieval augmented generation (RAG)
• evaluate and validate the framework in real-world clinical settings within the CogStack platform
References
Noor, K., Williams, R. J., O’Brien, N., et al. (2022). CogStack—Open source information retrieval and extraction platform for healthcare data. Journal of Biomedical Informatics, 123, 103934.
Lin, T., Wang, Y., Liu, X., & Qiu, X. (2022). A survey of transformers. AI open, 3, 111-132.
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, 33, 9459-9474.
Kraljevic, Z., Bean, D., Shek, A., Bendayan, R., Hemingway, H., Yeung, J. A., ... & Dobson, R. J. (2024). Foresight—a generative pretrained transformer for modelling of patient timelines using electronic health records: a retrospective modelling study. The Lancet Digital Health, 6(4), e281-e290.
We are now accepting applications for 1 October 2025
Candidates should possess or be expected to achieve a 1st or upper 2nd class degree in a relevant subject including the biosciences, computer science, mathematics, statistics, data science, chemistry, physics, and be enthusiastic about combining their expertise with other disciplines in the field of healthcare.
Important information for International Students:
It is the responsibility of the student to apply for their Student Visa. Please note that the EPSRC DRIVE-Health studentship does not cover the visa application fees or the Immigration Health Surcharge (IHS) required for access to the National Health Service. The IHS is mandatory for anyone entering the UK on a Student Visa and is currently £776 per year for each year of study. Further detail can be found under the International Students tab below.
Closing date: 30 January 2025 (23:59 hrs BST)
Create an account with King’s Apply.
Apply to the EPSRC DRIVE-Health: Centre for Doctoral Training in Data-Driven Health MPhil/PhD (Full-time).
Please ensure you read the full information required on our Apply page, particularly relating to Personal Statement and Supporting Information.
Complete the following sections of the application with all the relevant information.
A PDF copy of your CV should be uploaded to the Employment History section.
A 500-word personal statement outlining your motivation for undertaking postgraduate research with the CDT should be uploaded to the Supporting Statement section.
Funding:
Please choose Option 5 "I am applying for a funding award or scholarship administered by King’s College London" in the funding section.
Under "Award Scheme Code or Name" enter "EPSRC DRIVE-Health 2025".
Failing to include one of these codes might result in you not being considered for funding.
Questions marked * are mandatory and you will not be able to submit without answering.
Non-EU international applicants are advised that ATAS may be required. While there is no charge to apply for ATAS, processing can take up to 3 months. Please read the Important Information for International Students.
Enhanced Studentships to Attract Top Talent
Each studentship is fully funded for 4 years.
This includes tuition fees, a stipend and a generous allowance for project consumables.
Tuition Fees: these will be covered for both Home and International students.
Stipend: students will receive a tax-free living allowance of £23,814 per year (current projection for Academic Year 2025/26).
Research Training Support Grant (RTSG): up to £20,000 over 4 years for research consumables and attending national and international conferences.
Important Information for International Students
It is the responsibility of the student to apply for their Student Visa.
Please note that the EPSRC DRIVE-Health studentship does not cover the visa application fees or the Immigration Health Surcharge (IHS) required for access to the National Health Service. The IHS is mandatory for anyone entering the UK on a Student Visa and is currently £776 per year for each year of study.
Additionally, depending on your chosen project, some nationals may need to apply for an Academic Technology Approval Scheme (ATAS) certificate prior to applying for a visa. The ATAS application process can take up to 3 months and so it is essential that you apply for this early. Please note the following:
• If you need to apply for a student visa, you cannot submit your visa application until your ATAS certificate has been issued.
• If you are applying for any other visa, you cannot enrol at King’s and start your programme unless your ATAS certificate has been issued.
• If you apply late, you may not be able to join on the expected entry point and your registration may be postponed
Please review the following article for further information on the ATAS certificate and how to apply: label="" type="url" target="_blank" href="https://self-service.kcl.ac.uk/article/KA-01847/en-us" data-runtime-url="https://self-service.kcl.ac.uk/article/KA-01847/en-us">Do I need ATAS clearance before I start my course at King's?
For further advice, please contact the Visas & International Student Advice as soon as possible.
Academic Requirements and Eligibility
We welcome eligible Home and International applicants from any personal background who are pleased to join diverse and friendly research groups.
Open to Home and International applicants.
Applicable level of study: Postgraduate research.
English Language Requirements (Band D)
Based on the IELTS test scoring system, this programme requires that successful candidates achieve the following level of English before enrolling. Successful applicants’ offer letters will include information about when they must have achieved this standard.
Overall: 6.5
Listening: 6
Speaking: 6
Reading: 6
Writing: 6
Visit our admissions webpages to view our English language entry requirements.
For any other questions about the recruitment process, please email us at
EPSRC DRIVE-Health Centre for Doctoral Training in Data-Driven Health