Skip to content

Commit a96bc33

Browse files
authored
mres projects pages (#307)
* mres projects pages * remove reference * remove significant
1 parent 323484b commit a96bc33

11 files changed

Lines changed: 156 additions & 2 deletions

docs/images/mh-in-leeds.png

1.08 MB
Loading
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
---
2+
title: 'Understanding delays in elective pathways'
3+
summary: 'A case study of lithotripsy using NHS England data and process mining and machine learning techniques'
4+
tags: ['SECONDARY CARE', 'FORECASTING', 'PROCESS MINING', 'MODELLING', 'STRUCTURED DATA', 'PYTHON', 'PYSPARK', 'COMPLETE']
5+
---
6+
*This project was completed by Evelyn Koon, Analyst, in the Elective Analysis Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
## Background
9+
The UK’s National Health Service (NHS) provides publicly funded healthcare free at the point of use. In 2025, NHS England introduced the 'Reforming Elective Care for Patients' (NHS England and Department of Health and Social Care, 2025) initiative, aimed at enhancing the delivery of elective services and reducing waiting times. Urology was one of the first five specialties chosen for reform due to its large and growing waiting list (NHS England and Department of Health and Social Care, 2025). Using national data, this project examines patient pathways for Extracorporeal Shock Wave Lithotripsy (ESWL) within Urology, aiming to understand why some patients remained on the waiting list for over 18 weeks.
10+
11+
## Methods
12+
13+
NHS England has introduced a new dataset, the Elective Patient Pathway (EPP), designed to capture activities within the elective care journey. This project applied process mining techniques to the national dataset, extending the scope beyond a single site. Directly Follows Graphs (DFGs) were used with input from NHS experts, new variables were derived and further examined using machine learning techniques to evaluate the association of these new variables on elective waiting times. Additionally, conformance analysis was used to compare individual hospitals with the longest and shortest durations on the elective waiting list, highlighting variations in care processes.
14+
15+
## Results
16+
17+
Incorporating four of the additional activity types, which were non-lithotripsy appointments and assigning sequential identifiers to lithotripsy treatments (e.g., ESWL_1 or lit_1 for the first session), enabling the discovery of more meaningful care pathways. The machine learning models identified two of these as the most predictive features when they were tested. These were the other last appointment, which occurred last out of all non-lithotripsy appointments, and the first other outpatient routine appointment.
18+
Patients who waited less than 18 weeks generally followed similar sequences to those who waited longer, though extended delays were concentrated in the early stages of the pathway, particularly after being added to the waiting list. Conformance analysis across hospitals revealed that more complex pathways, with more activity, took longer to complete. Much of this extended activity occurred after patients left the waiting list, indicating the potential impacts of more complicated pathways on service provision.
19+
20+
## Conclusions
21+
22+
By combining national data with clinical expertise, this study uncovered variation in patient pathways, particularly for those with complex needs navigating the lithotripsy pathway. Process mining and machine learning revealed disparities in waiting list durations and activities that influence time a patient waited for their treatment, while conformance analysis highlighted variations in patient pathways. Grounding the dataset in real-world clinical practice ensured operational relevance to improve NHS performance. These insights can support frontline decision-making and offer promising applications in the NHS, especially with the development of new pathway datasets.
23+
24+
Output|Link
25+
---|---
26+
Code and Documentation - private while under development|[Link](https://github.com/nhsengland/PM4_elective_care.git)
27+
28+
#

docs/mres/ethnicity.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
---
2+
title: 'Investigating unknown ethnicity records in NHS emergency care data: patterns, predictors and missingness '
3+
summary: 'Supporting the improvement of ethnicity recording in emergency care data by: (a) investigating variation in unknown ethnicity records, (b) exploring patterns of missingness between ethnicity and other variables, and (c) identifying the variables most important in the recording of an unknown ethnicity. '
4+
tags: ['PUBLIC/POPULATION HEALTH', 'EMERGENCY CARE', 'CLASSIFICATION', 'MODELLING', 'STRUCTURED DATA', 'PYTHON', 'R', 'COMPLETE']
5+
---
6+
*This project was completed by Grace Dean, Senior Analyst, in the South East Data & Analytics Team Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
High quality ethnicity data is necessary for tackling health inequalities. This project looked at unknown ethnicity recording (where the ethnicity is either Null, Not Stated or Not Known) and was focused on three objectives:
9+
10+
1. Investigating variation in unknown ethnicity recording in ECDS across different organisation types, patient demographics and attendance characteristics.
11+
2. Looking at the patterns of missingness between ethnicity and other variables.
12+
3. Identifying the factors which most contribute to ethnicity being recorded as unknown.
13+
14+
## Main Findings
15+
16+
1. Variation in unknown ethnicity recording was observed across demographics and attendance characteristics, with younger patients, those requiring low-level emergency care and those with fewer attendances more likely to have an unknown ethnicity record. Differences were also observed across organisation type, with higher rates of unknown ethnicity recording in independent sector organisations and community trusts compared to acute trusts.
17+
2. Missing data in other ECDS variables was more common and more extensive when ethnicity was unknown. Increases in missingness were observed in variables relating to the investigations and treatments received by the patient, the severity of the patient’s condition, the chief complaint on arrival and place of discharge.
18+
3. Provider site code was identified as the most important variable in recording ethnicity as unknown, and a subset of acute trusts and independent sector organisations were identified as disproportionately contributing to unknown ethnicity records.
19+
20+
Output|Link
21+
---|---
22+
Code and Documentation - private while under development|[Link](https://github.com/nhsengland/ethnicity-coding-ECDS)
23+
24+
#

docs/mres/index.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,13 +8,19 @@ A showcase of some of these projects can be found in these pages.
88

99
- [Prediction of CVD Onset](cvd-onset.md)
1010
- [Supporting Dementia Diagnosis](dementia.md)
11+
- [Understanding Delays in Elective Pathways](elective-pathways-delays.md)
12+
- [Investigating unknown ethnicity records in NHS emergency care data](ethnicity.md)
1113
- [Predicting GP Staff Turnover](gp-turnover.md)
1214
- [Menopause-Related Diagnosis Coding](menopause.md)
15+
- [Environmental Impacts on Mental Health in Leeds](mh_leeds.md)
1316
- [Impact of midwife-led continuity of carer on birth outcomes](midwives.md)
1417
- [Predicting perinatal depression](perinatal.md)
1518
- [Poor pregnancy outcomes in women with type 2 diabetes](pregnancy-outcomes.md)
1619
- [Process Mining on Patient Pathways in Healthcare](process-mining.md)
20+
- [Relationship between Psychotropic Medication Usage and Talking Therapies Treatment Outcomes](psychotropic-meds-talking-therapies.md)
21+
- [Factors related to Stillbirth Outcomes](stillbirths.md)
1722
- [UTI Surgery Risk Predictions](uti.md)
23+
- [Predicting Winter Pressures on Emergency Admissions](winter-pressures.md)
1824

1925

2026
!!! note

docs/mres/mh_leeds.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
---
2+
title: 'Mental health and its relationship with noise and air pollution and the deprivation index in Leeds'
3+
summary: 'Mental health and its relationship with noise and air pollution and the deprivation index in Leeds at two geographical scales.'
4+
tags: ['MENTAL HEALTH', 'MACHINE LEARNING', 'LIKAGE', 'REGRESSIONS', 'GEOGRAPHICAL ANALYSIS', 'LINKAGE', 'RESEARCH', 'STRUCTURED DATA', 'PYTHON', 'SQL', 'GIS', 'COMPLETE']
5+
---
6+
*This project was completed by Chloe Peaker, Net Zero Estates Senior Insight Manager, in the Estates - Soft FM Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
![test](../images/mh-in-leeds.png)
9+
10+
There is a growing mental health crisis in the UK that is exacerbating an already strained healthcare system and there is also pressure for continuous growth leading to more and more urbanisation. This study therefore explores the relationship between environmental factors and mental health referrals for Leeds for the financial year 2023/24 using two geographical scales: 1 km square grid and Lower Super Output Areas (LSOAs).
11+
12+
This study shows that noise pollution alone showed a weak positive correlation when aggregated to 1 km but this was diminished when the same data was aggregated to LSOA level. A gender disparity was found warranting further investigation and potentially more proactive outreach to males. The analysis also highlighted key cleaning steps for the Patient Level Answering New Questions (PLANQ) dataset and that more expansive noise,
13+
air and Index of Multiple Deprivation (IMD), variables should be included for future analysis. As expected rates of mental health referrals were found to correlate with the IMD, though limitation in IMD granularity and outdated data raises concern for its use in transport and health policy.
14+
15+
Air pollutants including PM2.5, PM10, ozone, carbon monoxide, benzene, sulphur dioxide and nitrogen oxides, and noise pollution, were found to account for 16% of variance in mental health referrals, suggesting there is a correlation but no causal link was investigated, a geographic map of residual errors was also presented. These results offer valuable insights for future research particularly in a new geographic domain explored using geographical statistics. The results also hold relevance for policy development across both the healthcare and transport domain.
16+
17+
#
Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
---
2+
title: 'Exploring the relationship between psychotropic medication usage and Talking Therapies treatment outcomes: a retrospective cohort study'
3+
summary: 'Exploring medication use as a predictor of reliable recovery following completion of a course of Talking Therapies, using logistic regression and causal analysis'
4+
tags: ['PRIMARY CARE', 'MENTAL HEALTH', 'MACHINE LEARNING', 'MODELLING', 'RESEARCH', 'STRUCTURED DATA', 'R', 'PYSPARK', 'SQL', 'COMPLETE']
5+
---
6+
*This project was completed by CHRISTINE WELLS, Analyst, in the Modelling & System Analytics Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
NHS Talking Therapies (NHSTT) provide psychological support to people with anxiety and depressive disorders. The ideal outcome, following a completed course of treatment (minimum 2 sessions) is to achieve ‘reliable recovery’. Psychological therapy may sometimes be supplemented by prescribing psychotropic medication such as antidepressants. At each appointment patients are asked whether they are prescribed and taking any psychotropic medication, although no specific details of any medication is collected and responding is not mandatory. Whilst this information has sometimes been included as a predictor in machine learning-based analysis of therapeutic outcomes, there has been less focus on understanding how outcomes differ between medicated and non-medicated groups.
9+
10+
## Objectives:
11+
12+
* To use national NHSTT data to train and test a logistic regression model that included medication status (taking/not) as a predictor of the odds of achieving reliable recovery.
13+
* Use propensity score matching to undertake causal analysis of the impact of medication on reliable recovery.
14+
15+
## Design:
16+
A retrospective cohort study of 2½ years’ worth of patients who had completed treatment.
17+
18+
## Setting:
19+
Data was extracted from the NHS England NHSTT database for patients who had completed treatment between 1st January 2022 and 31st May 2025.
20+
21+
## Participants:
22+
The cohort was N = 1,120,839 for the logistic regression (669,142 = taking medication, 451,697 = not taking medication) and N = 54,802 for the matching analysis (N = 27,401 per group).
23+
24+
## Interventions:
25+
The exposure was whether patients were taking psychotropic medication during treatment or not, and whether they received high or low intensity therapy.
26+
27+
## Primary outcome measure:
28+
Whether patients achieved reliable recovery by completion of treatment. Results: Logistic regression on the cohort showed that medication was associated with reduced odds of recovery. Logistic regression on the matched groups suggested:
29+
30+
1. a small but beneficial effect of medication, although this result was non-significant, and
31+
2. taking medication in combination with high intensity therapy was more beneficial than high intensity therapy alone.
32+
33+
## Conclusions:
34+
The results support previous research regarding other predictors associated with increased odds of recovery, including number of treatment sessions and lower levels of deprivation. Whilst the medication information contained in the NHSTT dataset is limited, there is scope to expand this analysis through linkage to prescribing data. Enhancing our understanding of how medication status relates to therapeutic outcomes has important implications for interpretation of NHSTT recovery rates.
35+
36+
#

docs/mres/stillbirths.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,19 @@
1+
---
2+
title: 'Study Relationship Between Continuity of Midwife Care, Deprivation, Ethnicity, and Stillbirth Outcomes'
3+
summary: 'To investigate the association between a data-driven measure of care continuity, socioeconomic deprivation, ethnicity, and other key factors with the risk of stillbirth, using the Maternity Services Dataset maintained by NHS England.'
4+
tags: ['SECONDARY CARE', 'MACHINE LEARNING', 'CLASSIFICATION', 'EXPLAINABILITY', 'RESEARCH', 'STRUCTURED DATA', 'PYTHON', 'COMPLETE']
5+
---
6+
*This project was completed by Ketankumar Italiya, Senior Data Architect, in the Metadata, Data Quality & Standards Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
In my capacity as a Data Engineer, my previous experience working with the NHS Maternity Services Data Set (MSDS) and the research conducted by Chris Roebuck inspired me to pursue further investigation into this domain. I was further encouraged by the collaboration and sponsorship of two relevant senior staff. Their support provided the necessary confidence to move forward with this strategic initiative.
9+
10+
The analysis was conducted on a filtered cohort of 773,786 pregnancies (including 2,233 stillbirths) where the number of unique care activities was greater than two, ensuring a sufficient basis for continuity analysis.
11+
12+
We found a mix-trend relationship between the care continuity cohorts and the stillbirth rate. The stillbirth rate was lowest (0.186%) with 'Fragmented Care' (care from >6 midwives), rising to 0.339% with 'Good Continuity' (≤3 midwives) and peaking at 0.416% with 'Low Continuity' (≤6 midwives). The analysis by subgroups revealed that the impact of ethnicity, deprivation, and maternal age on stillbirth risk varied significantly across these continuity cohorts. For example, after multivariable adjustment, the odds ratio for stillbirth among Black women (compared to White women) was highest in the 'Low Continuity' cohort (OR 1.806). Predictive models achieved high accuracy (0.9972) but Area Under the Curve (AUC) is 0.500, indicating no predictive power is not helpful.
13+
14+
Output|Link
15+
---|---
16+
Published paper on the Impact of midwife continuity of carer on stillbirth rate and first feed in England
17+
|[Link](https://www.nature.com/articles/s43856-025-01025-z)
18+
19+
#

docs/mres/winter-pressures.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
---
2+
title: 'Predicting Winter Pressures on Emergency Admissions in England Using Machine Learning'
3+
summary: 'A 4-year national study across all NHS Trusts in England using Daily Emergency Care Data Set (ECDS) linked with UK Met Office weather data to predict winter pressures on Emergency admissions using Machine Learning to improve planning and response in the Emergency Department.'
4+
tags: ['SECONDARY CARE', 'EMERGENCY CARE', 'PUBLIC/POPULATION HEALTH', 'MACHINE LEARNING', 'RESEARCH', 'LINKAGE', 'MODELLING', 'STRUCTURED DATA', 'TIME SERIES', 'PYTHON', 'SQL', 'COMPLETE']
5+
---
6+
*This project was completed by Joby Jose, Senior Analytics Developer, in the Enterprise Data Visualisation Team, as part of the Data Science MRes at the University of Leeds.*
7+
8+
Winter pressures present a continued challenge to the National Health Service (NHS) in England, as cold weather causes an increase in Emergency Department (ED) admissions. This study investigates the relationship between winter weather conditions and ED admissions across all NHS trusts in England. It aims to quantify the influence of temperature, rainfall, and snowfall on ED admissions and assess the predictive value of weather features using machine learning.
9+
10+
Daily Emergency Care Data Set (ECDS) records were linked with UK Met Office weather data, assigning mean daily temperature, rainfall, and snowfall to each trust. Feature engineering included lagged weather variables, calendar effects, and public holidays. An Extreme Gradient Boosting model was trained and optimised using GridSearchCV with time-series cross-validation and interpreted using SHapley Additive exPlanations (SHAP).
11+
12+
The model achieved an RMSE of 18.78 and MAE of 10.6 admissions per trust per day. SHAP analysis showed that temperature, and its lagged values were the strongest predictors, with colder weather linked to higher admissions. Subgroup analysis showed diagnosis-specific temperature effects. Circulatory admissions increased 4-6 days after temperatures of 2-12°C; infectious diseases and injuries responded to shorter lags; and respiratory admissions peaked 7 days after temperatures below -2°C. Rainfall had a moderate influence and snowfall had a marginal influence. These findings support the integration of weather-based forecasting into NHS planning tools.
13+
14+
Output|Link
15+
---|---
16+
Code available on the GitHub: | MRes_Project/Predicting_Winter_Pressures.ipynb at main jobyjose6/MRes_Project
17+
18+
#

mkdocs.yml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -187,13 +187,19 @@ nav:
187187
- MRes Projects:
188188
- Prediction of CVD Onset: mres/cvd-onset.md
189189
- Supporting Dementia Diagnosis: mres/dementia.md
190+
- Understanding delays in elective pathways: mres/elective-pathways-delays.md
191+
- Investigating unknown ethnicity records in NHS emergency care data: mres/ethnicity.md
190192
- Predicting GP Staff Turnover: mres/gp-turnover.md
191193
- Menopause-Related Diagnosis Coding: mres/menopause.md
194+
- Mental Health Environmental Impacts Leeds: mres/mh_leeds.md
192195
- Impact of midwife-led continuity of carer on birth outcomes: mres/midwives.md
193196
- Predicting perinatal depression: mres/perinatal.md
194197
- Poor pregnancy outcomes in women with type 2 diabetes: mres/pregnancy-outcomes.md
195198
- Process Mining on Patient Pathways in Healthcare: mres/process-mining.md
199+
- Relationship between psychotropic medication usage and Talking Therapies treatment outcomes: mres/psychotropic-meds-talking-therapies.md
200+
- Relationship Between Factors and Stillbirth outcomes: mres/stillbirths.md
196201
- UTI Surgery Risk Predictions: mres/uti.md
202+
- Predicting Winter Pressures on Emergency Admissions: mres/winter-pressures.md
197203
- Articles:
198204
- articles/index.md
199205
- Playbooks: playbooks.md

utils/input.xlsx

26.4 KB
Binary file not shown.

0 commit comments

Comments
 (0)