Development of a clinical deci

Development of a clinical decision support tool for Primary care Management of lower Urinary tract Symptoms in men: the PriMUS study

Edwards A, Agarwal R, Bates J, Bray A, Milosevic S, Thomas-Jones E, Drinnan M, Drake M, Michell P, Pell B, Ahmed H, Joseph-Williams N, Hood K, Takwoingi Y, Harding C

Record ID 32018013851

English

Authors' objectives: Lower urinary tract symptoms particularly affect older men and their quality of life. General practitioners currently have no easily available assessment tools to diagnose lower urinary tract symptom causes. Referrals to urology specialists are increasing. General practitioner access to simple, accurate tests and clinical decision tools could facilitate management of lower urinary tract symptoms in primary care. To determine which of several index tests in combination, best predicted three diagnoses (detrusor overactivity, bladder outlet obstruction and/or detrusor underactivity) in men presenting with lower urinary tract symptoms in primary care. To develop and validate three diagnostic prediction models, and a prototype primary care clinical decision support tool. Lower urinary tract symptoms (LUTS) particularly affect older men and can lead to poor quality of life, often referred to as the degree of ‘bother’ experienced. General practitioners (GPs) currently have no easily available assessment tools to effectively diagnose the causes of LUTS and aid discussion of treatment with patients. Men are increasingly referred to urology specialists who often recommend treatments that could have been initiated in primary care. GP access to simple, accurate tests and clinical decision tools could facilitate faster and more effective patient management of LUTS in primary care. The reference standard test for investigation of LUTS, and thus diagnosis of detrusor overactivity (DO), bladder outlet obstruction (BOO) and detrusor underactivity (DU), is invasive urodynamics, which takes place in secondary care. However, National Institute for Health and Care Excellence (NICE) guidelines suggest that many men referred to specialist care with LUTS are eventually managed conservatively, and so could have remained within primary care. Further, GPs do not have access to validated clinical decision tools giving an indication of the most likely cause of symptoms to guide treatment and management. Making such a tool available should improve treatment efficacy, standardise treatment, reduce unnecessary referrals, expedite referral of those requiring specialist care, and thus improve cost-effectiveness of NHS care. A primary care-based clinical decision support tool with defined accuracy would, firstly, mean that the men could undergo the necessary simple tests straightaway, organised through the GP surgery and, secondly, would get a quicker result regarding predicted diagnosis and choice of management options that are most likely to be effective. For GPs, a clinical decision support tool could allay uncertainty around both diagnosis and best management. The primary objectives of the Primary care Management of lower Urinary tract Symptoms (PriMUS) study were to: Develop statistical models to predict the likelihood of each urological condition (BOO, DU and DO), based on a series of non-invasive index tests, with urodynamics as the reference standard. Estimate the diagnostic accuracy of the above models in an independent validation cohort. The study incorporated an internal pilot phase with ‘Stop/go’ criteria that included quantitative and qualitative assessments. The progression criteria were designed to allow for mitigating strategies to be discussed to allow for some adaptation to recruitment processes in the main study. The secondary objectives were to: Develop a series of patient management recommendations and thresholds for clinically useful diagnostic prediction by expert consensus and with reference to current clinical guidelines that map to the diagnoses predicted by the statistical model. Combine the statistical model and management recommendations into an online tool that will form the prototype clinical decision support tool. Complete a qualitative study to explore the feasibility of introducing the clinical decision support tool into primary care, including potential acceptability to primary care staff and patients. Collect NHS costs involved in delivering the new pathway and compare with cost of standard pathway calculated from NHS and other sources. Methods Men presenting to their GP with LUTS were recruited prospectively from GP practices in Bristol, Newcastle upon Tyne and Wales. Participants underwent a series of simple index tests and the invasive reference standard (urodynamics). To determine which index tests used in combination best predicted three urodynamic observations (BOO, DU and DO) in men, diagnostic prediction models were developed for each target condition using logistic regression modelling. Multiple imputation by chain equations was used to handle missing data and fractional polynomial functions were used to fit continuous variables. The discriminative ability of the models was assessed using the c-index, and calibration was assessed using calibration plots and the calibration slope. Internal validation was conducted to assess optimism of the performance statistics using the bootstrapping procedure. External validation was conducted to assess model performance in another sample from a similar population as the development cohort. In both forms of validation, the models were recalibrated using the calibration slope as a shrinkage factor to re-estimate the intercept and model coefficients. Sensitivity and specificity were plotted on a receiver operating characteristic plot for each model. Risk thresholds were identified at a sensitivity and specificity of 75%, which was deemed to be the minimum clinically useful performance. Sensitivity analyses were performed by fitting two alternative models for each target condition. In sensitivity analysis 1, predictors that may be difficult to obtain in practice were excluded from the list of candidate predictors (mean urgency score and mean 24-hour fluid intake from the bladder diary) and in sensitivity analysis 2 (SA2), alternative measures or methods of measurement of candidate predictors were considered. Urology specialists (target n = 15–20) were invited to take part in the process of developing management recommendations to inform the clinical decision support tool. Interviews and questionnaires were used to establish how the urologists would manage a number of different commonly encountered clinical scenarios, focusing on the thresholds at which they would recommend treatment and the strategies they would use when multiple urodynamic abnormalities are diagnosed or suggested. Scenarios were informed by real-life data generated by study participants. Feedback from the interviews and questionnaire were collated, and Study Management Group members with a background in urology then considered this in conjunction with available evidence/guidelines to inform the development of the draft management recommendations. The prototype tool is available at PriMUS decision aid for LUTS (shinyapps.io). This is a draft and not yet intended to be used in practice (details of evaluation to date under next objective). The GP is asked to enter data relating to demographics, initial questionnaire and uroflowmetry results and any previous treatments that have been used. A ‘Statistics’ summary of findings is provided, along with further pages with graphical displays (bar charts and crowd figures) of diagnostic probabilities, and management recommendations in accordance with NICE guidance. It is intended that the GP and patient may review these later pages (displays and recommendations) together to move towards a shared decision of management choice. To explore GPs’ experiences of managing LUTS, together with patients’ experiences of and preferences for treatment in primary care, 25 patients and 11 GPs were purposively sampled from 20 GP practices in 3 UK regions. We also conducted initial user-testing of the prototype decision support tool with GPs. Participants were asked to try out the prototype tool prior to the interview so that they were able to give feedback on design and ease of use. We also conducted a simulated consultation workshop involving the study management GPs (AE and HA) and patient and public contributors (n = 3). Field notes were taken during this workshop. A framework approach was used to analyse interview data. There were four main themes concerning treatment of LUTS in primary care: unresolved symptoms, preference for primary care, satisfaction with involvement in decision-making, and challenges of managing LUTS in primary care. Our findings emphasise the importance of LUTS being managed in primary care where possible, as in addition to cost savings and reduced waiting times, this is a more accessible option for patients, who tend to be more comfortable and confident being treated by familiar clinicians or in more familiar environments. Feedback from GPs during development and small-scale user-testing with the tool in simulated consultation scenarios was favourable. It is more likely that the tool will be applicable in primary care by internally referring patients to a GP with Special Interest (in LUTS) who can undertake the discussion, examination, gather the required data (e.g. voiding symptoms subscore, flow rates and residual volumes; PSA testing) in consultations that are longer than the usual 10 minutes’ duration, and review treatments chosen and benefits or side effects ensuing. This ‘clinic model’ might operate at the single practice or cluster of practices level. We were unable to meet this objective as originally intended. Up-to-date data on referral rates are required (identified to have risen to 30% of men presenting with LUTS in primary care at the time of commissioning PriMUS study, 2017). The original objective included assessing Clinical Practice Research Datalink data for a reference standard for referrals, and current medical management and to model whether these are likely to diminish by implementing the PriMUS tool. On detailed assessment of feasibility, we concluded that identifying index cases (men presenting with LUTS) may be susceptible to poor coding of presentations in primary care (e.g. when presenting with sometimes vague symptoms). Complex resource intensive work would be required to ascertain cases and outcomes, and the other study pressures (delayed study recruitment and data collection during the pandemic years) meant that this was not feasible. Given the reasonable diagnostic accuracy data for the PriMUS models, it is now possible to undertake economic modelling research, including the potential resource use effects of adopting the PriMUS tool, its diagnoses and management recommendations. This modelling would provide the basis for constructing decision-analytic models with precision to study the impact of the diagnostic precision models. Key outcomes that determine overall resource use are medical management (treatments and consultation time) and referral rates. Training requirements and set-up of ‘clinics’ at practice or cluster level are also relevant.

Authors' results and conclusions: One model each was developed and validated for bladder outlet obstruction and detrusor underactivity, two for detrusor overactivity (detrusor overactivity main, detrusor overactivity sensitivity analysis 2). Age, voiding symptoms subscore, prostate-specific antigen level, median maximum flow rate, median voided volume were predictors for bladder outlet obstruction. Median maximum flow rate and post-void residual volume were predictors for detrusor underactivity. Age, post-void residual volume and median voided volume were included in detrusor overactivity main model, while age and storage symptoms subscore predicted detrusor overactivity sensitivity analysis 2. For all four models, sensitivity of 75.0% could be achieved with a specificity of 74.2%, 47.3%, 45.6% and 46.2% for bladder outlet obstruction, detrusor underactivity, detrusor overactivity main and detrusor overactivity sensitivity analysis 2 models, respectively. Similarly, a specificity of 75.0% could be achieved with a sensitivity of 71.3%, 39.8%, 33.3% and 62.7% for bladder outlet obstruction, detrusor underactivity, detrusor overactivity main and detrusor overactivity sensitivity analysis 2 models, respectively. The prototype tool (not yet intended for use in practice) is available at Primary care Management of lower Urinary tract Symptoms decision aid for lower urinary tract symptoms (shinyapps.io). General practitioner feedback during tool development and small-scale user-testing in simulated consultation scenarios was favourable. Patients supported such management in primary care. We identified sensitivities and specificities of diagnostic models for detrusor overactivity, bladder outlet obstruction and detrusor underactivity in routine United Kingdom practice and developed a prototype clinical decision support tool. Between March 2018 and June 2022, 350 and 251 men were respectively recruited into the development and validation cohorts. In the development cohort (median age 69), 163 (46.6%), 141 (40.3%) and 253 (72.3%) participants were diagnosed with BOO, DU and DO, respectively. In the validation cohort (median age 67), 112 (44.6%), 87 (34.7%) and 166 (66.1%) participants were diagnosed with BOO, DU and DO, respectively. Two models were developed and validated for DO (DO main and DO SA2), while one model each was developed and validated for BOO (BOO model 3) and DU. Age (participant demographics), voiding symptoms subscore (International Consultation on Incontinence Questionnaire – male LUTS questionnaire), prostate-specific antigen (PSA) test result (blood test), median maximum flow rate (uroflowmetry) and median voided volume (uroflowmetry) were predictors for BOO. Median maximum flow rate and post-void residual volume (bladder ultrasound) were predictors for DU. Age, post-void residual volume and median voided volume were included in DO main model, while age and storage symptoms subscore (International Prostate Symptom Score questionnaire) were predictors in DO SA2 model. Bladder outlet obstruction model 3 demonstrated good discriminative performance with an optimism-corrected c-index of 0.80. The models for DU, DO main and DO SA2 demonstrated moderate discriminative ability with an optimism-corrected c-index of respectively 0.64, 0.67 and 0.65. Similar estimates of c-index were observed for each model with the validation cohort. The optimism-corrected calibration slope for each model was

Authors' methods: Prospective diagnostic accuracy study. Two participant cohorts, for development and validation, underwent simple index tests and a reference standard (invasive urodynamics). General practices in England and Wales. Men (16 years and over) consulting their general practitioner with lower urinary tract symptoms. The primary outcome was diagnostic model sensitivity and specificity for detecting bladder outlet obstruction, detrusor underactivity and detrusor overactivity, with 75.0% considered minimum clinically useful performance. Men presenting to their GP with LUTS were recruited prospectively from GP practices in Bristol, Newcastle upon Tyne and Wales. Participants underwent a series of simple index tests and the invasive reference standard (urodynamics). To determine which index tests used in combination best predicted three urodynamic observations (BOO, DU and DO) in men, diagnostic prediction models were developed for each target condition using logistic regression modelling. Multiple imputation by chain equations was used to handle missing data and fractional polynomial functions were used to fit continuous variables. The discriminative ability of the models was assessed using the c-index, and calibration was assessed using calibration plots and the calibration slope. Internal validation was conducted to assess optimism of the performance statistics using the bootstrapping procedure. External validation was conducted to assess model performance in another sample from a similar population as the development cohort. In both forms of validation, the models were recalibrated using the calibration slope as a shrinkage factor to re-estimate the intercept and model coefficients. Sensitivity and specificity were plotted on a receiver operating characteristic plot for each model. Risk thresholds were identified at a sensitivity and specificity of 75%, which was deemed to be the minimum clinically useful performance. Sensitivity analyses were performed by fitting two alternative models for each target condition. In sensitivity analysis 1, predictors that may be difficult to obtain in practice were excluded from the list of candidate predictors (mean urgency score and mean 24-hour fluid intake from the bladder diary) and in sensitivity analysis 2 (SA2), alternative measures or methods of measurement of candidate predictors were considered.

Details

Project Status: Completed

URL for project: https://www.journalslibrary.nihr.ac.uk/programmes/hta/15/40/05

Year Published: 2025

URL for published report: https://www.journalslibrary.nihr.ac.uk/hta/RGTW5711

URL for additional information: English

English language abstract: An English language summary is available

Publication Type: Full HTA

Country: England, United Kingdom

DOI: 10.3310/RGTW5711

MeSH Terms

Lower Urinary Tract Symptoms
Urologic Surgical Procedures, Male
Male
Prostatic Hyperplasia
Urinary Bladder Neck Obstruction
Urinary Bladder, Overactive
Urinary Bladder, Underactive
Decision Support Systems, Clinical
Urinary Incontinence

Contact

Organisation Name: NIHR Health Technology Assessment programme

Contact Address: NIHR Journals Library, National Institute for Health and Care Research, Evaluation, Trials and Studies Coordinating Centre, Alpha House, University of Southampton Science Park, Southampton SO16 7NS, UK

Contact Name: journals.library@nihr.ac.uk

Contact Email: journals.library@nihr.ac.uk

This is a bibliographic record of a published health technology assessment from a member of INAHTA or other HTA producer. No evaluation of the quality of this assessment has been made for the HTA database.