Digital Transformation

Building an NHS Data Architecture That Survives an EPR Go-Live

Blue icon of a person with a gear, representing user settings or account configuration.
Pamela Sengupta
Blue calendar icon with a grid representing days and two rings at the top.
June 23, 2026

EPR go-live is the milestone that NHS digital programmes are built around. Years of procurement, design, configuration, training, and preparation culminate in a single cutover event that changes how every clinical workflow in a Trust is recorded and managed. It is, rightly, treated as a major organisational achievement.

But the data architecture conversation that should run alongside EPR implementation is consistently underweighted. For many NHS Trusts, the EPR is treated as the data strategy: once the EPR is live, the assumption is that the data problem is largely solved. This assumption is wrong, and the consequences of it are well documented and increasingly costly.

This article sets out what happens to NHS data environments at EPR go-live when the surrounding architecture has not been prepared, why the EPR itself cannot solve the problems that sit outside it, and what a data architecture designed to survive and support an EPR go-live actually requires.

The Cost of Getting This Wrong

Analysis published in May 2026 by healthcare data specialists MBI Health estimated that NHS Trusts in England could spend more than £13.5 million in 2026 alone correcting data problems that emerge after EPR go-lives. That figure is based on nine major acute Trust EPR transitions expected to go live in England during the year, at a typical post-go-live data remediation cost of £1.5 million per Trust.

These are direct remediation costs only. They do not include wider productivity losses, internal staff time diverted from clinical and operational work, delayed benefits realisation, longer-term optimisation costs, or the patient care implications of disrupted waiting list management.

The mechanism of failure is consistent. Patient tracking lists increase by around 25% on average following EPR go-live, reflecting duplicated records, incomplete data, and referrals migrated incorrectly into the new system. Because patient tracking lists underpin Referral to Treatment management, disruption to those records directly impairs a Trust's ability to manage elective waiting list performance at exactly the moment when staff are already under pressure from the transition itself.

NHS England's own RTT publication data has recorded instances where Trusts' waiting time figures for specific months had to be revised following EPR go-lives that affected data completeness. These are not isolated incidents. They are a predictable pattern that plays out repeatedly across NHS EPR transitions, regardless of the EPR system being implemented.

The £13.5 million NHS Trusts are expected to spend on EPR data remediation in 2026 covers only the direct cost of fixing what went wrong. The true cost, including disrupted waiting list management, delayed benefits, and lost staff productivity, is considerably higher.

Why EPR Go-Live Reveals Data Problems That Already Existed

The instinct in most post-go-live reviews is to attribute data problems to the migration itself: something went wrong in the cutover, records were not transferred correctly, the mapping between old and new systems was imperfect. These are real contributors. But the deeper cause is almost always present well before the cutover begins.

NHS data environments are, in most Trusts, characterised by fragmentation. Clinical data sits across dozens of systems. Patient identifiers are inconsistent between systems. Referral and pathway data is incomplete in the legacy systems from which EPR data is migrated. Duplicate records exist in the source data long before migration begins.

The EPR migration process does not create these problems. It surfaces them. A migration that attempts to move records from a fragmented, inconsistently coded legacy environment into a structured EPR will import those structural problems into the new system unless they are addressed upstream of the cutover.

The Health Services Safety Investigations Body has formally confirmed that EPR programmes can contribute to missed, delayed, or incorrect patient care due to issues in implementation, usability, training, and optimisation. Data quality problems in the migrated estate are a documented contributor to those safety risks, not a background technical concern.

The Pre-Migration Data Quality Deficit

Multiple NHS EPR programmes in 2025 and 2026 have cited data readiness as a reason for delayed go-live dates. Coventry and Warwickshire Partnership NHS Trust deferred its go-live to allow completion of structured data cleansing and referral cleanup activities, with programme costs rising from £3.5 million to £4.46 million as a result. Sheffield Teaching Hospitals went live with its EPR in July 2025 following an eight-month delay to address system and organisational readiness issues, of which data readiness was a component.

These are not programme management failures. They are the correct decision when data readiness has not been achieved before cutover. The alternative, proceeding to go-live with inadequate data preparation, produces the post-go-live remediation costs and patient list disruption that are now well documented in the public record.

The EPR as a System of Record, Not a Data Architecture

The most important conceptual shift for NHS digital leaders approaching an EPR programme is to understand what an EPR does and does not do for the Trust's data environment.

An EPR is a system of record for clinical activity. It captures patient demographics, clinical observations, diagnoses, prescribing, orders, results, and pathway events in a structured, accessible way. For Trusts moving from paper-based or fragmented legacy systems, this is transformative. The EPR is the right investment.

What an EPR is not is an analytical data platform, a population health tool, an operational intelligence layer, or a replacement for a Trust-managed data repository. These capabilities require data to flow out of the EPR and into other platforms where it can be joined with operational, workforce, financial, and community data to support the decisions that clinical and operational leaders actually need to make.

A Digital Maturity Assessment reported in early 2026 found that while 93% of NHS providers now have an EPR in place, only 30% have achieved fully integrated, bidirectional data flows between systems. This gap between EPR presence and data integration is the defining data architecture challenge for NHS Trusts in 2026 and beyond. The EPR is in. The surrounding architecture that makes the EPR's data useful at scale has not been built.

Digitisation is largely in place across the NHS. Transformation is not. The EPR alone does not deliver integrated care or operational intelligence. Those outcomes require data to flow out of the EPR and into a structured analytical environment.

The Architecture That Needs to Exist Around the EPR

Building a data architecture that survives and supports an EPR go-live means designing, before go-live, the three interconnected layers that allow the EPR to function as part of a broader data ecosystem rather than as an isolated system of record.

Layer One: The Local Data Repository

The Trust's own cloud data platform is the analytical and operational intelligence layer that sits around the EPR. It receives structured data feeds from the EPR and from the other clinical, operational, and corporate systems the Trust runs. It is where data from multiple source systems is joined, quality-checked, and made available for Trust-level reporting, analytics, and AI.

This platform should be designed and at least partially built before EPR go-live. The data flows from the EPR into the local repository should be defined and tested during the EPR implementation programme, not treated as a post-go-live workstream. Trusts that arrive at EPR go-live without a defined local data layer find that EPR data remains locked in the EPR vendor's reporting environment, accessible only via the tools the vendor provides, rather than integrated with the broader data estate.

Layer Two: The NHS Federated Data Platform

The NHS Federated Data Platform sits above the local EPR and Trust data systems, providing national operational tools for waiting list management, theatre optimisation, discharge planning, and population health analytics. FDP onboarding is no longer optional: NHS Medium-Term Planning guidance has formally shifted the FDP from an innovation to a foundational requirement, with universal onboarding expected across acute, community, and mental health services.

FDP value depends entirely on data quality at the EPR and local system level. As practitioners working on North West London FDP implementation have documented, data quality must be assured at the EPR level before it flows into the FDP. The implication for EPR programmes is direct: the data quality work required for FDP participation is the same data quality work required for a successful EPR go-live. The two programmes share a dependency, and should share a data quality workstream.

Layer Three: Integration Architecture and Standards

The integration layer defines how data moves between EPR, local repository, FDP, and the other systems in the Trust's estate. FHIR R4 using the NHS UK Core implementation is the emerging standard for this integration in NHS settings, providing a consistent, interoperable data exchange format that reduces the bespoke point-to-point integration work that has historically made NHS data environments difficult to maintain.

Integration standards need to be defined before EPR procurement is finalised, not after. Once an EPR is procured and configured, the integration architecture is constrained by what the EPR vendor supports. Trusts that define their integration requirements early, and select EPR systems and configurations accordingly, retain significantly more architectural flexibility than those that treat integration as an implementation detail to be resolved later.

What Has to Be True Before Cutover

The difference between an EPR go-live that stabilises quickly and one that produces months of data remediation work almost always comes down to decisions and actions taken before cutover day. The following represent the non-negotiable prerequisites for a data architecture that survives go-live.

Data Discovery and Source System Documentation

Before migration planning can begin meaningfully, the Trust must have a clear, documented picture of what data exists in the systems being retired or integrated. This means knowing what patient records, referral pathways, clinical observations, and operational data exist across legacy systems, how complete they are, where duplicates exist, and what data will and will not be migrated. Automated discovery tooling provides a more complete picture than stakeholder interviews alone and surfaces the undocumented data that consistently causes post-go-live surprises.

Data Cleansing and Quality Remediation Upstream

Data that is not fit for migration should be cleaned before cutover, not after. This means resolving duplicate patient records, validating and completing referral pathway data, coding data consistently with the target EPR's data model, and applying retention policies to records that should be archived rather than migrated. The cost of doing this upstream of go-live is consistently lower than the cost of doing it reactively after go-live under operational pressure.

Waiting List and PTL Validation

Because patient tracking list disruption is the most operationally damaging post-go-live data problem, waiting list and PTL validation deserves its own dedicated workstream in the pre-go-live programme. East Suffolk and North Essex NHS Foundation Trust, which went live with its EPR in October 2025 with a smoother than average stabilisation, cited intensive focus on wait list validation as one of the specific preparatory activities that contributed to its outcome. This is a transferable lesson, not a one-off success.

Defined Data Flows Out of the EPR

The data flows from the EPR to the local data repository, to the FDP, and to other integrated systems should be designed, configured, and tested before go-live. Treating these as post-go-live workstreams means the analytical capabilities that are part of the EPR business case cannot be realised in the months following go-live when stakeholder confidence in the new system is most fragile and most in need of evidence.

Governance for the New Data Environment

The EPR creates new data ownership questions. Who owns the clinical data in the EPR? Who owns the data that flows from the EPR into the local repository? Who has authority to change data definitions, access controls, or retention policies? These governance questions are difficult to resolve under the pressure of a live cutover. They should be resolved in the programme's design phase and documented in a data governance framework that covers the full three-tier data environment, not just the EPR itself.

The Programme Sequencing Question

A recurring challenge in NHS EPR programmes is that the EPR implementation consumes the organisation's digital programme capacity entirely, leaving no bandwidth for the surrounding data architecture work that makes the EPR valuable. The data repository design is deferred. The FDP integration is treated as a later phase. The governance framework is written after go-live.

This sequencing produces the pattern that is now well documented: EPR goes live, reporting gaps are discovered, analytical capabilities that were in the business case are not available, data leaders are brought in to remediate problems that should have been prevented. The chair of the NHS Chief Data and Analytical Officer Network has been direct on this point: data and analytics leaders need to be involved from the outset of EPR programmes, not brought in after the key decisions have been made.

The practical implication is that the data architecture workstream should be formally scoped and resourced within the EPR programme, not treated as a parallel or subsequent initiative. The deliverables, the data discovery output, the target architecture design, the integration specifications, the data quality remediation plan, are programme deliverables with defined timelines, not background activities managed informally alongside the main programme.

What Good Looks Like

NHS Trusts that navigate EPR go-lives most successfully share a consistent set of characteristics in how they approach the surrounding data architecture.

  1. They conduct structured data discovery before migration planning begins, giving them a complete and honest picture of the legacy data estate.
  1. They invest in data quality remediation upstream of cutover, treating it as a programme workstream with defined ownership, timelines, and quality gates.
  1. They define the target three-tier architecture, EPR, local data repository, and FDP, before the EPR go-live, with integration standards and data flows specified and tested.
  1. They involve data and analytics leadership in EPR programme governance from the outset, not as reviewers of decisions already made.
  1. They resource PTL and waiting list validation as a dedicated pre-go-live activity, recognising that this is the highest-risk post-go-live operational failure mode.
  1. They design governance for the full data environment, covering data ownership, access controls, and retention policies across all three tiers before go-live.

Where VE3 Can Help

VE3 works with NHS Trusts to design and deliver the data architecture, governance frameworks, and discovery and migration programmes that make EPR go-lives stable and analytically productive from day one. Our engagement model integrates data architecture work into EPR programmes from the earliest design phase, ensuring that the surrounding infrastructure that makes EPR data valuable is built in parallel with the EPR itself, not deferred until after the problems have appeared.

If your Trust is in EPR procurement, pre-implementation, or post-go-live stabilisation and is looking to address gaps in its surrounding data architecture or data quality position, we would welcome the conversation.

Woman sitting on couch wearing a white cable-knit sweater and blue jeans, holding a phone with one hand.
  • © 2026 VE3. All rights reserved.
LinkedIn logo in white on a gray circular background.Facebook social media icon with white f on a gray circular background.Gray circle with white X symbol, indicating a close or cancel button.Gray play button icon within a rounded square with a subtle drop shadow on a white background.