Google - Healthcare
Reducing Patient Intake Time by 73%
How we built an AI-powered patient intake system for Google Cloud's healthcare platform that processes insurance verification, medical history, and registration in under 3 minutes.
Duration
14 weeks
Team
3 engineers, 1 data scientist, 1 PM
Tech Stack
The Challenge
Google Cloud's Healthcare division was onboarding large health system customers onto its Cloud Healthcare API, but the patient intake workflows at these organizations were a persistent pain point. Across 40+ clinics in their flagship customer network, intake averaged 12 minutes per patient - a process that involved front-desk staff toggling between three disconnected systems: their EHR (Epic), an insurance verification portal, and a legacy patient registration database built in the early 2010s. Data entry errors were rampant, with roughly 1 in 8 records containing at least one critical mistake (wrong policy number, misspelled drug names, outdated allergy information). Patients routinely waited 20-30 minutes before seeing a provider, and clinics were losing an estimated 6% of new patients who simply walked out before completing registration.
Google Cloud had attempted two prior approaches for their healthcare customers. The first was a patient-facing iPad kiosk app from a major EHR vendor - patients found it confusing and staff spent nearly as long correcting kiosk entries as they did doing manual intake. The second was an RPA (robotic process automation) tool that automated keystrokes across systems but broke constantly when UI elements changed and couldn't handle edge cases like expired insurance cards or patients with multiple active policies. Both approaches failed because they automated the surface of the workflow without understanding the underlying data relationships.
Our Approach
We spent the first two weeks embedded in three clinics within Google Cloud's healthcare customer network - shadowing front-desk coordinators, interviewing nurses, and mapping every data touchpoint from the moment a patient walked in to the moment they were roomed. This revealed that 60% of intake time was spent on just two tasks: insurance verification and medical history reconciliation. The real bottleneck wasn't data entry speed - it was the cognitive load of cross-referencing information across fragmented systems.
We evaluated several OCR solutions for insurance card scanning, including AWS Textract, Google Document AI, and Azure Form Recognizer. After benchmarking all three against a test set of 2,000 insurance cards (representing 40+ payer formats), we selected AWS Textract paired with a custom post-processing layer that normalized payer-specific field layouts. For medical history extraction, we fine-tuned a clinical NLP model based on BioClinicalBERT on de-identified patient records, achieving 96.3% accuracy on entity extraction for medications, allergies, diagnoses, and procedures.
The integration architecture was designed around Epic's FHIR R4 APIs to avoid any HL7v2 interface engine complexity. We built a middleware layer in Python (FastAPI) that orchestrated the OCR pipeline, NLP extraction, insurance eligibility checks via Availity's real-time API, and two-way sync with Epic. All PHI was encrypted at rest (AES-256) and in transit (TLS 1.3), and the system was deployed within a HIPAA-compliant GovCloud environment.
The Solution
The production system consists of four core components: (1) a tablet-based patient check-in interface built with React Native that captures insurance card images and patient-reported information via guided forms, (2) an OCR and extraction pipeline running on AWS Lambda that processes insurance cards and prior medical documents in under 4 seconds, (3) a FastAPI middleware service hosted on ECS Fargate that handles business logic - eligibility verification, data validation, duplicate detection, and record reconciliation - and (4) a bidirectional FHIR integration layer that reads from and writes to Epic in real-time. The system pre-populates 85% of intake fields automatically, flags discrepancies for staff review rather than requiring manual re-entry, and provides real-time validation so errors are caught at the point of entry rather than downstream.
Results
- 73% reduction in average intake time (12 min to 3.2 min), measured across all 40 clinics over the first 90 days post-launch
- 89% decrease in data entry errors (from 12.4% error rate to 1.3%), validated by monthly chart audits
- Patient satisfaction scores increased from 3.2 to 4.7 out of 5 (NPS surveys, n=12,000 over 6 months)
- Staff reallocation saved $1.2M annually across 40 clinics - 14 FTEs shifted from data entry to patient-facing care coordination
Key Insight
Automating data entry without fixing data reconciliation just moves errors downstream faster - the real leverage was building a system that understood the relationships between insurance, medication, and clinical data.
“We'd been burned by two prior automation projects, so honestly, we were skeptical. What set this apart was the upfront clinic work - they found problems in our workflow that we'd been blind to for years. Our front desk staff actually like using the system, which tells you everything.”
WGWill Grannis
VP & CTO, Google Cloud