A Hospital Deploys an AI Agent. Is It Covered by HIPAA?
Picture this: a mid-size hospital rolls out an AI agent that schedules patient follow-ups, pulls records from the EHR, and sends appointment reminders via SMS. It touches protected health information at every single step. Is this system covered by HIPAA? The short answer is yes. The longer answer is that HHS has not published specific guidance on AI agents - so we're working from existing rules, a few recent signals, and a lot of inference. In this post, we break down what the 2024-2025 guidance actually says, where the three biggest compliance gaps are, and what a defensible agent architecture looks like.
What HHS Has (and Hasn't) Said About AI Agents
As of early 2025, the HHS Office for Civil Rights (OCR) has not issued a single document that uses the phrase "AI agent" or "agentic AI." The guidance we do have comes from three sources, and none of them directly addresses autonomous, multi-step AI systems.
The December 2024 HIPAA Security Rule NPRM
The proposed rule update, issued December 27, 2024, is the closest thing to AI-relevant HIPAA modernization. It calls for mandatory encryption of ePHI at rest and in transit with only limited exceptions, vulnerability scanning at least every six months, and network segmentation requirements. It mentions "technology assets" broadly but does not distinguish between a static database and an AI agent making 40 API calls per patient interaction. The implication is clear: if your system touches ePHI, every provision applies.
The 2023 HHS AI Strategy Paper
Released in December 2023, this document outlines HHS priorities for responsible AI use across its agencies. It references risk management frameworks and calls for transparency, but it is a strategy document, not a regulation. It creates no enforceable obligations for covered entities or business associates.
OCR's Standing Position
OCR has consistently maintained that HIPAA applies to ePHI regardless of the technology processing it. Whether it's a fax machine, a cloud database, or an autonomous agent queuing lab orders, the same rules apply. This technology-neutral stance means AI agents are already covered. The problem is that the existing rules were written for systems that don't make independent decisions, and the gap between the rules and how agents actually behave is where compliance risk lives.
The Three HIPAA Exposure Points in Agentic AI
AI agents don't just process data the way a traditional application does. They reason, plan, and take actions across multiple systems. That behavior pattern creates three specific HIPAA exposure points that traditional software doesn't trigger.
1. Data Access Scope and the Minimum Necessary Standard
HIPAA's minimum necessary standard (45 CFR 164.502(b)) requires that access to PHI be limited to what's needed for a specific task. Most AI agents are designed to request broad context to perform well. An agent scheduling a follow-up appointment might pull the full patient chart - diagnosis history, medications, lab results - when all it needs is the patient's name, provider, and preferred time slot. In a 2024 OCR enforcement action, a covered entity was fined $1.19 million for failing to implement minimum necessary access controls on a much simpler system. An agent with over-scoped access is a penalty waiting to happen.
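Here's one way to make minimum necessary concrete in code: define an explicit field allowlist per task type and filter every record before the agent sees it. This is a minimal sketch; the task names and field sets are illustrative assumptions, not anything HHS prescribes.

```python
# Minimal sketch of per-task field scoping for minimum necessary access.
# Task names and field sets are illustrative assumptions, not a standard.

SCHEDULING_FIELDS = {"patient_name", "provider", "preferred_time_slot", "contact_phone"}
BILLING_FIELDS = {"patient_id", "billing_codes", "insurance_plan"}

TASK_SCOPES = {
    "schedule_follow_up": SCHEDULING_FIELDS,
    "submit_claim": BILLING_FIELDS,
}

def scope_record(task: str, record: dict) -> dict:
    """Return only the fields the task is allowed to see; refuse unknown tasks."""
    try:
        allowed = TASK_SCOPES[task]
    except KeyError:
        raise PermissionError(f"No minimum-necessary scope defined for task {task!r}")
    # Everything outside the allowlist is dropped, including diagnosis
    # history, medications, and lab results.
    return {k: v for k, v in record.items() if k in allowed}

full_chart = {
    "patient_name": "Jane Doe",
    "provider": "Dr. Smith",
    "preferred_time_slot": "mornings",
    "diagnosis_history": ["..."],   # never reaches the scheduling agent
    "medications": ["..."],
}
print(scope_record("schedule_follow_up", full_chart))
```

The point of the allowlist structure is that adding a new agent task forces an explicit, reviewable decision about what data it may touch.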
2. Audit Trail Gaps
HIPAA requires audit controls under 45 CFR 164.312(b). For a traditional app, that means logging who accessed what and when. For an AI agent that executes 15 chained actions in a single workflow - reading a chart, checking formulary data, sending a referral, updating billing codes - the audit trail needs to capture each discrete action with its own timestamp, data scope, and justification. Most agent frameworks log at the session level, not the action level. That gap turns a compliance checkbox into an actual vulnerability.
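One way to close that gap is to instrument the tool layer itself, so every discrete action emits its own record no matter what the framework logs per session. A hedged sketch; the decorator and tool functions below are hypothetical:

```python
import functools
import time

# Per-action logging at the tool layer: every call gets its own record,
# independent of whatever the agent framework logs per session.

ACTION_LOG = []

def logged_action(data_scope):
    """Decorator that records one log entry per tool invocation."""
    def wrap(fn):
        @functools.wraps(fn)
        def inner(*args, task_context="unspecified", **kwargs):
            entry = {
                "timestamp": time.time(),
                "action": fn.__name__,
                "data_scope": data_scope,   # fields touched, never the PHI itself
                "task_context": task_context,
            }
            try:
                result = fn(*args, **kwargs)
                entry["outcome"] = "success"
                return result
            except Exception:
                entry["outcome"] = "error"
                raise
            finally:
                ACTION_LOG.append(entry)
        return inner
    return wrap

@logged_action(data_scope=["patient_name", "provider"])
def read_chart(patient_id):
    return {"patient_name": "...", "provider": "..."}

@logged_action(data_scope=["contact_phone"])
def send_reminder(patient_id):
    return "sent"

read_chart("p123", task_context="schedule_follow_up")
send_reminder("p123", task_context="schedule_follow_up")
print(len(ACTION_LOG))  # 2 entries: one per action, not one per session
```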
3. Third-Party Tool Calls
This is the exposure point most organizations miss entirely. When an AI agent calls an external API - a drug interaction checker, a scheduling service, a natural language processing endpoint - and passes PHI in the request, that API provider becomes a business associate under HIPAA. That means a business associate agreement (BAA) is required before the first request is sent. An agent that dynamically selects tools at runtime could trigger five or more new business associate relationships in a single patient interaction, each requiring a signed agreement. Without controls, agents create compliance obligations faster than legal teams can draft agreements.
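A runtime guard can stop that cascade before it starts: block any PHI-bearing call to a tool that isn't in a registry of providers with executed BAAs. A minimal sketch, with hypothetical tool names and registry contents:

```python
from datetime import date

# Hypothetical registry of external tools with executed BAAs.
# In practice this is maintained by legal/compliance, not hard-coded.
BAA_REGISTRY = {
    "drug_interaction_checker": {"baa_signed": date(2024, 3, 1)},
    "scheduling_service": {"baa_signed": date(2024, 7, 15)},
}

class UnapprovedToolError(Exception):
    pass

def dispatch_tool_call(tool_name: str, payload: dict, contains_phi: bool):
    """Refuse PHI-bearing calls to any tool without a signed BAA on file."""
    if contains_phi and tool_name not in BAA_REGISTRY:
        raise UnapprovedToolError(
            f"{tool_name} has no BAA on file; blocking PHI disclosure"
        )
    # ... the actual HTTP call would go here ...
    return {"tool": tool_name, "status": "dispatched"}

# The agent picked a tool at runtime that legal has never seen:
try:
    dispatch_tool_call("nlp_summarizer", {"note": "patient note ..."}, contains_phi=True)
except UnapprovedToolError as e:
    print(e)  # blocked before the first request is sent
```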
What a Compliant AI Agent Architecture Looks Like
The compliance challenges above aren't theoretical. They're engineering problems with engineering solutions. Here's what a defensible AI agent deployment looks like across four critical dimensions.
Self-hosted deployment eliminates the third-party tool call problem at the infrastructure level. When the LLM, the vector store, and every tool the agent calls all run inside your network perimeter, PHI never leaves your control. The AI layer itself creates no external business associate relationships.
Task-scoped access controls enforce minimum necessary at the agent level. Instead of giving an agent read access to the full EHR, each task gets a scoped token that expires after the workflow completes. A scheduling agent sees scheduling-relevant fields. A billing agent sees billing codes. Neither sees the other's data.
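Here's a rough sketch of what a task-scoped token can look like. The field names and the five-minute TTL are assumptions, and a production system would issue these from the EHR's own authorization layer rather than in-process:

```python
import secrets
import time
from dataclasses import dataclass, field

# Illustrative sketch of a task-scoped, short-lived access token.

@dataclass
class TaskToken:
    task: str
    allowed_fields: frozenset
    expires_at: float
    value: str = field(default_factory=lambda: secrets.token_urlsafe(32))

    def is_valid(self) -> bool:
        return time.time() < self.expires_at

def issue_token(task: str, allowed_fields: set, ttl_seconds: int = 300) -> TaskToken:
    return TaskToken(task=task, allowed_fields=frozenset(allowed_fields),
                     expires_at=time.time() + ttl_seconds)

def read_ehr_field(token: TaskToken, field_name: str):
    if not token.is_valid():
        raise PermissionError("Token expired; workflow must re-authorize")
    if field_name not in token.allowed_fields:
        raise PermissionError(f"Task {token.task!r} may not read {field_name!r}")
    # ... the actual EHR read would go here ...
    return f"<{field_name} for scoped task>"

tok = issue_token("schedule_follow_up", {"patient_name", "provider", "preferred_time_slot"})
print(read_ehr_field(tok, "patient_name"))    # allowed
# read_ehr_field(tok, "diagnosis_history")    # raises PermissionError
```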
Immutable per-action logs solve the audit trail problem. Every discrete action the agent takes - every data read, every API call, every decision point - gets its own log entry with a timestamp, the specific data accessed, and the task context. These logs are append-only and tamper-evident, meeting the 2024 NPRM's proposed integrity requirements.
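Tamper evidence can be approximated in application code by hash-chaining log entries, so altering any earlier record invalidates every later one. A minimal illustration (a real deployment would also replicate entries to write-once storage):

```python
import hashlib
import json
import time

# Minimal hash-chained log: each entry commits to the previous entry's hash,
# so editing or deleting any record breaks verification of the whole chain.

class ActionLog:
    def __init__(self):
        self._entries = []

    def append(self, action: str, data_scope: list, task_context: str):
        prev_hash = self._entries[-1]["hash"] if self._entries else "GENESIS"
        body = {
            "timestamp": time.time(),
            "action": action,
            "data_scope": data_scope,     # fields touched, never the PHI itself
            "task_context": task_context,
            "prev_hash": prev_hash,
        }
        body["hash"] = hashlib.sha256(
            json.dumps(body, sort_keys=True).encode()
        ).hexdigest()
        self._entries.append(body)

    def verify(self) -> bool:
        prev = "GENESIS"
        for e in self._entries:
            body = {k: v for k, v in e.items() if k != "hash"}
            recomputed = hashlib.sha256(
                json.dumps(body, sort_keys=True).encode()
            ).hexdigest()
            if body["prev_hash"] != prev or e["hash"] != recomputed:
                return False
            prev = e["hash"]
        return True

log = ActionLog()
log.append("read_chart", ["patient_name", "provider"], "schedule_follow_up")
log.append("send_sms_reminder", ["contact_phone"], "schedule_follow_up")
print(log.verify())  # True; flipping any stored field makes this False
```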
Human-in-the-loop checkpoints add a control layer for high-risk actions. Before an agent sends a message containing PHI, modifies a treatment plan, or shares data with another system, a clinician reviews and approves the action. This catches both compliance violations and clinical errors.
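In code, a checkpoint can be as simple as routing a defined set of high-risk action types through an approval queue instead of executing them directly. A hedged sketch; the action names are illustrative:

```python
# Sketch of a checkpoint gate: high-risk action types queue for clinician
# approval instead of executing immediately. Action names are illustrative.

HIGH_RISK_ACTIONS = {"send_phi_message", "modify_treatment_plan", "external_data_share"}

pending_approvals = []

def execute(action_type: str, payload: dict):
    if action_type in HIGH_RISK_ACTIONS:
        pending_approvals.append({"action": action_type, "payload": payload})
        return "queued_for_clinician_review"
    return _run(action_type, payload)

def approve_next(clinician_id: str):
    """Called from the clinician's review-queue UI."""
    item = pending_approvals.pop(0)
    # The approval event itself belongs in the per-action audit log.
    return _run(item["action"], item["payload"], approved_by=clinician_id)

def _run(action_type: str, payload: dict, approved_by: str | None = None):
    # ... the actual side effect would go here ...
    return {"action": action_type, "approved_by": approved_by, "status": "done"}

print(execute("send_phi_message", {"to": "patient", "body": "..."}))  # queued
print(approve_next("dr_smith"))  # executes only after review
```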
| Dimension | Cloud-Hosted Agent | Self-Hosted Agent |
|---|---|---|
| Third-party BAA exposure | BAA required for every external tool and the LLM provider - potentially 5+ agreements per workflow | Zero external BAAs for the AI layer; PHI stays on-premise |
| Access scope control | Depends on vendor's permission model; often coarse-grained | Full control over per-task token scoping and data isolation |
| Audit log granularity | Session-level logs from vendor; limited visibility into action chains | Per-action immutable logs with full data lineage |
| Human-in-the-loop | Limited to what the vendor API supports; approval flows vary | Custom checkpoint rules at any point in the agent workflow |
Frequently Asked Questions
Does HIPAA require a BAA for every tool an AI agent calls?
If the tool receives, processes, or has access to PHI on behalf of a covered entity, yes. Under 45 CFR 160.103, any entity that creates, receives, maintains, or transmits PHI on behalf of a covered entity is a business associate. When an AI agent passes patient data to an external API - even a simple lookup service - that API provider meets the definition. The practical challenge is that agents can call tools dynamically, so organizations need a pre-approved tool registry with BAAs already in place before the agent can route requests to them. Self-hosted deployments sidestep this entirely by keeping all tool calls internal.
Can we use the OpenAI Assistants API for healthcare workflows?
OpenAI offers a BAA for Enterprise customers, which covers ChatGPT Enterprise and the API. However, signing a BAA does not make a deployment compliant on its own. You still need to ensure that data sent to the API meets minimum necessary requirements, that responses containing PHI are stored and transmitted according to the Security Rule, and that audit logs capture the full interaction chain. The December 2024 NPRM's proposed encryption and segmentation requirements would add further obligations. For workflows involving high-volume PHI or autonomous agent actions, the operational overhead of maintaining compliance with a cloud-hosted model often exceeds the cost of a self-hosted deployment.
What should our AI agent audit logs include to satisfy HIPAA?
At minimum, each log entry should include: (1) a timestamp, (2) a unique action identifier, (3) the user or system that triggered the action, (4) the specific data elements accessed or modified, (5) the purpose or task context, and (6) the outcome. For agent workflows, we recommend also logging decision points where the agent chose between actions, any tools or APIs called with request/response metadata (excluding PHI in the log itself), and human approval events. The 2024 NPRM proposes requiring logs to be immutable and reviewed at least every 12 months. Building these fields into your agent framework from day one is significantly easier than retrofitting them after an OCR inquiry.
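If it helps to see those fields as a schema, here's one possible shape for a per-action entry. The field names map to the list above, but none of this is an HHS-mandated format:

```python
from dataclasses import dataclass
from typing import Optional

# One possible shape for a per-action audit log entry.
# Field names mirror the list above; this is not an HHS-mandated schema.

@dataclass(frozen=True)
class AuditLogEntry:
    timestamp: str                # (1) ISO-8601, e.g. "2025-03-14T09:26:53Z"
    action_id: str                # (2) unique per discrete action
    actor: str                    # (3) user or system that triggered the action
    data_elements: tuple          # (4) fields accessed/modified, never raw PHI
    task_context: str             # (5) purpose, e.g. "schedule_follow_up"
    outcome: str                  # (6) "success", "denied", "error"
    decision_point: Optional[str] = None  # branch the agent chose, if any
    tool_called: Optional[str] = None     # external tool name + request metadata
    approved_by: Optional[str] = None     # clinician ID for human-approval events

entry = AuditLogEntry(
    timestamp="2025-03-14T09:26:53Z",
    action_id="act_000123",
    actor="agent:scheduler-v2",
    data_elements=("patient_name", "preferred_time_slot"),
    task_context="schedule_follow_up",
    outcome="success",
)
print(entry)
```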
The Bottom Line
HHS hasn't published agent-specific HIPAA guidance, and we may not see any before 2026. That's not a reason to wait. The existing rules already apply to AI agents - they just weren't written with agents in mind, which means the compliance burden falls on you to close the gaps. Treat every AI agent as if it were a business associate. Scope data access per task, not per agent. Log every individual action, not just sessions. And if you want to eliminate the single largest compliance risk vector - third-party PHI exposure - self-host the entire stack. The organizations that build this way now won't be scrambling when enforcement catches up.