← All posts

What ABA 512 and Heppner Together Require From Legal AI Systems

April 2, 2026·6 min read·Legal Tech

Reading mode

ABA Formal Opinion 512 and United States v. Heppner should be read together.

Opinion 512, issued on July 29, 2024, is the clearest ABA statement of the professional duties lawyers still carry when generative AI is part of the work. Heppner, decided in the Southern District of New York on February 17, 2026 after a bench ruling on February 10, 2026, shows what happens when those duties meet a public AI workflow with weak legal boundaries.

One gives the framework. The other gives the warning.

Taken together, they do something more useful than either authority does alone. They turn a fuzzy conversation about "responsible AI" into a concrete standard for legal workflows. The buying frame shifts from whether a system feels helpful to whether it creates a professional environment a lawyer can actually defend.

What ABA 512 does

Formal Opinion 512 does not ban AI. It sets the baseline duties lawyers remain responsible for when AI is used in representation.

Those duties include:

  • competence
  • confidentiality
  • communication with clients when appropriate
  • candor toward tribunals
  • supervisory responsibilities
  • reasonable fees and expenses

Opinion 512 is not really about one failure mode. The opinion reaches beyond hallucinations, fake citations, or "be careful with ChatGPT." The core operational test is whether the surrounding system makes competent AI use possible for a lawyer at all.

The novelty has faded. The system-design question underneath it has not.

The official opinion is here:

What Heppner does

Heppner is not a general anti-AI decision. The decision addresses privilege and work-product claims in a specific kind of workflow.

Judge Rakoff held that written exchanges between the defendant and Anthropic's consumer version of Claude were protected by neither the attorney-client privilege nor the work product doctrine on the facts before the court.

The court focused on a few points:

  • the exchanges were not communications between attorney and client
  • they were not confidential in light of the third-party platform and its terms
  • the defendant was not acting at counsel's direction in using the tool
  • the resulting materials did not fit the claimed work-product theory

Those details are the case.

The opinion is here:

What neither one means on its own

The lazy read of 512 is: be careful with AI.

The lazy read of Heppner is: AI destroys privilege.

Both are too shallow.

512 is better read as a systems-design document. Heppner is better read as a fact-specific warning about what happens when sensitive legal work flows through a public consumer tool without controlled boundaries.

Read together, the message is clear

512 says lawyers remain responsible.

Heppner shows that a public AI workflow does not somehow relieve them of that responsibility. If anything, it makes the need for boundaries more obvious.

Read together, 512 and Heppner shift attention away from demo quality and back toward workflow quality.

The concrete questions are:

  • what information reaches the model
  • what remains inside controlled application-layer storage
  • what review boundary exists before legal effect
  • what records exist of generation, editing, approval, and use
  • what kind of workflow the system is actually enforcing

Those are the conditions under which legal AI becomes usable in practice rather than merely interesting in a demo.

Four practical requirements that follow

1. Review has to be a real state, not an expectation

Opinion 512 makes clear that lawyers cannot blindly rely on AI output. Heppner shows why weak workflow assumptions are dangerous. Draft and final cannot be treated as the same thing.

A legal AI system should make clear:

  • what is draft
  • what is pending review
  • what was edited
  • what was approved
  • what has external effect

If the product treats review as a soft suggestion, it is built on the wrong assumption.

2. Confidentiality is a system-boundary question

Heppner is a reminder that confidentiality does not survive by wishful thinking. It depends on the actual workflow and the actual third-party relationship.

That pushes legal AI design toward:

  • bounded retrieval
  • scoped data access
  • deliberate prompt assembly
  • controlled storage boundaries
  • visible provider roles and data paths

The practical evaluation is not just whether a model is hosted. Buyers need to know what reaches that model and what remains outside it.

3. Provenance requires visibility

Opinion 512 keeps supervisory responsibility with the lawyer. That duty is hard to satisfy if the system hides its own process.

A legal AI workflow should make provenance visible:

  • what task ran
  • what source material was used
  • what output was produced
  • what changed during review
  • what became operative

Without that, supervision becomes performative instead of operational.

4. Consumer AI and legal infrastructure are not the same category

Heppner is a good example of what happens when people blur those categories. A consumer chat interface is not the same thing as a legal workflow.

A consumer chat tool can be useful. By itself, it is not legal infrastructure.

Legal infrastructure requires more:

  • role boundaries
  • review states
  • bounded retrieval
  • confidentiality-aware workflows
  • auditable output paths

Legal AI products will increasingly stand or fall on that distinction.

The standard buyers should use

Managing partners, legal ops teams, and in-house counsel should use a stricter standard than headline productivity claims.

The useful test is whether the system makes legal duties easier or harder to satisfy in the actual workflow.

512 and Heppner point toward the same conclusion:

  • a useful legal AI system must be reviewable
  • a safe legal AI system must be bounded
  • a professional legal AI system must be auditable

The framework is better than "Which model does it use?"

Why this should change how products get evaluated

The legal AI market still spends too much time on draft quality, context window size, and model branding. 512 and Heppner point somewhere more useful.

The real questions are operational:

  • what happens before the draft is created
  • what information reaches the system
  • what review boundary exists before legal effect
  • what a supervising attorney can actually see
  • what records exist of generation, editing, approval, and use

Those are system-design questions. They determine whether the product fits legal practice or merely imitates it.

Further reading

The infrastructure legal runs on.

Guided by attorney judgment.