
Why Review Boundaries Matter More Than Model Choice

April 2, 2026

The legal AI market still obsesses over the wrong comparison.

People ask:

  • Claude or GPT?
  • hosted or local?
  • fine-tuned or not?
  • how many tokens?

Those questions are not meaningless. But they are not the first question a law firm should ask.

The first question should be simpler:

Where is the review boundary?

If a legal AI system cannot enforce a real boundary between draft output and legal effect, the rest of the stack matters less than vendors want buyers to believe.

Model choice matters less than system boundaries

A better model can improve drafting quality. It can improve recall, reasoning, and fluency. All of that matters.

But legal risk does not begin and end with output quality.

Legal work is sensitive because it creates consequences:

  • a filing can misstate authority
  • a letter can waive leverage
  • a summary can omit a fact that matters
  • a demand can overstate a record
  • a citation can be presented as reliable when it has not been verified

The question is not just whether the model is strong. The question is whether the surrounding system treats its output as draft, reviewable, and bounded before anyone relies on it.

That is a workflow problem, not a model-selection problem.

ABA 512 points in that direction

ABA Formal Opinion 512 was issued on July 29, 2024. It remains the clearest ABA statement of the duties lawyers still carry when generative AI is part of a representation.

Those duties include:

  • competence
  • confidentiality
  • supervisory responsibilities
  • candor
  • reasonable fees

Read at the systems level, 512 does not say "pick the best model." It says lawyers remain responsible for what the system does and how they use it.

That implies a design requirement:

AI output should move through a reviewable workflow before it becomes externally effective legal work.

Source: ABA Formal Opinion 512, "Generative Artificial Intelligence Tools" (July 29, 2024).

Heppner shows why soft review language is not enough

United States v. Heppner, No. 25-cr-00503-JSR (S.D.N.Y.), makes the same point from a different direction.

Judge Rakoff ruled from the bench on February 10, 2026, and issued a written memorandum on February 17, 2026. The court held that the defendant's written exchanges with Anthropic's consumer version of Claude were protected by neither the attorney-client privilege nor the work product doctrine on the facts presented.

Heppner is not a general anti-AI ruling. It is a reminder that weak workflow boundaries create legal consequences.

That is why generic review language is not enough. The system has to make review real.

Source: United States v. Heppner, No. 25-cr-00503-JSR (S.D.N.Y. Feb. 17, 2026) (memorandum opinion).

What a real review boundary looks like

A real review boundary is not a disclaimer that says lawyers should check the output.

It is enforced system state.

That means the product can distinguish between:

  • generated draft
  • pending review
  • edited draft
  • approved output
  • rejected output

And it means the system can prevent certain things from happening until review occurs.

For legal AI, that should include at least:

  • sending externally effective communications
  • exporting final legal work product
  • filing or issuing operative output
  • marking generated material as final approved work

That is what makes review architectural rather than aspirational.
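
To make that concrete, here is a minimal sketch in TypeScript of what an enforced boundary can look like. It is illustrative only: the state names, the Draft shape, and functions like assertReviewed and approve are assumptions for this example, not any particular product's API.

  // Hypothetical draft lifecycle. All names here are illustrative,
  // not any particular product's API.
  type ReviewState =
    | "generated_draft"
    | "pending_review"
    | "edited_draft"
    | "approved"
    | "rejected";

  interface Draft {
    id: string;
    state: ReviewState;
    approvedBy?: string; // set only by an explicit approval
  }

  // Actions with external legal effect that the system gates.
  type GatedAction =
    | "send_external"
    | "export_final"
    | "file_or_issue"
    | "mark_final";

  // The boundary itself: gated actions fail unless the draft is approved.
  function assertReviewed(draft: Draft, action: GatedAction): void {
    if (draft.state !== "approved") {
      throw new Error(
        `Blocked: cannot ${action} while draft ${draft.id} is "${draft.state}". ` +
          "Attorney approval is required first."
      );
    }
  }

  // Approval is an explicit, attributed state transition.
  function approve(draft: Draft, attorneyId: string): Draft {
    if (draft.state !== "pending_review" && draft.state !== "edited_draft") {
      throw new Error(`Cannot approve from state "${draft.state}".`);
    }
    return { ...draft, state: "approved", approvedBy: attorneyId };
  }

The detail that matters is that the refusal lives in code paths the product actually takes, not in a disclaimer a user can scroll past.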

Why this matters more than model choice

Imagine two legal AI systems.

System A uses the best available model, but the workflow is loose. Generated output can be copied, exported, or acted on with minimal structure. Review is expected, but not strongly enforced.

System B uses a slightly weaker model, but the workflow is strict. Output is task-bounded, provenance is visible, and the system keeps draft and final states separate until a lawyer acts.

For real legal practice, System B is often the better system.

That is not because model quality does not matter. It is because review boundaries determine whether model quality is being used inside a controlled legal workflow at all.

Review boundaries also improve supervision

Partners and supervising attorneys do not just need better drafts. They need to know:

  • what ran
  • what information was used
  • what changed during review
  • what became final
  • who approved it

That is much easier when review is a system boundary instead of a cultural expectation.
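
One way to see what that implies: each of those supervisory questions should map to a recorded fact. A minimal sketch, again in TypeScript, with assumed, illustrative field names:

  // Hypothetical audit record for one piece of generated work.
  // Each supervisory question maps to a recorded field, not a recollection.
  interface ReviewAuditRecord {
    draftId: string;
    task: string;               // what ran
    sourcesUsed: string[];      // what information was used
    edits: Array<{              // what changed during review
      editedBy: string;
      timestamp: string;        // ISO 8601
      summary: string;
    }>;
    finalVersionId?: string;    // what became final
    approvedBy?: string;        // who approved it
    approvedAt?: string;
  }

None of those fields are exotic. The discipline is in requiring them to exist before anything becomes final.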

Without that structure, legal AI starts to behave like consumer software with a professional disclaimer attached.

That is not enough.

What a serious buyer should examine

Model branding is easy to compare. Workflow discipline is harder to compare, but much more important.

A serious buyer should examine:

  • where draft becomes final
  • what requires human review before external effect
  • what records exist of generation, editing, and approval
  • what the system can block automatically
  • what a supervising attorney can actually see

Those answers tell you more about whether a legal AI system belongs in real practice than any benchmark chart.

The real comparison

The legal AI market likes model comparisons because they are easy to market.

Review boundaries are harder to market because they require the product to be disciplined.

But that is the real comparison that matters.

The best legal AI systems will not be the ones with the most impressive demos. They will be the ones that make legal judgment visible, enforce review before legal effect, and keep draft output inside a real workflow boundary until a lawyer acts.

That is why review boundaries matter more than model choice.
