← All posts

Why Review Boundaries Matter More Than Model Choice

April 2, 2026·5 min read·AI & Technology

Reading mode

The legal AI market still obsesses over the wrong comparison.

People ask:

  • Claude or GPT?
  • hosted or local?
  • fine-tuned or not?
  • how many tokens?

Those questions are not meaningless. They are not the first question a law firm should ask.

The first question should be simpler:

Where is the review boundary?

If a legal AI system cannot enforce a real boundary between draft output and legal effect, model branding cannot carry the product.

The market keeps making the same evaluation mistake.

Buyers compare model brands because model brands are easy to compare. They are legible. They fit on a slide.

But model choice is not where most legal risk sits.

The real comparison is not model versus model

A better model can improve drafting quality, recall, reasoning, and fluency.

But legal risk does not begin and end with output quality.

Legal work is sensitive because it creates consequences:

  • a filing can misstate authority
  • a letter can waive leverage
  • a summary can omit a material fact
  • a demand can overstate a record
  • a citation can be presented as reliable when it has not been verified

The useful comparison is not just:

  • Model A versus Model B

The better comparison:

  • loose workflow versus bounded workflow

Do not stop at whether the model is strong. Ask whether the surrounding system treats its output as draft, reviewable, and bounded before anyone relies on it.

The problem sits in workflow design, not model selection.

Review boundary is a product decision

A real review boundary is not a disclaimer that says lawyers should check the output.

It has to be enforced system state.

That means the product can distinguish between:

  • generated draft
  • pending review
  • edited draft
  • approved output
  • rejected output

And it means the system can prevent certain things from happening until review occurs.

For legal AI, that should include at least:

  • sending externally effective communications
  • exporting final legal work product
  • filing or issuing operative output
  • marking generated material as final approved work

That enforcement makes review structural rather than aspirational.

ABA 512 and Heppner both point in the same direction

ABA Formal Opinion 512 was issued on July 29, 2024. It remains the clearest ABA statement of the duties lawyers still carry when generative AI is part of representation.

Those duties include:

  • competence
  • confidentiality
  • supervisory responsibilities
  • candor
  • reasonable fees

Read at the systems level, 512 does not say "pick the best model." It says lawyers remain responsible for what the system does and how they use it.

That implies a design requirement:

AI output should move through a reviewable workflow before it becomes externally effective legal work.

Source:

United States v. Heppner, No. 25-cr-00503-JSR (S.D.N.Y.), points in the same direction from the litigation side.

Judge Rakoff ruled from the bench on February 10, 2026 and issued a written memorandum on February 17, 2026. The court held that the defendant's written exchanges with Anthropic's consumer version of Claude were protected by neither the attorney-client privilege nor the work product doctrine on the facts presented.

Heppner is not a general anti-AI ruling. It reminds buyers that weak workflow boundaries create legal consequences.

Generic review language is not enough. The system has to make review real.

Source:

Why review boundaries beat model choice

Imagine two legal AI systems.

System A uses the best available model, but the workflow is loose. Generated output can be copied, exported, or acted on with minimal structure. Review is expected, but not strongly enforced.

System B uses a slightly weaker model, but the workflow is strict. Output is task-bounded, provenance is visible, and the system keeps draft and final states separate until a lawyer acts.

For real legal practice, System B is often the better system.

Model quality still counts. Review boundaries determine whether that model quality is being used inside a controlled legal workflow at all.

Review boundary is also a supervision layer

Partners and supervising attorneys do not just need better drafts. They need to know:

  • what ran
  • what information was used
  • what changed during review
  • what became final
  • who approved it

That becomes much easier when review is a system boundary instead of a cultural expectation.

Without that structure, legal AI starts to behave like consumer software with a professional disclaimer attached.

A disclaimer is not enough.

What buyers should actually ask

Model branding is easy to compare. Workflow discipline is harder and more revealing.

A buyer should ask:

  • where draft becomes final
  • what requires human review before external effect
  • what records exist of generation, editing, and approval
  • what the system can block automatically
  • what a supervising attorney can actually see

Those answers tell you more about whether a legal AI system belongs in real practice than any benchmark chart.

The best legal AI systems will not be the ones with the most impressive demos. They will be the ones that make legal judgment visible, enforce review before legal effect, and keep draft output inside a real workflow boundary until a lawyer acts.

Review boundaries matter more than model choice for that reason.

The infrastructure legal runs on.

Guided by attorney judgment.