STET

flux-pr-4807

Zod (TypeScript) · W2 · GPT-5.4

pass_with_warn

Tests passed. 2/2 commands passed. Strength: strong.

69.2% run pass rate
Tier 1
primary testspassednon equivalentdecision conflictfail
find . -name vitest.config.ts -exec sed -i 's/test: {/test: { testTimeout: 30000,/' {} +
gold passagent pass
pnpm test -- --maxWorkers 1 --maxConcurrency 1 --retry 2
gold passagent pass

Partial score: 2/2

Publishable: yesCache: miss

Trajectory

unknown · partial order only

Canonical trajectory missing; showing coarse derived order only.

patch written
Patch captured
#1

Stet captured agent.patch for this trial.

validation
Tests passed
#2
equivalence
Equivalence judgment
#3

non_equivalent

code review
Code review judgment
#4

fail

decision
Final decision
#5

pass_with_warn

Quality

equivalence
non_equivalent
93% confidence
code review
fail · 30/100
3 findings
footprint
low (0.25)
behavioral
100.0%
cost
$0.97 · 1.4M

Equivalence Reasoning

behavioral

The agent patch covers some intent (untracked-file hook checks, bench import switch, ecosystem entry, version bump, NEVER export wiring), but it misses key runtime/core changes from the task: no `core/api.ts` normalization updates, no `core/checks.ts`/`core/schemas.ts` consistency fixes, no `core/util.ts` `extend` validation tightening, and no `bench/package.json` refresh. It also changes docs anchors to `/api#refinements` and `/api#transforms` instead of the intended `/api#transform`, which may not resolve correctly.

Code Review

correctness: 1/4introduced bug risk: 1/4edge case handling: 1/4maintainability idioms: 2/4

The agent patch is only a partial match: it adds untracked-file guards and some exports/docs updates, but misses important requested changes and introduces likely broken docs anchors, so it likely does not fully satisfy the intended task.

3 findings
Pre-push hook dropped cleanup step
major

The updated `pre-push` script no longer runs `pnpm clean`, so build/test artifacts can persist after the hook, which diverges from the intended clean-workspace workflow.

.husky/pre-push:6
Docs links changed to likely-invalid API anchors
major

The async guidance links were changed to `/api#refinements` and `/api#transforms`; the intended target was `/api#transform` while keeping local refine references. This likely introduces broken navigation in docs.

packages/docs/content/basics.mdx:66
Bench dependency refresh is incomplete
major

The benchmark code now imports `zod4/v4`, but the companion dependency pin update for `zod4` is missing, leaving resolution/version behavior inconsistent with the intended maintained-entry-point migration.

packages/bench/object-moltar.ts:1