CLI reference¶
Every command supports --help. Most accept --json (full report) or --handoff (distilled signal); add --quiet for machine mode — only JSON on stdout, logs on stderr, with stable exit codes:
| Exit code | Meaning |
|---|---|
0 |
pass or warn |
2 |
fail |
3 |
error (machine mode) |
agentvision demo¶
Run the 60-second demo: broken page -> FAIL -> loop to fixed -> PASS (no API key).
agentvision analyze¶
Render and analyze an artifact with a vision backend (+ DOM/CV grounding).
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--backend |
anthropic|openai|gemini|local | |
--instructions |
Task context for the vision model. | |
--expected |
What the artifact was supposed to look like. | |
--brief |
The intended product — graded for intent match. | |
--expect |
A required visual claim (repeatable; prefix 'should:'/'nice:'). | |
--reference |
Reference/mockup image the render should match. | |
--source-type |
auto|html|file|url|svg|pdf|image | auto |
--viewport |
WxH, e.g. 1280x800 | |
--full-page |
||
--wait-for |
CSS selector to wait for before capture (for client-rendered data). | |
--settle-ms |
Quiet wait (ms) after load so client-rendered data can populate. | |
--freeze |
Pause animations + rAF before capture (default on; needed for canvas/WebGL). | |
--nav-wait |
load|domcontentloaded|networkidle (default load; networkidle is bounded). | |
--render-timeout |
Max render seconds. | |
--allow-local |
Allow localhost / LAN URLs. | |
--no-ocr |
Disable OCR grounding. | |
--json |
Emit JSON. | |
--handoff |
Emit the eyes→brain handoff signal (JSON) for an agent/brain to act on. | |
--quiet |
Machine mode: only JSON on stdout, logs on stderr, stable exit codes (0 pass/warn, 2 fail, 3 error). |
agentvision conform¶
Grade an artifact against intent — does it match what you set out to build?
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--brief |
Free-text description of the intended product. | |
--expect |
A required visual claim (repeatable; prefix 'should:'/'nice:'). | |
--reference |
Reference/mockup image the render should match. | |
--backend |
anthropic|openai|gemini|ollama|local | |
--source-type |
auto |
|
--viewport |
WxH | |
--full-page |
||
--wait-for |
CSS selector to wait for first. | |
--settle-ms |
Quiet wait (ms) after load. | |
--freeze |
Pause animations + rAF. | |
--nav-wait |
load|domcontentloaded|networkidle. | |
--render-timeout |
Max render seconds. | |
--allow-local |
Allow localhost / LAN URLs. | |
--json |
||
--handoff |
Emit the eyes→brain handoff signal (JSON) for an agent/brain to act on. | |
--quiet |
Machine mode: only JSON on stdout. |
agentvision check¶
Classic DOM/CV checks only — no LLM, no API key, no egress.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--source-type |
auto |
|
--viewport |
WxH | |
--full-page |
on | |
--wait-for |
CSS selector to wait for first. | |
--settle-ms |
Quiet wait (ms) after load. | |
--freeze |
Pause animations + rAF. | |
--nav-wait |
load|domcontentloaded|networkidle. | |
--render-timeout |
Max render seconds. | |
--allow-local |
Allow localhost / LAN URLs. | |
--json |
||
--handoff |
Emit the eyes→brain handoff signal (JSON) for an agent/brain to act on. | |
--quiet |
Machine mode: only JSON on stdout. |
agentvision watch¶
Watch an artifact over time — verify playback / loading / liveness, not just a glance.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--backend |
Vision backend for the time-aware pass. | |
--frames |
How many frames to sample. | |
--interval-ms |
Delay between frames (ms). | |
--brief |
Intended behavior (e.g. 'the video plays'). | |
--expect |
A required behavior (repeatable). | |
--no-vision |
Deterministic signals only. | |
--allow-local |
Allow localhost / LAN URLs. | |
--nav-wait |
load|domcontentloaded|networkidle. | |
--render-timeout |
Max seconds. | |
--json |
||
--handoff |
Emit the eyes→brain handoff signal. | |
--quiet |
Machine mode: only JSON on stdout. |
agentvision loop¶
Run the visual feedback loop (re-renders the source up to --max-iter times).
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--backend |
||
--max-iter |
3 |
|
--instructions |
||
--brief |
The intended product — graded for intent match. | |
--expect |
A required visual claim (repeatable; prefix 'should:'/'nice:'). | |
--reference |
Reference/mockup image the render should match. | |
--nav-wait |
load|domcontentloaded|networkidle. | |
--settle-ms |
Quiet wait (ms) after load. | |
--freeze |
Pause animations + rAF. | |
--render-timeout |
Max render seconds. | |
--allow-local |
Allow localhost / LAN URLs. | |
--json |
agentvision generate¶
Generative loop: generate → see → grade vs intent → refine prompt → regenerate.
| Option | Description | Default |
|---|---|---|
--generator |
Generator hook as 'module:function' — a callable (prompt:str)->image_path. | |
--brief |
Free-text description of what to generate. | |
--expect |
A required visual claim (repeatable; prefix 'should:'/'nice:'). | |
--reference |
Reference/mockup image the output should match. | |
--backend |
Vision backend used to perceive + refine. | |
--max-iter |
4 |
|
-o, --out |
agentvision-generated.png |
|
--json |
agentvision render¶
Render an artifact to a PNG.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
-o, --out |
agentvision-render.png |
|
--source-type |
auto |
|
--viewport |
WxH | |
--full-page |
on | |
--wait-for |
CSS selector to wait for first. | |
--settle-ms |
Quiet wait (ms) after load. | |
--freeze |
Pause animations + rAF. | |
--nav-wait |
load|domcontentloaded|networkidle. | |
--render-timeout |
Max render seconds. | |
--allow-local |
Allow localhost / LAN URLs. |
agentvision diff¶
Compare two images (SSIM + annotated diff).
Arguments: BASELINE, CANDIDATE
| Option | Description | Default |
|---|---|---|
-o, --out |
agentvision-diff.png |
|
--threshold |
Min SSIM to pass. | 0.98 |
--json |
agentvision ocr¶
Extract text (+ word boxes) from an artifact via Tesseract.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--source-type |
auto |
|
--json |
agentvision sheet¶
Render a responsive contact sheet across breakpoints.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--breakpoints |
Comma-separated widths. | 375,768,1280,1920 |
-o, --out |
agentvision-sheet.png |
agentvision baseline¶
Capture and store a named baseline for regression.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--name |
Baseline name. | |
--source-type |
auto |
agentvision regress¶
Render a source and compare it to a named baseline.
Arguments: SOURCE
| Option | Description | Default |
|---|---|---|
--name |
||
-o, --out |
agentvision-regress.png |
|
--threshold |
0.98 |
|
--json |
agentvision doctor¶
Diagnose rendering + backend readiness.
| Option | Description | Default |
|---|---|---|
--fix |
Install the Chromium browser. |
agentvision serve¶
Start the REST service.
| Option | Description | Default |
|---|---|---|
--host |
127.0.0.1 |
|
--port |
8000 |
agentvision version¶
Print the AgentVision version.