Skip to main content

Colter Test

Simulate real AI shopping agents interacting with your store. 10 personas, 3 AI models, actionable results.

TL;DR: Run colter test https://your-store.com to send AI shopping personas through your store and score the experience. Add --json for structured output, --pdf for a report, or --fix to generate follow-up fixes automatically.

colter.test returns structured JSON built for agent workflows, regression checks, and CI jobs.

What Test Does

colter test simulates real shopping prompts across multiple model families and scores the outcomes. It answers a different question than Check: not just "is the protocol there?" but "does the agent succeed?"

Personas

Ten personas run by default, with the_comparer available as an opt-in persona.

Persona groupFocus
Platform shoppersProtocol flows, browser flow, mobile flow
Intent shoppersSecurity, pricing clarity, data quality, returns, edge cases

Scenarios

Typical scenarios include:

  • discovery
  • product info
  • policy comprehension
  • checkout readiness
  • competitive comparison
  • recommendation
  • edge cases

Requirements

  • colter auth login
  • Pro plan with Test add-on, or Agency / Enterprise plan

CLI

colter test <url> [flags]

Common Flags

FlagPurpose
--models LISTChoose claude, gpt, gemini
--personas LISTFilter personas
--scenarios LISTFilter scenarios
--jsonStructured output
--parallel NConcurrent persona runs
--timeout DURATIONPer-test timeout
--budget AMOUNTMax spend in USD
--threshold NExit non-zero below this score
--skip-checkSkip the pre-check phase
--check-json FILEReuse a saved check result
--fixGenerate fix plans for weak dimensions
--fix-threshold NCutoff used with --fix
--applyApply fixes after test when --fix is set
--dry-runGenerate fix content without writing
--pdfCreate a PDF report
--pdf-out PATHSet the PDF path
--uploadUpload results
--headedShow the browser window
--browserRun the browser-based WebMCP flow
--api-url URLOverride API base URL

Examples

colter test https://store.example.com
colter test https://store.example.com --models claude,gemini --json
colter test https://store.example.com --threshold 70 --json
colter test https://store.example.com --fix --fix-threshold 75
colter test https://store.example.com --pdf --pdf-out report.pdf

Output Highlights

The JSON payload includes:

  • overall score
  • per-persona results
  • per-scenario results
  • per-model scores
  • recommendations
  • token and cost totals

CI

colter test exits with code 1 when the final score is below --threshold.

colter test https://mystore.com --threshold 70 --json
  1. Run Check.
  2. Run Test to see interaction failures.
  3. Run Fix on the weak areas.
  4. Re-run Test.
  5. Use Lens for live traffic after launch.

Pricing

Test is:

  • available as a paid add-on on Pro
  • included on Agency
  • included on Enterprise

Next Steps