Skip to main content

Colter Test

Run real AI shopping journeys against your store. 10 personas, 3 AI model families, actionable results.

TL;DR: Run colter test https://your-store.com to send hosted AI shopping personas through your store and score the experience. Test runs on Colter's infrastructure and requires COLTER_API_KEY or a saved colter auth login session. Add --json for structured output, --pdf for a report, or --fix to generate follow-up fixes automatically.

colter.test returns structured JSON built for agent workflows, regression checks, and CI jobs.

What Test Does

colter test runs real shopping journeys across multiple model families and scores the outcomes. It answers a different question than Check: not just "is the protocol there?" but "does the agent succeed when it tries to use it?"

Personas

Ten personas run by default, with the_comparer available as an opt-in persona. Each persona makes live storefront or protocol requests, and the browser verification persona adds screenshots when the required tool surface is present.

Persona groupFocus
Platform shoppersProtocol flows, browser flow, mobile flow
Intent shoppersSecurity, pricing clarity, data quality, returns, edge cases

Scenarios

Typical scenarios include:

  • discovery
  • product info
  • policy comprehension
  • checkout readiness
  • competitive comparison
  • recommendation
  • edge cases

Requirements

  • COLTER_API_KEY=col_live_..., colter auth login, or --api-key
  • Pro, Agency, or Enterprise plan

CLI

colter test <url> [flags]

Common Flags

FlagPurpose
--models LISTChoose claude, gpt, gemini
--api-key KEYOverride COLTER_API_KEY for this run
--personas LISTFilter personas
--scenarios LISTFilter scenarios
--jsonStructured output
--parallel NConcurrent persona runs
--timeout DURATIONClient wait timeout for hosted results
--budget AMOUNTMax spend in USD
--threshold NExit non-zero below this score
--fixGenerate fix plans for weak dimensions
--fix-threshold NCutoff used with --fix
--applyApply fixes after test when --fix is set
--dry-runGenerate fix content without writing
--pdfCreate a PDF report
--pdf-out PATHSet the PDF path
--browserInclude browser_shopper when you already filter personas or scenarios
--api-url URLOverride API base URL

Examples

colter test https://store.example.com
colter test https://store.example.com --models claude,gemini --json
colter test https://store.example.com --threshold 70 --json
colter test https://store.example.com --fix --fix-threshold 75
colter test https://store.example.com --pdf --pdf-out report.pdf

Output Highlights

The JSON payload includes:

  • overall score
  • per-persona results
  • per-scenario results
  • per-model scores
  • recommendations
  • token and cost totals

CI

colter test exits with code 1 when the final score is below --threshold.

colter test https://mystore.com --threshold 70 --json
  1. Run Check.
  2. Run Test to see interaction failures.
  3. Run Fix on the weak areas.
  4. Re-run Test.
  5. Use Lens for live traffic after launch.

Pricing

Test is:

  • available as a paid add-on on Pro
  • included on Agency
  • included on Enterprise

Next Steps