Runs

Execute QA runs and interpret results.

Run Types

Canary supports three types of runs, each designed for different testing scenarios:

Run TypePurposePrerequisites
Test RunsExecute all published workflows as regression testsAt least one published workflow
Chaos RunsAutonomous AI agents explore and test your appA property configured
Workflow RunsExecute a single workflow (documented below)A workflow to run

Workflow Runs

A workflow run is a single execution of one workflow, capturing pass/fail outcomes, screenshots, and video recordings.

Starting a run

  1. Open a workflow in the Flow Designer
  2. Click Run Flow in the header
  3. The sidebar switches to the Runs tab showing live execution

Runs can also be triggered as part of a smoke suite or via the API.

Run statuses

StatusMeaning
QueuedWaiting to start
RunningExecution in progress
WaitingPaused at a Wait node—will resume automatically
SuccessAll steps passed
FailedOne or more steps failed
CanceledManually stopped

Live execution

While a run is in progress, you'll see:

  • Browser stream: Live view of the current browser state
  • Agent reasoning: What the AI is doing at each moment
  • Canvas status: Running nodes pulse; completed nodes show green (success) or red (failure)

Run details

Click any run to see:

  • Video: Full session recording with agent thoughts overlaid on the timeline
  • Screenshots: Per-step browser captures
  • Steps: Each node's status, duration, and any error messages

Investigating failures

When a run fails:

  1. A failure banner shows which step failed and the error message
  2. Click View Details to open the failure drawer with:
    • Failure screenshot (browser state at the moment of failure)
    • Additional context for debugging
  3. The failed node is highlighted on the canvas

Wait nodes

Workflows with Wait nodes pause mid-execution:

  1. Run executes until it hits the Wait node
  2. Status shows Waiting with a countdown timer
  3. Execution resumes automatically after the delay
  4. The next Login node re-establishes the browser session

Artifacts

Each run stores:

ArtifactDescription
ScreenshotsBrowser capture at each step
VideoFull session recording