Screenshots

Claudezilla screenshots use dynamic readiness detection to capture pages at the right moment, with quality presets to balance file size against detail.

Quality Presets

Use the purpose parameter to select a preset, or override with explicit quality and scale values.

Preset	Quality	Scale	Use Case
`quick-glance`	30	0.25	Layout checks, navigation confirmation
`read-text`	60	0.50	Reading content (default)
`inspect-ui`	80	0.75	UI details, small text
`full-detail`	95	1.00	Pixel-perfect inspection

// Quick layout check — small and fast
firefox_screenshot({ purpose: "quick-glance" })

// Default — good for most tasks
firefox_screenshot({ purpose: "read-text" })

// Need to read 12px font sizes
firefox_screenshot({ purpose: "inspect-ui" })

// Override preset with custom values
firefox_screenshot({ quality: 85, scale: 0.6 })

The purpose is a suggestion — explicit quality or scale values always take priority.

Annotation Badges

Set annotate: true to overlay numbered badges on interactive elements before capture. The response includes a labels map linking badge numbers to selectors.

const result = firefox_screenshot({ annotate: true })

// Response includes:
// {
//   dataUrl: "data:image/jpeg;base64,...",
//   labels: {
//     "1": { selector: "#login-btn", text: "Log In", role: "button" },
//     "2": { selector: "a.nav-home", text: "Home", role: "link" },
//     "3": { selector: "#email", text: "", role: "input" }
//   }
// }

This is useful for vision-model workflows: take an annotated screenshot, identify elements by badge number, then use the corresponding selector to interact.

// 1. Capture with annotations
const shot = firefox_screenshot({ annotate: true, purpose: "inspect-ui" })

// 2. Identify badge #3 is the email input
// 3. Type into it using the selector from labels
firefox_type({ selector: "#email", text: "user@example.com" })

Readiness Detection

By default, screenshots wait for the page to settle before capturing. The detection pipeline:

Network idle — waits for pending XHR/fetch/script requests to complete
Visual idle — optionally waits for images and fonts (up to 3s)
Render settlement — double requestAnimationFrame + requestIdleCallback

The response includes timing data showing what happened:

{
  readiness: {
    waitMs: 347,
    timedOut: false,
    timeline: [
      { t: 0, event: "start" },
      { t: 45, event: "critical_idle" },
      { t: 312, event: "visual_idle" },
      { t: 347, event: "render_settled" }
    ]
  }
}

If the page is already idle, the fast path captures in under 50ms.

Controlling Readiness

Parameter	Default	Description
`maxWait`	10000	Maximum ms to wait before capturing anyway
`waitForImages`	true	Wait for images/fonts to load
`skipReadiness`	false	Skip all detection (instant capture)

// Text-heavy page — skip image waiting
firefox_screenshot({ waitForImages: false })

// Page is already loaded — instant capture
firefox_screenshot({ skipReadiness: true })

// Give a slow SPA more time
firefox_screenshot({ maxWait: 20000 })

When Readiness Times Out

If maxWait is reached, the screenshot is still taken — readiness.timedOut will be true. This prevents hanging on pages with perpetual network activity (analytics pings, websockets).

Mutex Serialization

All screenshot requests are serialized through a mutex to prevent tab-switching collisions when multiple agents capture simultaneously. If another agent holds the mutex for more than 3 seconds, you receive a MUTEX_BUSY error:

MUTEX_BUSY: Screenshot mutex held by another agent.
  Holder: agent_ec2e...
  Held for: 6234ms
  Hint: Use getPageState (no mutex) or retry after delay.

Use firefox_get_page_state as a mutex-free alternative when you only need structured data, not a visual capture.