Agent Testing Area

Complex web application scenarios for testing AI agent reasoning and navigation capabilities.

🛒 TechMart Dashboard

E-commerce admin dashboard with product management, orders, customers, and settings. Tests navigation, form filling, filtering, and multi-page workflows.

Navigation, Forms, Tables, Filters
Open Challenge →

🎯 Tool Obstacle Course

Comprehensive test page covering all CDP tool operations: clicks, inputs, dropdowns, checkboxes, iframes, modals, tabs, and dynamic content.

Clicks, Inputs, Iframes, Modals
Open Challenge →

👥 PeopleFirst HR

HR employee portal with employee management, time-off requests, payroll, and reports. Tests complex form workflows and data extraction.

Employee Data, Forms, Reports, Modals
Open Challenge →

🏦 VaultLine Financial

Banking portal for testing error categorization. The agent must navigate failure scenarios and return the correct error_category for each type of error.

Error Categories, Failure Handling, Navigation, Forms
Open Challenge →

🏠 RentEase Portal

Resident portal for rent payments with payment methods, autopay settings, payment history, and lease management.

Payments, Forms, Cards, Settings
Open Challenge →

🏢 Housing Alliance HTX

Property management portal with a unit directory, inspection history shown in paginated modals, and a caseworker directory. Tests CSV extraction from popups and multi-page tables.

CSV Extraction, Pagination, Modals, Tables
Open Challenge →
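
As a rough illustration of the CSV extraction task, the sketch below merges rows collected from several pages of a paginated table into one CSV string. The column names and row values are hypothetical placeholders, not data from the portal, and `pages_to_csv` is an illustrative helper rather than part of any challenge tooling.

```python
import csv
import io


def pages_to_csv(header, pages):
    """Write one header row, then the rows of every page in order."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    for rows in pages:
        writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical data: each inner list stands for one page of a modal table.
header = ["unit", "inspection_date", "result"]
pages = [
    [["101", "2024-01-05", "pass"], ["102", "2024-01-06", "fail"]],
    [["103", "2024-01-07", "pass"]],
]

csv_text = pages_to_csv(header, pages)
print(csv_text)
```

Using the `csv` module instead of joining strings by hand keeps quoting correct if a cell happens to contain a comma or a quote character.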

Sub-Agent Testing Challenges

Test agent orchestration patterns with parallel sub-agent delegation.

📋 Sectioned Form (4 Sections)

Large insurance application form with 4 independent sections. Ideal for testing parallel sub-agent delegation where each section can be handled by a separate agent.

4 Sections, Parallel Work, Sub-Agents, 30+ Fields
Pattern: Delegate each section to a sub-agent for parallel processing
Open Challenge →
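
The delegation pattern for independent sections can be sketched with `asyncio`, assuming each sub-agent is exposed as an async callable. The section names and the `fill_section` stand-in are hypothetical; a real sub-agent would drive the browser instead of returning immediately.

```python
import asyncio

# Hypothetical section names; the form's real section titles are not listed here.
SECTIONS = ["section_1", "section_2", "section_3", "section_4"]


async def fill_section(name: str) -> dict:
    """Stand-in for a sub-agent that fills one form section."""
    await asyncio.sleep(0)  # placeholder for real browser work
    return {"section": name, "status": "done"}


async def fill_form(sections: list[str]) -> list[dict]:
    # The sections are independent, so all sub-agents can run concurrently.
    # gather() returns results in the same order as the inputs.
    return await asyncio.gather(*(fill_section(s) for s in sections))


results = asyncio.run(fill_form(SECTIONS))
print(results)
```

Because `asyncio.gather` preserves input order, the orchestrator can match each result to its section without extra bookkeeping.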

🔄 Loop Iteration (10 Items)

Process 10 employee onboarding tasks. Each employee requires the same workflow of 3-4 clicks. Tests the ability to spawn a sub-agent for each iteration of a repetitive loop.

10 Items, 3-4 Clicks Each, Sub-Agents, Iteration
Pattern: Spawn a sub-agent for each employee's onboarding workflow
Open Challenge →
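
One way to sketch the per-item pattern is to spawn one task per employee while capping how many run at once. The employee identifiers, the `onboard` stand-in, and the `max_parallel` limit are all assumptions made for illustration.

```python
import asyncio

# Hypothetical identifiers; the challenge page supplies the real employee list.
EMPLOYEES = [f"employee_{i}" for i in range(1, 11)]


async def onboard(employee: str) -> str:
    """Stand-in for one sub-agent running the 3-4 click workflow."""
    await asyncio.sleep(0)  # placeholder for real browser work
    return f"{employee}: onboarded"


async def onboard_all(employees, max_parallel=3):
    limit = asyncio.Semaphore(max_parallel)

    async def run_one(employee):
        # At most max_parallel sub-agents are active at any moment.
        async with limit:
            return await onboard(employee)

    # return_exceptions=True keeps one failed item from cancelling the rest;
    # a failure shows up as an exception object in that item's slot.
    return await asyncio.gather(
        *(run_one(e) for e in employees), return_exceptions=True
    )


outcomes = asyncio.run(onboard_all(EMPLOYEES))
print(outcomes)
```

The semaphore is a design choice, not a requirement of the challenge: it limits load on the page under test while still letting the iterations overlap.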

Site Replicas

Realistic site replicas for testing pre-authorized debit (PAD) switching, form filling, and end-to-end scraping workflows.

💳 BillSwitch (PAD Switch)

Generic PAD switching flow: log in, select an account, fill in banking details (institution, transit, and account numbers), submit, and capture a confirmation screenshot.

Form Filling, Screenshot, Confirmation ID
Open Challenge →

Hydro Deck (Utility PAD)

Quebec-style utility company replica with a pre-authorized debit (PAD) switch flow. Supports single-account, multi-account list, and portfolio selection flows.

PAD Switch, Multi-Account, Portfolio, Confirmation ID
Open Challenge →