Agentbase provides agents with full Chrome browser capabilities for navigating websites, interacting with web applications, extracting data, and automating web workflows.

Overview

The Browser primitive gives agents access to a full Google Chrome browser running within their execution environment. This enables sophisticated web automation, data extraction, and testing capabilities without any additional setup:
  • Web Navigation: Visit any website and navigate through pages
  • Element Interaction: Click buttons, fill forms, submit data
  • Data Extraction: Scrape content, tables, images, and structured data
  • JavaScript Execution: Run custom JavaScript in the browser context
  • Screenshots: Capture full pages or specific elements
  • Session Management: Handle cookies, authentication, and multi-page workflows

Full Chrome Browser

Google Chrome 140.0.7339.127 with complete rendering and JavaScript support

Headless & GUI Modes

Run the browser invisibly for automation or with a display for debugging

Automation Tools

Built-in support for Selenium, Playwright, and Puppeteer

Internet Access

Full access to the web with cookies, sessions, and authentication

How Browser Automation Works

Agents use browser capabilities automatically when you request web-based tasks:
// Agent automatically uses browser tools
const result = await agentbase.runAgent({
  message: "Navigate to example.com and extract the main heading"
});

// Browser is launched, page is loaded, data is extracted

Browser Lifecycle

  1. Launch: Browser starts when needed (headless by default)
  2. Navigate: Open URLs and wait for page load
  3. Interact: Click, type, scroll, execute JavaScript
  4. Extract: Scrape data, take screenshots, save content
  5. Close: Browser closes automatically after task completion
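
Each stage is handled automatically, so a single request can exercise the whole lifecycle. A minimal sketch using the same runAgent call shown above (the URL and page details are illustrative):
// One request that walks the full lifecycle: launch, navigate,
// interact, extract, and automatic close when the task completes
const lifecycle = await agentbase.runAgent({
  message: `Visit https://example.com/pricing,
  click the "Annual" toggle,
  extract the plan names and prices,
  and take a screenshot of the pricing table`
});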

Browser Specifications

Software Details

  • Browser: Google Chrome 140.0.7339.127 (Official Build) (64-bit)
  • ChromeDriver: Compatible version for Selenium automation
  • Rendering Engine: Blink (Chromium)
  • JavaScript: V8 engine with full ES6+ support
  • Modes: Headless (default) and GUI available

Capabilities

Element Interaction

  • Click buttons and links
  • Fill input fields and text areas
  • Select dropdown options
  • Check/uncheck checkboxes
  • Submit forms
  • Hover over elements
const interact = await agentbase.runAgent({
  message: "Go to the search page, enter 'AI tools', and submit the search"
});

Data Extraction

  • Extract text content
  • Parse HTML structure
  • Extract links and images
  • Scrape tables and lists
  • Get element attributes
  • Access page metadata
const scrape = await agentbase.runAgent({
  message: "Extract all product names and prices from the e-commerce page"
});

JavaScript Execution

  • Run custom JavaScript
  • Inject scripts into pages
  • Access window and document objects
  • Interact with page JavaScript
  • Return values from executed scripts
const js = await agentbase.runAgent({
  message: "Execute JavaScript to scroll to the bottom of the page"
});

Screenshots

  • Capture full page screenshots
  • Screenshot specific elements
  • Save as PNG or JPEG
  • Different viewport sizes
  • Mobile and desktop views
const screenshot = await agentbase.runAgent({
  message: "Take a screenshot of the homepage and save it"
});

Session Management

  • Handle cookies
  • Maintain authentication
  • Session persistence
  • Local storage access
  • Cache management
const session = await agentbase.runAgent({
  message: "Login to the website and navigate to the dashboard"
});

Code Examples

Basic Web Navigation

// Simple page visit
const visit = await agentbase.runAgent({
  message: "Navigate to https://example.com and show the page title"
});

// Navigate and extract
const extract = await agentbase.runAgent({
  message: "Go to news.ycombinator.com and extract the top 5 story titles"
});

// Follow links
const follow = await agentbase.runAgent({
  message: "Visit the homepage, click on the About link, and show the content"
});

Web Scraping

// Scrape structured data
const scrape = await agentbase.runAgent({
  message: `Visit https://example-store.com/products and extract:
  - Product names
  - Prices
  - Availability status
  Save the data as products.json`
});

// Multi-page scraping
const multiPage = await agentbase.runAgent({
  message: `Scrape the first 3 pages of search results:
  1. Visit the search page
  2. Extract results from page 1
  3. Click next and scrape page 2
  4. Click next and scrape page 3
  5. Combine all results into a CSV file`
});

// Table extraction
const table = await agentbase.runAgent({
  message: "Extract the data table from the page and convert to CSV"
});

Form Interaction

// Fill and submit form
const form = await agentbase.runAgent({
  message: `Fill out the contact form:
  - Name: John Doe
  - Email: [email protected]
  - Message: Hello, I have a question
  Then submit the form and confirm submission`
});

// Login workflow
const login = await agentbase.runAgent({
  message: `Login to the website:
  1. Navigate to /login
  2. Enter username: testuser
  3. Enter password: testpass
  4. Click submit
  5. Verify successful login`
});

// Search and filter
const search = await agentbase.runAgent({
  message: `Use the search and filter:
  1. Enter "laptops" in search box
  2. Select "Price: Low to High" from dropdown
  3. Check "In Stock Only" checkbox
  4. Click Search
  5. Extract the results`
});

Screenshots and Visual Testing

// Full page screenshot
const screenshot = await agentbase.runAgent({
  message: "Navigate to homepage and take a full page screenshot"
});

// Element screenshot
const element = await agentbase.runAgent({
  message: "Take a screenshot of just the navigation menu"
});

// Multiple screenshots
const comparison = await agentbase.runAgent({
  message: `Take screenshots for visual testing:
  1. Desktop view (1920x1080)
  2. Tablet view (768x1024)
  3. Mobile view (375x667)
  Save all three screenshots`
});

// Before/after comparison
const changes = await agentbase.runAgent({
  message: `Compare page changes:
  1. Take screenshot of current state
  2. Click "Show More" button
  3. Take screenshot of new state
  4. Highlight differences`
});

JavaScript Execution

// Execute custom JavaScript
const executeJS = await agentbase.runAgent({
  message: "Run JavaScript to get the page's scroll height"
});

// Interact with page JavaScript
const interact = await agentbase.runAgent({
  message: "Execute JavaScript to trigger the page's custom modal"
});

// Extract dynamic data
const dynamic = await agentbase.runAgent({
  message: `Use JavaScript to:
  1. Wait for data to load via AJAX
  2. Extract the dynamically loaded content
  3. Return as JSON`
});

// Modify page content
const modify = await agentbase.runAgent({
  message: "Execute JavaScript to highlight all external links on the page"
});

Use Cases

1. Competitive Intelligence

Monitor competitor websites:
const competitive = await agentbase.runAgent({
  message: `Competitor analysis:
  1. Visit competitor website
  2. Extract current pricing for all products
  3. Screenshot their homepage
  4. Extract featured products
  5. Save all data to competitor_data.json
  6. Compare with our pricing`
});

2. Web Testing

Automated UI and functionality testing:
const testing = await agentbase.runAgent({
  message: `Test user registration flow:
  1. Navigate to /signup
  2. Fill registration form with test data
  3. Submit form
  4. Verify email confirmation page
  5. Take screenshot of confirmation
  6. Check for any errors or issues
  7. Generate test report`
});

3. Data Collection

Gather data from multiple sources:
const collection = await agentbase.runAgent({
  message: `Collect real estate listings:
  1. Visit property listing site
  2. Search for "apartments in San Francisco"
  3. Extract all listings (address, price, bedrooms, etc.)
  4. Visit each listing for detailed info
  5. Save all data to properties.csv
  6. Create summary statistics`
});

4. Research and Monitoring

Track information over time:
const research = await agentbase.runAgent({
  message: `Research trending topics:
  1. Visit top tech news sites
  2. Extract today's headlines
  3. Identify trending topics
  4. Visit each article and extract summary
  5. Create a consolidated report
  6. Save as research_report.md`
});

5. Form Automation

Automate repetitive form submissions:
const automation = await agentbase.runAgent({
  message: `Submit bulk job applications:
  1. Read applicant_data.csv
  2. For each application:
     - Navigate to application form
     - Fill in all fields from CSV
     - Upload resume.pdf
     - Submit form
     - Save confirmation number
  3. Generate submission report`
});

6. Content Verification

Verify website content and links:
const verification = await agentbase.runAgent({
  message: `Website health check:
  1. Visit all pages listed in sitemap.xml
  2. Check for broken links
  3. Verify images load correctly
  4. Check page load times
  5. Screenshot any error pages
  6. Generate health report`
});

Best Practices

Reliable Web Scraping

// Good: Wait for dynamic content
const good = await agentbase.runAgent({
  message: `Visit the page and wait for the products to load
  before extracting data`
});

// Avoid: Extracting before content loads
const bad = await agentbase.runAgent({
  message: "Quickly visit page and extract data"
});

// Good: Navigate through all pages
const good = await agentbase.runAgent({
  message: `Scrape all pages:
  - Extract data from current page
  - Click next button
  - Repeat until no more pages
  - Combine all results`
});

// Good: Specific instructions
const good = await agentbase.runAgent({
  message: "Extract prices from elements with class 'product-price'"
});

// Avoid: Vague instructions
const bad = await agentbase.runAgent({
  message: "Get all the prices"
});

Error Handling

Graceful Failures: Instruct agents to handle common web issues like timeouts, missing elements, and navigation errors.
// Robust error handling
const robust = await agentbase.runAgent({
  message: `Scrape data with error handling:
  - If page doesn't load, retry up to 3 times
  - If element not found, log and continue
  - If data format unexpected, note in output
  - Generate report of successful and failed extractions`
});
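
The example above handles failures inside the task; you can also guard the call itself. A minimal sketch, assuming runAgent rejects its promise when a task fails outright (scrapeWithRetry and the URL are illustrative):
// SDK-level guard: retry a task, then surface the error if it keeps failing
async function scrapeWithRetry(message, attempts = 2) {
  for (let attempt = 1; attempt <= attempts; attempt++) {
    try {
      return await agentbase.runAgent({ message });
    } catch (error) {
      console.warn(`Attempt ${attempt} failed:`, error);
      if (attempt === attempts) throw error;
    }
  }
}

const data = await scrapeWithRetry(
  "Visit https://example.com/status and extract the status table"
);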

Performance Optimization

Minimize Page Loads

Extract all needed data in one visit when possible

Use Headless Mode

Headless mode is faster; use GUI mode only for debugging

Parallel Scraping

Scrape multiple pages concurrently when appropriate; a sketch follows the examples below

Cache Responses

Save scraped data to avoid re-scraping
// Efficient: Extract all data at once
const efficient = await agentbase.runAgent({
  message: `Visit the page and extract:
  - All product names
  - All prices
  - All descriptions
  - All images
  In a single visit`
});

// Inefficient: Multiple visits
const inefficient = await agentbase.runAgent({
  message: "Visit page, get names, then visit again for prices..."
});
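
The Parallel Scraping tip can also be applied at the SDK level by issuing independent runAgent calls concurrently. A minimal sketch, assuming each page can be scraped in isolation (the store URL and categories are illustrative):
// One agent run per category page, executed concurrently;
// keep concurrency modest to respect rate limits
const categories = ["laptops", "monitors", "keyboards"];

const results = await Promise.all(
  categories.map((category) =>
    agentbase.runAgent({
      message: `Visit https://example-store.com/${category} and extract product names and prices as JSON`
    })
  )
);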

Ethical Considerations

Respect robots.txt: Always follow website terms of service and robots.txt directives. Be mindful of scraping frequency and server load.
// Responsible scraping
const responsible = await agentbase.runAgent({
  message: `Scrape responsibly:
  - Check robots.txt first
  - Add delays between requests
  - Respect rate limits
  - Don't overload the server`
});

Integration with Other Primitives

With File System

Save scraped data to files:
const combined = await agentbase.runAgent({
  message: `Scrape product data and save:
  1. Extract data from website
  2. Save as products.json
  3. Also create products.csv
  4. Generate summary report as report.md`
});
Learn more: File System Primitive

With Computer

Use programming tools with browser automation:
const automation = await agentbase.runAgent({
  message: `Create web scraping script:
  1. Install selenium and beautifulsoup4
  2. Write Python script for scraping
  3. Run the script
  4. Save results to database`
});
Learn more: Computer Primitive

With Web Search

Combine search with browsing:
const research = await agentbase.runAgent({
  message: `Research topic:
  1. Search web for "topic"
  2. Visit top 5 results
  3. Extract key information from each
  4. Create comprehensive summary`
});
Learn more: Web Search Extension

With Sessions

Maintain browser state across requests:
// Login persists across requests
const login = await agentbase.runAgent({
  message: "Login to the website"
});

const authenticated = await agentbase.runAgent({
  message: "Navigate to account dashboard and extract data",
  session: login.session  // Maintains login state
});
Learn more: Sessions Primitive

Performance Considerations

Browser Startup

  • Cold Start: First browser launch ~2-3 seconds
  • Warm Start: Subsequent pages in same session are faster
  • Headless Mode: 20-30% faster than GUI mode

Page Load Times

  • Simple Pages: 1-3 seconds
  • Complex SPAs: 3-10 seconds
  • Heavy Content: 10+ seconds
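
These figures vary by site and task; a simple way to measure your own workload is to time the call. A quick sketch (the URL is illustrative):
// Rough wall-clock timing for a single browser task
const start = Date.now();

const timed = await agentbase.runAgent({
  message: "Navigate to https://example.com and extract the main heading"
});

console.log(`Completed in ${((Date.now() - start) / 1000).toFixed(1)}s`);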

Optimization Tips

// Fast: Headless mode (default)
const base = await agentbase.runAgent({
  message: "Scrape data quickly in headless mode"
});

// Slow: GUI mode (only when needed)
const slow = await agentbase.runAgent({
  message: "Open browser with display for debugging"
});

Resource Usage

  • Memory: 200-500MB per browser instance
  • CPU: Varies with page complexity
  • Network: Depends on page size and requests

Advanced Techniques

Handling Dynamic Content

const dynamic = await agentbase.runAgent({
  message: `Handle dynamic content:
  1. Wait for AJAX calls to complete
  2. Scroll to trigger lazy loading
  3. Wait for animations to finish
  4. Extract the fully loaded content`
});

Managing Cookies and Sessions

const session = await agentbase.runAgent({
  message: `Manage sessions:
  1. Login and save cookies
  2. Navigate to protected pages using saved session
  3. Extract user-specific data
  4. Logout when done`
});

Handling Popups and Alerts

const popups = await agentbase.runAgent({
  message: `Handle popups:
  1. Navigate to page
  2. If popup appears, close it
  3. If alert shows, accept it
  4. Continue with main task`
});

Mobile Emulation

const mobile = await agentbase.runAgent({
  message: `Test mobile view:
  1. Set viewport to mobile size (375x667)
  2. Set mobile user agent
  3. Navigate to website
  4. Take screenshots
  5. Extract mobile-specific content`
});

Proxy and Network Control

const network = await agentbase.runAgent({
  message: `Monitor network:
  1. Start network monitoring
  2. Navigate to page
  3. Capture all API requests
  4. Extract request/response data
  5. Save network log`
});

Troubleshooting

Problem: Browser cannot load the page
Solutions:
  • Check URL is correct and accessible
  • Wait longer for page load
  • Check for JavaScript errors
  • Try different user agent
const fix = await agentbase.runAgent({
  message: `If page doesn't load:
  - Wait up to 30 seconds
  - Check for error messages
  - Try loading a different page to test connection`
});

Problem: Cannot find element to interact with
Solutions:
  • Wait for element to appear
  • Check selector is correct
  • Verify element is visible
  • Check if it’s in an iframe
const fix = await agentbase.runAgent({
  message: `Find element with retry:
  - Wait for element to appear (up to 10s)
  - If not found, check page source
  - List available elements for debugging`
});

Problem: Page JavaScript errors affect functionality
Solutions:
  • Check browser console for errors
  • Try different approach
  • Use direct JavaScript execution
const fix = await agentbase.runAgent({
  message: "Check console for JavaScript errors and try alternative approach"
});

Problem: Operations timing out
Solutions:
  • Increase wait time
  • Check network connection
  • Simplify task
const fix = await agentbase.runAgent({
  message: "If timeout occurs, wait longer and retry with simpler approach"
});

Problem: Website blocking automation
Solutions:
  • Add delays between requests
  • Use realistic user agent
  • Respect rate limits
  • Consider alternative data sources
const respectful = await agentbase.runAgent({
  message: `Scrape respectfully:
  - Add 2-3 second delays between requests
  - Use standard browser user agent
  - Limit to reasonable number of pages`
});

Browser vs Web Search

When to use the Browser primitive versus the Web Search extension (a short contrast example follows the lists below):

Use Browser When

  • Need to interact with page elements
  • Extracting structured data from specific sites
  • Filling forms or logging in
  • Taking screenshots
  • Testing web applications
  • Navigating multi-step workflows

Use Web Search When

  • Finding information across the web
  • Need current events or recent data
  • Quick fact-checking
  • Discovering relevant URLs
  • Broad research topics
  • Multiple source aggregation
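
As a concrete contrast, the same kind of request can be phrased for either capability. A short sketch (both messages and the portal URL are illustrative):
// Browser: interact with a specific, known site
const browserTask = await agentbase.runAgent({
  message: "Go to portal.example.com, log in, and download this month's invoice"
});

// Web Search: discover sources across the web first
const searchTask = await agentbase.runAgent({
  message: "Search the web for recent articles on headless browser tooling and summarize the findings"
});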


Remember: Browser capabilities are available automatically when you describe web-based tasks. Agents handle browser management, navigation, and data extraction for you.