ZenRows
ZenRows is an enterprise-grade web scraping tool that provides advanced web data extraction capabilities at scale. ZenRows specializes in scraping modern websites, bypassing anti-bot systems, extracting structured data from any website, rendering JavaScript-heavy content, accessing geo-restricted websites, and more.
langchain-zenrows provides tools that allow LLMs to access web data using ZenRows' powerful scraping infrastructure.
Installation and Setup
pip install langchain-zenrows
You'll need to set up your ZenRows API key:
import os
os.environ["ZENROWS_API_KEY"] = "your-api-key"
Or you can pass it directly when initializing tools:
from langchain_zenrows import ZenRowsUniversalScraper
zenrows_scraper_tool = ZenRowsUniversalScraper(zenrows_api_key="your-api-key")
Tools
ZenRowsUniversalScraper
The ZenRows integration provides comprehensive web scraping features:
- JavaScript Rendering: Scrape modern SPAs and dynamic content
- Anti-Bot Bypass: Overcome sophisticated bot detection systems
- Geo-Targeting: Access region-specific content with 190+ countries
- Multiple Output Formats: HTML, Markdown, Plaintext, PDF, Screenshots
- CSS Extraction: Target specific data with CSS selectors
- Structured Data Extraction: Automatically extract emails, phone numbers, links, and more
- Session Management: Maintain consistent sessions across requests
- Premium Proxies: Residential IPs for maximum success rates
See more in the ZenRows tool documentation.