At Apify, we try to make building web scrapers and automation tools straightforward. You can customize our universal scrapers with JavaScript for quick tweaks, use our code templates for a rapid setup in JavaScript, TypeScript, or Python, or build from scratch with our JavaScript and Python SDKs or our Crawlee libraries for Node.js and Python when you need full flexibility and control. This guide offers a quick overview of our tools to help you find the right fit for your needs.
Universal scrapers were built to provide an intuitive UI and configuration options that help you start extracting data as quickly as possible. Usually, you just provide a simple JavaScript function, set up one or two parameters, and you're good to go.
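For example, a Web Scraper run often needs little more than a start URL and a page function along the lines of the sketch below. This is a minimal, illustrative example, not a ready-made recipe: it assumes the Inject jQuery option is enabled in the scraper's input, and the selectors will of course depend on the site you're scraping.

```js
// Minimal Web Scraper pageFunction sketch (illustrative only).
// Assumes the "Inject jQuery" option is enabled in the scraper's input.
async function pageFunction(context) {
    const { request, log, jQuery: $ } = context;
    log.info(`Scraping ${request.url}`);

    // Whatever object you return is stored in the dataset.
    return {
        url: request.url,
        title: $('title').text(),
        heading: $('h1').first().text().trim(),
    };
}
```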
Since scraping and automation come in various forms, we decided to build not just one, but six scrapers. This way, you can always pick the right tool for the job. Let's take a look at each particular tool and its advantages and disadvantages.
Scraper | Technology | Advantages | Disadvantages | Best for |
---|---|---|---|---|
🌐 Web Scraper | Headless Chrome browser | Simple; pages are fully rendered, including client-side JavaScript | Executes only client-side JavaScript | Websites with heavy client-side JavaScript |
👐 Puppeteer Scraper | Headless Chrome Browser | Powerful Puppeteer functions, executes both server-side and client-side JavaScript | More complex | Advanced scraping with client/server-side JS |
🎭 Playwright Scraper | Cross-browser support with Playwright library | Cross-browser support, executes both server-side and client-side JavaScript | More complex | Cross-browser scraping with advanced features |
🍩 Cheerio Scraper | HTTP requests + Cheerio parser (jQuery-like, server-side) | Simple, fast, cost-effective | Pages may not be fully rendered (no JavaScript rendering), executes only server-side JavaScript | High-speed, cost-effective scraping |
⚠️ JSDOM Scraper | JSDOM library (browser-like DOM API) | Handles client-side JavaScript, faster than full-browser solutions, ideal for light scripting | Not for heavy dynamic JavaScript, executes server-side code only, depends on pre-installed npm modules | Speedy scraping with light client-side JS |
🍲 BeautifulSoup Scraper | Python-based, HTTP requests + BeautifulSoup parser | Python-based, supports recursive crawling and URL lists | No full-featured web browser, not suitable for dynamic JavaScript-rendered pages | Python users needing simple, recursive crawling |
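To make the main trade-off concrete, here is a hedged sketch of what a Cheerio Scraper page function might look like. The HTML is fetched over plain HTTP and parsed server-side with Cheerio's jQuery-like `$`, which is why it's fast and cheap, but anything the page would normally render with client-side JavaScript simply won't be in the parsed document.

```js
// Illustrative Cheerio Scraper pageFunction sketch.
// `$` is a server-side Cheerio instance parsed from the raw HTML response,
// so content rendered by client-side JavaScript will not appear in it.
async function pageFunction(context) {
    const { $, request } = context;

    return {
        url: request.url,
        title: $('title').text(),
        // Selectors here run against the static HTML only.
        linkCount: $('a[href]').length,
    };
}
```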
<aside> 🎯 Decision points:
To make it easier, here's a short questionnaire that guides you to the best scraper for your specific use case:
</aside>
<aside> 📚 Resources:
▶ How to use Web Scraper to scrape any website
▶ How to use Beautiful Soup to scrape the web
▶ Learn about our $1/month Creator plan that encourages devs to build Actors based on Universal scrapers
</aside>
Like our universal scrapers, our code templates provide a quick start for developing web scrapers, automation scripts, and testing tools. Built on popular libraries such as BeautifulSoup for Python or Playwright for JavaScript, they save time on setup and let you focus on customization. They require more coding than universal scrapers, but they're ideal if you want a flexible foundation with plenty of room to tailor your solution.
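As a rough illustration of the level you work at with templates, here is a simplified Crawlee-style crawler in JavaScript using Playwright. It's a sketch rather than the actual template contents (the real templates also include Apify SDK boilerplate and project configuration), but the shape is similar: the library handles queuing, retries, and storage, and you fill in the request handler.

```js
import { PlaywrightCrawler, Dataset } from 'crawlee';

// Simplified sketch of a Playwright-based crawler; actual templates add
// Apify SDK setup (Actor.init/Actor.exit) and input handling around this.
const crawler = new PlaywrightCrawler({
    async requestHandler({ page, request, enqueueLinks, log }) {
        log.info(`Processing ${request.url}`);
        const title = await page.title();

        // Store one record per page and keep crawling links found on it.
        await Dataset.pushData({ url: request.url, title });
        await enqueueLinks();
    },
});

await crawler.run(['https://crawlee.dev']); // example start URL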