Manual Workflow
Step-by-step guide to using the Manual Scraper in MrScraper
The Manual Scraper in MrScraper allows you to create custom scraping workflows by defining a series of steps to extract data from web pages.
Tip
This mode is ideal for users who need full control over the scraping process and want to handle complex scenarios.
When to Use Manual Workflow
Use the Manual Scraper when you need to:
- Extract data from complex or dynamic websites that require specific interactions.
- Implement custom workflows that involve multiple steps, such as navigation, data extraction, and pagination.
- Handle situations where our markdown converter cleans or alters the HTML, causing the AI to fail when parsing it into JSON.
Manual Scraper Features
Below are the available step types you can add when building a manual scraper:
| Step Type | Description |
|---|---|
| Extract | Scrapes data from the webpage by setting an Extraction Name, choosing an Extraction Type (Text, Inner HTML, Outer HTML, or Attribute), and defining CSS selectors to target elements. |
| Click | Simulates a click action on a specified element. |
| Delay | Pauses the scraper for a set duration (in milliseconds) before moving to the next step. |
| Input | Enters text into input fields on the webpage. |
| Scroll | Scrolls to the end of the page, until specific text is found, or to a specific element (or until a given number of elements have loaded). |
| Inject JavaScript | Runs custom JavaScript code on the webpage. |
| Paginate | Automatically navigates through multiple pages. |
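The Inject JavaScript step is the most flexible of these. As the examples below suggest, a script is typically wrapped in an IIFE and returns the JSON-serializable value you want the step to capture. A minimal sketch (the `h1` selector is illustrative only):

```js
// Minimal Inject JavaScript sketch: wrap the logic in an IIFE and
// return a JSON-serializable value. The `h1` selector is illustrative.
(() => {
  const title = document.querySelector('h1')?.textContent?.trim();
  return { title };
})();
```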
Pagination Types
| Pagination Type | Required Parameter |
|---|---|
| Query Pagination | Query parameter (e.g., ?page=2) |
| Directory Pagination | None (e.g., /list/page/2) |
| Next Page Link | Next page selector |
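To make the distinction concrete, here is an illustrative sketch of how each type advances through pages (the URLs are made up; MrScraper handles the navigation once the parameter is configured):

```js
// Illustrative only: how each pagination type advances to page n.
const base = 'https://example.com/list';

// Query Pagination: the page number rides in a query parameter (?page=2).
const queryPage = n => `${base}?page=${n}`;

// Directory Pagination: the page number is a path segment (/list/page/2).
const directoryPage = n => `${base}/page/${n}`;

console.log(queryPage(2));     // https://example.com/list?page=2
console.log(directoryPage(2)); // https://example.com/list/page/2

// Next Page Link has no URL pattern: the scraper repeatedly clicks the
// element matched by your next page selector, e.g. 'a.ui-paginator-next'.
```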
Usage Examples
Example 1: Simple Website
Suppose you want to scrape data from this product page:
https://www.target.com/p/beats-studio-pro-bluetooth-wireless-headphones/-/A-89459966
On this site, the price is loaded dynamically through an internal API, so the AI Scraper won’t be able to extract it automatically. To capture this data, you’ll need to switch to the Manual Workflow and inject a small custom JavaScript snippet.
Follow the steps below:
- Create a scraper using the product URL above.
- After the AI Scraper completes the initial extraction, open the Manual Workflow tab at the top.
- Add an Inject JavaScript step.
- Fill the Name field with `data`, and Script timeout with `50`.
- Use this script:

```js
(() => { return window.__TGT_DATA__.__PRELOADED_QUERIES__.queries })();
```

Tip
You can locate the price by inspecting the page’s source code.
For this specific page, the price is available under the __TGT_DATA__ object in the window.
Note: The data structure varies by website, so the location of price information may differ on other pages.
Don't know how to write your own script?
Contact Us and we'll help you build a custom script for your manual workflow.
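If you'd rather hunt for the data yourself, one generic approach is to search the window object from the DevTools console. A minimal sketch (the key name "price" and the depth limit are assumptions; adapt them per site):

```js
// DevTools console sketch: recursively list dotted paths whose key name
// contains "price". Depth-limited so the traversal stays bounded even if
// the object graph contains cycles.
function findKeys(obj, needle, path = '', depth = 0, out = []) {
  if (obj === null || typeof obj !== 'object' || depth > 6) return out;
  for (const [key, value] of Object.entries(obj)) {
    const next = path ? `${path}.${key}` : key;
    if (key.toLowerCase().includes(needle)) out.push(next);
    findKeys(value, needle, next, depth + 1, out);
  }
  return out;
}

findKeys(window.__TGT_DATA__, 'price'); // dotted paths containing "price"
```

Run it in the console on the product page; each returned path is a candidate location for the price data.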
- Save the configuration, then run the scraper.
- The scraper returns the following output:

Result:

```json
{
  "data": {
    "...",
    "product": {
      "...",
      "price": {
        "formatted_comparison_price": "$349.99",
        "formatted_comparison_price_type": "reg",
        "formatted_current_price": "$165.99 - $169.99",
        "formatted_current_price_type": "sale",
        "location_id": 3991,
        "current_retail_min": 165.99,
        "reg_retail_max": 349.99
      },
      "..."
    },
    "..."
  }
}
```
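If you post-process the scraper output yourself, the price fields can be read off the path shown above. A minimal sketch, assuming the output has been parsed into a hypothetical `result` object:

```js
// Post-processing sketch: pull the price fields out of the output above.
// `result` is assumed to hold the parsed scraper output; optional chaining
// guards against the structure differing on other product pages.
const price = result?.data?.product?.price;
console.log(price?.formatted_current_price); // "$165.99 - $169.99"
console.log(price?.current_retail_min);      // 165.99
```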
Example 2: Complex Website
Some websites cannot be scraped with the AI Scraper alone, especially when you need to fill in a form before data appears. One example is:
https://www.handelsregister.de/rp_web/normalesuche/welcome.xhtml
Since the site requires entering search details before showing company information, you’ll need to use a Manual Workflow to automate the steps.
Follow the steps below:
- Create a scraper using the URL above.
- After the AI Scraper completes the initial extraction, open the Manual Workflow tab at the top.
- Add an Input step. Fill Input Field Selector with `textarea[title="Company or keywords:"]` and Text Input with the company you want to search for; in this example we'll use `Volkswagen & Audi Club`.
- Add a Delay step with a `1000` ms duration.
- Add a Click step and select `button[name="form:btnSuche"]` as the Element to Click.
- Add another Delay step with a `10000` ms duration.
- Add an Inject JavaScript step, fill the Name field with `data` and Script timeout with `900`, then use this script:
```js
(async () => {
// Wait for correct URL with timeout
const targetUrl = "https://www.handelsregister.de/rp_web/sucheErgebnisse/welcome.xhtml?cid=1";
const maxAttempts = 15;
const checkInterval = 2000; // 2 seconds
let attempts = 0;
let urlMatches = false;
console.log('Waiting for correct URL...');
while (attempts < maxAttempts) {
if (window.location.href === targetUrl) {
urlMatches = true;
console.log('URL matched! Starting to parse...');
break;
}
attempts++;
console.log(`Attempt ${attempts}/${maxAttempts}: Current URL does not match. Waiting...`);
await new Promise(resolve => setTimeout(resolve, checkInterval));
}
if (!urlMatches) {
console.log('Timeout: URL never matched the target URL');
return null;
}
// Helper function to create unique key for an entry
function getEntryKey(entry) {
return `${entry.region}|${entry.court}|${entry.companyName}|${entry.location}`;
}
// Helper function to check if page has changed
function hasPageChanged(currentResults, lastPageKeys) {
if (currentResults.length === 0) return false;
const currentKeys = currentResults.map(getEntryKey);
// Check if at least one entry is different
return currentKeys.some(key => !lastPageKeys.has(key));
}
// Parsing functions
function parseCurrentPage() {
const table = document.getElementById('ergebnissForm:selectedSuchErgebnisFormTable_data');
if (!table) {
console.log('Table not found on page');
return [];
}
const rows = table.querySelectorAll('tr[data-ri]');
const results = [];
rows.forEach(row => {
const entry = {};
// Extract region and court info
const headerCell = row.querySelector('.fontTableNameSize');
if (headerCell) {
const headerText = headerCell.textContent.trim();
const parts = headerText.split(/\s{2,}/);
entry.region = parts[0]?.trim() || '';
entry.court = parts[1]?.trim() || '';
}
// Extract company name
const nameCell = row.querySelector('.marginLeft20');
if (nameCell) {
entry.companyName = nameCell.textContent.trim();
}
// Extract location (Sitz)
const locationCell = row.querySelector('.sitzSuchErgebnisse .verticalText');
if (locationCell) {
entry.location = locationCell.textContent.trim();
}
// Extract registration status
const statusCells = row.querySelectorAll('.verticalText');
if (statusCells.length > 1) {
entry.status = statusCells[1].textContent.trim();
}
// Extract document types (AD, CD, HD, etc.)
const docLinks = row.querySelectorAll('.dokumentList .underlinedText');
entry.documentTypes = Array.from(docLinks).map(link => link.textContent.trim());
// Extract history entries
const historyRows = row.querySelectorAll('.RegPortErg_HistorieZn');
if (historyRows.length > 0) {
entry.history = [];
historyRows.forEach(histRow => {
const histText = histRow.querySelector('.fontSize85')?.textContent.trim();
const histLocation = histRow.closest('tr')?.querySelector('.RegPortErg_SitzStatus .fontSize85')?.textContent.trim();
if (histText) {
entry.history.push({
name: histText,
location: histLocation || ''
});
}
});
}
results.push(entry);
});
return results;
}
function isNextButtonDisabled() {
const nextButton = document.querySelector('a.ui-paginator-next');
return nextButton && nextButton.classList.contains('ui-state-disabled');
}
const allResults = [];
const seenKeys = new Set();
let pageNumber = 1;
const delay = 1000;
const maxRetries = 3; // Max retries if page hasn't changed
console.log(`Parsing page ${pageNumber}...`);
// Parse first page
const firstPageResults = parseCurrentPage();
firstPageResults.forEach(entry => {
const key = getEntryKey(entry);
if (!seenKeys.has(key)) {
seenKeys.add(key);
allResults.push(entry);
}
});
console.log(`Page ${pageNumber}: Found ${firstPageResults.length} entries (${allResults.length} unique so far)`);
// Continue clicking next until button is disabled
while (!isNextButtonDisabled()) {
const nextButton = document.querySelector('a.ui-paginator-next');
if (!nextButton) {
console.log('Next button not found');
break;
}
// Store current page keys for comparison
const lastPageKeys = new Set(seenKeys);
// Click next button
nextButton.click();
pageNumber++;
// Wait for page to load and retry if needed
let retries = 0;
let pageChanged = false;
while (retries < maxRetries) {
await new Promise(resolve => setTimeout(resolve, delay));
const currentPageResults = parseCurrentPage();
pageChanged = hasPageChanged(currentPageResults, lastPageKeys);
if (pageChanged) {
console.log(`Parsing page ${pageNumber}...`);
// Add only unique entries
let newEntries = 0;
currentPageResults.forEach(entry => {
const key = getEntryKey(entry);
if (!seenKeys.has(key)) {
seenKeys.add(key);
allResults.push(entry);
newEntries++;
}
});
console.log(`Page ${pageNumber}: Found ${currentPageResults.length} entries (${newEntries} new, ${allResults.length} unique total)`);
break;
} else {
retries++;
if (retries < maxRetries) {
console.log(`Page ${pageNumber}: Data not updated yet, retrying (${retries}/${maxRetries})...`);
}
}
}
if (!pageChanged) {
console.log(`Page ${pageNumber}: Data still not updated after ${maxRetries} retries, skipping...`);
}
}
console.log(`\nCompleted! Total unique entries: ${allResults.length}`);
return allResults;
})();
```

- Save the configuration, then run the scraper.
- The scraper returns the following output:

Result:

```json
{
"data": [
{
"region": "Bavaria",
"court": "District court München VR 201131",
"companyName": "1.Volkswagen & Audi Club Mittenwald e.V.",
"location": "Mittenwald",
"status": "currently registered",
"documentTypes": [
"AD",
"CD",
"DK",
"UT",
"VÖ",
"SI"
]
},
{
"region": "Baden-Württemberg",
"court": "District court Stuttgart VR 381348",
"companyName": "VW-Audi Club Härten e.V.",
"location": "Kusterdingen",
"status": "currently registered",
"documentTypes": [
"AD",
"CD",
"HD",
"DK",
"UT",
"VÖ",
"SI"
],
"history": [
{
"name": "1.) VW-Audi Club Härten",
"location": "1.) Kusterdingen"
}
]
}
]
}
```
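If you need these results in a flat format, the output can be post-processed outside MrScraper. A minimal sketch, assuming the JSON above has been parsed into a hypothetical `result` object (the `history` field is omitted for brevity):

```js
// Post-processing sketch: flatten the output above into CSV rows.
// `result` is assumed to hold the parsed JSON shown above.
function toCsv(entries) {
  const header = ['region', 'court', 'companyName', 'location', 'status', 'documentTypes'];
  const rows = entries.map(e =>
    [e.region, e.court, e.companyName, e.location, e.status, (e.documentTypes ?? []).join(' ')]
      .map(v => `"${String(v ?? '').replace(/"/g, '""')}"`) // quote and escape
      .join(',')
  );
  return [header.join(','), ...rows].join('\n');
}

console.log(toCsv(result.data));
```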