Arbeitsagentur Jobs Listing Scraper
Collection of job listing pages for Arbeitsagentur
Summary
Scrapes job data from Arbeitsagentur.de search results for the keywords and locations you specify. Multiple keyword and location pairs can be processed in a single run.
Overview
This app collects job data from Arbeitsagentur.de based on the keywords you provide. For each keyword pair (job + location), it grabs a specified number of result pages and saves the data to an Excel document of your choice. Each page contains 25 jobs, and the collected data is automatically de-duplicated. The app is ideal for users who want to capture jobs from several locations.
How to Use
- Download the app from the Octoparse AI app store.
- Launch the app from your list.
- Parameter Description
- filePath: The path to the Excel file that contains the keyword pairs you want to search for. The first row is the table header. Below it, enter the keyword in the first column and the location in the second column. You can enter multiple keywords in the file, each keyword in a separate cell, and each cell may contain only one word. The sheet where the keywords are stored must be named Sheet1.
- Pagination: The number of pages to scrape for each keyword pair (each page contains 25 jobs).
- Click "Run application"
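The input file layout described for filePath can be prepared by hand in Excel, or with a short script. The following is a minimal sketch, not part of the app itself: the function names, the header text, and the use of the third-party openpyxl package are assumptions made for illustration.

```python
import re

# Header text is an assumption; the app only requires that row 1 is a header row.
HEADER = ("keyword", "location")


def build_rows(pairs):
    """Return the header row plus one (keyword, location) row per pair.

    Raises ValueError if a cell is empty or contains more than one word,
    since each cell may hold only one word.
    """
    rows = [HEADER]
    for keyword, location in pairs:
        cleaned = []
        for cell in (keyword, location):
            word = cell.strip()
            if not word or re.search(r"\s", word):
                raise ValueError(f"each cell must hold exactly one word: {cell!r}")
            cleaned.append(word)
        rows.append(tuple(cleaned))
    return rows


def write_workbook(rows, path):
    """Write the rows to a sheet named Sheet1 (needs openpyxl installed)."""
    from openpyxl import Workbook  # imported lazily; only needed when writing

    workbook = Workbook()
    sheet = workbook.active
    sheet.title = "Sheet1"  # the app expects exactly this sheet name
    for row in rows:
        sheet.append(row)
    workbook.save(path)
```

For example, `write_workbook(build_rows([("Ingenieur", "Berlin"), ("Pflege", "Hamburg")]), "keywords.xlsx")` would produce a file with a header row and two keyword pairs; the file name here is hypothetical.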
Output
- The application collects data from the specified pages for each keyword pair and saves the results to the input file.
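To sanity-check the size of the output, it can help to estimate the largest possible number of rows before de-duplication. The sketch below only restates the arithmetic from the description (25 jobs per page); the function names and the idea of counting unique identifiers are illustrative assumptions, and the app's actual de-duplication criterion is not documented here.

```python
JOBS_PER_PAGE = 25  # stated in the app description


def max_jobs(num_pairs, pages_per_pair):
    """Upper bound on collected jobs before de-duplication."""
    if num_pairs < 0 or pages_per_pair < 0:
        raise ValueError("counts must be non-negative")
    return num_pairs * pages_per_pair * JOBS_PER_PAGE


def count_unique(job_ids):
    """Number of distinct jobs, given some identifier per job (hypothetical)."""
    return len(set(job_ids))


# 3 keyword pairs, 4 pages each: at most 3 * 4 * 25 jobs
print(max_jobs(3, 4))  # 300
```

The saved total can be lower than this bound, both because duplicates are removed and because a search may return fewer pages than requested.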
Notes
- Do not use the mouse or keyboard while the app is running to avoid interruptions.
- The data is automatically de-duplicated, so the number of saved jobs can be lower than the number of pages multiplied by 25. If the totals don't add up, check for duplicates first.
- No additional Python packages are required for this application.
- If a cookie pop-up window appears during collection, close it manually. Otherwise, the webpage cannot display the results properly after the keywords are entered.
Troubleshooting
Ran into a problem while running the app and not sure how to resolve it?
Please contact our support team at [email protected] for help.
To help us better understand your issue, follow the steps below to export your running logs:
Version
version 1
2025-04-07