Octoparse AI

Get details of web page allows you to extract specific attributes from a web page, such as its URL, title, source code, or page text. This command is useful for capturing web page information that can be used in subsequent workflow steps for analysis, validation, or data extraction.

___________________________________________________________

This action produces a variable containing the selected attribute of the web page. The output variable will store the URL, title, HTML source code, or text content depending on which attribute was selected.

You can use the output variable in subsequent actions by referencing it with the {variable} syntax. For example, you might use the extracted page title in a condition to verify if you're on the correct page, or use the source code for further parsing and data extraction.

Before using this command, ensure you have a valid web page object variable created from previous actions like "<a href="https://www.octoparse.ai/helpcenter/en/articles/8347239-go-to-web-page">Go to web page</a>" or "<a href="https://www.octoparse.ai/helpcenter/en/articles/8360738-navigate-on-web-page">Navigate on web page</a>".

The "Page text" attribute extracts visible text content from the page, which can be useful for text analysis or searching for specific content.

The "Source code" attribute returns the complete HTML of the page, which may be large for complex websites.

The extracted attribute's format depends on the selected attribute type - URLs and titles are typically strings, while source code is HTML content.

- Before using this command, ensure you have a valid web page object variable created from previous actions like "<a href="https://www.octoparse.ai/helpcenter/en/articles/8347239-go-to-web-page">Go to web page</a>" or "<a href="https://www.octoparse.ai/helpcenter/en/articles/8360738-navigate-on-web-page">Navigate on web page</a>".
- The "Page text" attribute extracts visible text content from the page, which can be useful for text analysis or searching for specific content.
- The "Source code" attribute returns the complete HTML of the page, which may be large for complex websites.
- The extracted attribute's format depends on the selected attribute type - URLs and titles are typically strings, while source code is HTML content.

Get details of web page

Go to Octoparse AI

Find answers and get help from Intercom Support and Community Experts

This site employs cookies and other technologies that we and our third party vendors use to monitor and record personal information about you and your interactions with the site (including content viewed, cursor movements, screen recordings, and chat contents) for the purposes described in our Cookie Policy. By continuing to visit our site, you agree to our {websiteTermsLink}, {privacyPolicyLink} and {cookiePolicyLink}.

This site uses cookies and similar technologies ("cookies") as strictly necessary for site operation. We and our partners also would like to set additional cookies to enable site performance analytics, functionality, advertising and social media features. See our {cookiePolicyLink} for details. You can change your cookie preferences in our Cookie Settings.

We use cookies to make our site work and also for analytics and advertising purposes. You can enable or disable optional cookies as desired. See our {cookiePolicyLink} for more details.

You have the right to opt out of the sale of your personal information. See our {cookiePolicyLink} for more details about how we use your data.

Your Privacy Choices

We use cookies to enhance your experience. You can customize your cookie preferences below. See our {cookiePolicyLink} for more details.

Cookie Settings

Link, Press control-option-right-arrow to exit

Empty Help Center

Uh oh. That page doesn’t exist.

Disappointed

Neutral

Smiley

Thinking...

Searching through sources...

Analyzing...

Tickets submitted through the messenger or by a support agent in your conversation will appear here.

Parameter	Description	Possible Values	Required	Options / Notes
Web page	Select a variable that contains the web page to work with		Yes	Must be a valid web page variable
Select attribute	Select the attribute to extract from the web page	Page URL, Page title, Source code, Page text	Yes	Choose the specific web page attribute you want to extract
Store attribute into	Store the attribute into a new variable		Yes	The output variable will contain the extracted attribute value

Parameter Name	Description
Throw error & stop	When an error occurs, the action will trigger an error and stop the execution of the entire app.
Retry command	If an error occurs, the action will retry the command in an attempt to resolve the issue and continue the process.
Ignore error & continue	When an error occurs, the action will be ignored, and the workflow will continue without interruption.

Get details of web page

Definition and Usage

Parameter Values

Input parameters

Error handling

Variables produced

Using Variables in Conditions

Notes