Skip to main content

Get details of web page

Sophie avatar
Written by Sophie
Updated over 2 weeks ago

Definition and Usage

Get details of web page allows you to extract specific attributes from a web page, such as its URL, title, source code, or page text. This command is useful for capturing web page information that can be used in subsequent workflow steps for analysis, validation, or data extraction.


Parameter Values

Input parameters

Parameter

Description

Possible Values

Required

Options / Notes

Web page

Select a variable that contains the web page to work with

Yes

Must be a valid web page variable

Select attribute

Select the attribute to extract from the web page

Page URL, Page title, Source code, Page text

Yes

Choose the specific web page attribute you want to extract

Store attribute into

Store the attribute into a new variable

Yes

The output variable will contain the extracted attribute value

Error handling

Parameter Name

Description

Throw error & stop

When an error occurs, the action will trigger an error and stop the execution of the entire app.

Retry command

If an error occurs, the action will retry the command in an attempt to resolve the issue and continue the process.

Ignore error & continue

When an error occurs, the action will be ignored, and the workflow will continue without interruption.

Variables produced

This action produces a variable containing the selected attribute of the web page. The output variable will store the URL, title, HTML source code, or text content depending on which attribute was selected.


Using Variables in Conditions

You can use the output variable in subsequent actions by referencing it with the {variable} syntax. For example, you might use the extracted page title in a condition to verify if you're on the correct page, or use the source code for further parsing and data extraction.


Notes

  • Before using this command, ensure you have a valid web page object variable created from previous actions like "Go to web page" or "Navigate on web page".

  • The "Page text" attribute extracts visible text content from the page, which can be useful for text analysis or searching for specific content.

  • The "Source code" attribute returns the complete HTML of the page, which may be large for complex websites.

  • The extracted attribute's format depends on the selected attribute type - URLs and titles are typically strings, while source code is HTML content.

Did this answer your question?