Crawling an Entire Site
- Add a step with the Load Page action that loads the main page.
- Add a new step and choose the Crawl Pages action.
- On the Rules tab, add a Crawling Rule that applies to all pages in the site, e.g. by specifying the domain that the pages belong to or by making a pattern that the URL should match. For these pages, the rule should specify "Crawl Entire Page" and "Output the Page".
- On the Rules tab, set the "For all Other Pages" property to "Do Not Crawl".
- After the step with the Crawl Pages action, add steps to handle each page, e.g. by extracting information into returned variables.