Robot Structure

Robots mimic human behavior; they do (more or less) what you do when you are looking for content on the Internet using a browser: You start by searching for the content. Once found, you read and process it. Similarly, most robots can be divided into two parts: a navigation part and an extraction part.

Navigation is concerned with "getting to where the content is." Navigation mainly includes loading pages and submitting forms. When navigating in Design Studio, you typically use the Click action to navigate through and among web pages.

Extraction is concerned with "getting the right content." Extraction mainly includes selecting, copying, and normalizing content from a web page that you navigate to. When extracting in Design Studio, you typically use the Test Tag action to skip uninteresting ("noisy") content, the Extract action to copy content into variables, and the data converters for normalizing the content so that it gets the format you want, such as the right date and number format. Once extracted, you output the value with the Store in Database or Return Value action.

A typical robot starts with one or more steps, each containing a Load Page or Click action to navigate to the interesting content on a web site. It proceeds with one or more steps, each containing an Extract action, and ends with a step storing or returning the extracted value.

Note that in many robots the navigation and extraction parts overlap because the content to extract is located on several pages. Again, this is similar to looking for content yourself; often, you have to visit several pages to get the content you want.

Most robots include other actions than the ones mentioned above, such as a For Each Tag action for loading several similar looking pages or extracting values from several similar looking table rows. Because robots have different tasks, they have different needs. For this reason, we have included a considerable number of step actions and data converters in Design Studio. Start by familiarizing yourself with the basic and most commonly used step actions and data converters, and then begin to explore. Experience shows that you can create most robots using only a handful of step actions and data converters. So, find your own favorite step actions and data converters and stick to them until you feel a need to explore others.