The Scraper Assistant

scraper-assistant-link.png

This tool allows you to easily create a Scraper Config XML to be used with the Scraper Host Object. The Scraper Host Object can be used in a Mashup as follows;
	var config = <config>
                    <var-def name='response'>
                        <html-to-xml>
                            <http method='get' url='http://ww2.wso2.org/~builder/'/>
                        </html-to-xml>
                    </var-def>
                </config>;
	var scraper = new Scraper(config);
	var response = scraper.response;

The above XML segment can be generated via this tool using a Menu Driven interface, which writes the XML for you.


ui-intro.png


You can also use this tool to harvest XPath expressions off target Web Pages easily.

Step 1 - Enter the URL to scrape as illustrated below


step1.png

Step 2 - Highlight and click the page element to harvest its XPath expression.


step2.png