I have recently come across a web page containing a graph object that displays the (x, y) values on the object as the mouse is rolled across it. Is there any way to automate the extraction of this data?
How is the graph data loaded? If embedded in the page source then you can extract it with xpath or regex. Else use Firebug to see how it is loaded.
You will need a solution that works inside the web browser, so the AJAX/Javascript is properly rendered.
I have used iMacros with good success for web scraping in the past. There are free/open-source and "PRO" paid editions (comparison table here).
Another option is always to custom code something with the Microsoft webbrowser control.
Source: http://stackoverflow.com/questions/3980774/how-to-extract-data-from-web-2-0-graphs-using-a-scraper
How is the graph data loaded? If embedded in the page source then you can extract it with xpath or regex. Else use Firebug to see how it is loaded.
You will need a solution that works inside the web browser, so the AJAX/Javascript is properly rendered.
I have used iMacros with good success for web scraping in the past. There are free/open-source and "PRO" paid editions (comparison table here).
Another option is always to custom code something with the Microsoft webbrowser control.
Source: http://stackoverflow.com/questions/3980774/how-to-extract-data-from-web-2-0-graphs-using-a-scraper
 
No comments:
Post a Comment