Scrap2API – Easy scrapping from Excel / LibreOffice
You may want to have in Excel some contents from the Internet. In some case, the basic functionalities of Excel / LibreOffice will be enough to get the data, but in most of the case, you will need more complex processing. So I wrote a simple script that will scrap the content with regular expression, xpath or css selector, and expose the results in a very simple API so that Excel / LibreOffice will be able to use it.
The script is available on github.com/rpeyron/scrap2api/ with full documentation.
A simple example to get the number of results of a google search with the different methods is provided (replace
/path-to/scrap.php with the path where you have put the scrap.php script on your PHP webserver)
- See the result in your browser:
- Get the result in Excel / LibreOffice, use formula :
=WEBSERVICE("/path-to/scrap.php/google-numresults-css/test?token=test")or in French editions :
- See the Swagger UI of the definition of the API :
The script is also extensible through plugins, please refer to developer’s documentation.