Scrap2API – Easy scraping from Excel / LibreOffice

You may want to pull some content from the Internet into Excel. In some cases, the basic functionality of Excel / LibreOffice is enough to get the data, but in most cases you will need more complex processing. So I wrote a simple script that scrapes the content with a regular expression, XPath or CSS selector and exposes the results through a very simple API that Excel / LibreOffice can use. (more…)
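To give a rough idea of the regular-expression part, here is a minimal sketch in Python. It is not the Scrap2API code itself: the function names `scrape_with_regex` and `as_plain_text`, the sample HTML and the one-value-per-line output are all assumptions made for illustration. XPath and CSS selectors would need a third-party HTML parser and are left out here.

```python
import re

def scrape_with_regex(html: str, pattern: str) -> list[str]:
    """Return every match of `pattern` in `html`.

    If the pattern contains a capture group, the first group is returned;
    otherwise the whole match is returned.
    """
    regex = re.compile(pattern, re.IGNORECASE | re.DOTALL)
    results = []
    for match in regex.finditer(html):
        results.append(match.group(1) if regex.groups else match.group(0))
    return results

def as_plain_text(values: list[str]) -> str:
    """Join the values one per line (hypothetical output format,
    chosen because plain text is easy to split in a spreadsheet)."""
    return "\n".join(value.strip() for value in values)

# Hypothetical page content; a real script would download it first.
SAMPLE_HTML = """
<table>
  <tr><td class="price">12.50</td></tr>
  <tr><td class="price">7.80</td></tr>
</table>
"""

prices = scrape_with_regex(SAMPLE_HTML, r'<td class="price">(.*?)</td>')
print(as_plain_text(prices))
```

Returning plain text rather than a richer format is just one possible choice; it keeps the spreadsheet side free of any parsing logic.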

Offline extraction of a WordPress site

I want an offline, browsable static version of my WordPress website that I can put on a USB drive or upload to a static backup location. I searched for WordPress plugins to do that, and wp2static seemed very promising. But it turned out to be disappointing (version 6.1) because of many flaws in the crawler (many URLs were missed) and in the way URLs are rendered, as it is mainly intended to output pages with a full target URL (relative URLs do not really work at all). I tried to patch the plugin, but the code was too difficult to understand and modify. So I decided to use a tool outside WordPress: the well-known HTTrack, which I used years ago. (more…)
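To illustrate why relative URLs matter for an offline copy, here is a small Python sketch of the rewriting that a static export has to perform. It is neither what wp2static nor what HTTrack does internally: the function name `to_relative` and the example.com URLs are assumptions, and the sketch only handles absolute links that point to the same host as the page.

```python
import posixpath
from urllib.parse import urlsplit

def to_relative(link: str, page_url: str) -> str:
    """Rewrite an absolute same-host link so it is relative to the page
    that contains it. Links to other hosts, and links that are not
    absolute URLs, are returned unchanged."""
    link_parts = urlsplit(link)
    page_parts = urlsplit(page_url)
    if link_parts.netloc != page_parts.netloc:
        return link  # external or already-relative link: leave it alone

    # Directory of the page that holds the link, e.g. /blog/post
    page_dir = posixpath.dirname(page_parts.path) or "/"
    target = link_parts.path or "/"
    relative = posixpath.relpath(target, start=page_dir)

    # Keep the fragment so in-page anchors still work (query strings are dropped).
    if link_parts.fragment:
        relative += "#" + link_parts.fragment
    return relative

# Hypothetical URLs used only for this example.
page = "https://example.com/blog/post/index.html"
print(to_relative("https://example.com/img/logo.png", page))
print(to_relative("https://other.example.org/file.css", page))
```

With links rewritten this way, the exported folder can be opened from any location, such as a USB drive, without depending on the original domain.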
