[ad_1]
Is there a way to scrape all content from a WP blog that you don’t own?
A friend of mine recently passed very suddenly, and we’d like to preserve his blog, possibly even have it made into a book for his family. No one has his login information, so we have no way to use the built-in tools to do this. There are nearly 500 posts so copying it manually would take forever. Would a tool like WP Scraper work?
[ad_2]
The executor of the estate might be able to send the web hosting company a death certificate to get access to everything. I would look into that as there might actually be some value in keeping the blog alive depending upon what it is and whether it is monetized.
This is pretty trivial to get running with Python/Ruby. You seed it with 1 or more urls and then keep scanning urls until no new urls are discovered. Then just save the HTML for each page.
It’s a little more involved when you make the local HTML look nice (e.g. if the data should be structured at all, or need to rewrite asset urls)
WP Scraper is worth a try! Use the pro version of the plugin.
Step one: choose a domain
Step two: start a hosting account at siteground and register your domain
Step three: install the WP Scraper plugin and start scraping.