My plan is to search the WikiLeaks cables for any Persons/Companies/Officers in the Offshore Leaks data and combine the two in a graph database (Neo4j). If anyone has already got the WikiLeaks data, can you let me know? I checked WikiLeaks but they don't seem to offer it as a download.
I am trying to download the WikiLeaks cables with wget but can't find a good index page to start from.
The info at (https://cryptome.wikileaks.org/0003/wikileaks-wikiing-10-1207.htm) is outdated and has a lot of broken links.
I have downloaded the key cable info without the body text from (https://www.theguardian.com/news/datablog/2010/nov/29/wikileaks-cables-data#data).
That is good for building the graph structure, but I need the body text to search.
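To make the search step concrete, here is a rough sketch of what I have in mind once the body text is available. It is only an illustration under assumptions: the function names (build_name_pattern, find_mentions), the cable IDs, and the example names are all made up, and I am assuming the cables end up as a dict of ID to body text and the Offshore Leaks names as a plain list of strings.

```python
import re


def build_name_pattern(names):
    """Compile one case-insensitive regex that matches any name as a whole phrase.

    Returns None when there are no usable names.
    """
    cleaned = [n.strip() for n in names if n.strip()]
    if not cleaned:
        return None
    # Longest names first, so "Acme Holdings Ltd" is preferred over "Acme Holdings".
    cleaned.sort(key=len, reverse=True)
    alternatives = "|".join(re.escape(n) for n in cleaned)
    # The lookarounds stop a name from matching inside a longer word.
    return re.compile(r"(?<!\w)(?:" + alternatives + r")(?!\w)", re.IGNORECASE)


def find_mentions(cables, names):
    """Return (cable_id, name) pairs, one per distinct name found in each cable body."""
    pattern = build_name_pattern(names)
    if pattern is None:
        return []
    # Map the lower-cased form back to the spelling used in the names list.
    canonical = {n.strip().lower(): n.strip() for n in names if n.strip()}
    mentions = []
    for cable_id, body in cables.items():
        found = {m.group(0).lower() for m in pattern.finditer(body)}
        for key in sorted(found):
            mentions.append((cable_id, canonical.get(key, key)))
    return mentions


if __name__ == "__main__":
    # Hypothetical data, only to show the shape of the input.
    example_cables = {
        "CABLE-001": "Met with representatives of Acme Holdings Ltd in the capital.",
        "CABLE-002": "Routine report with no relevant names.",
    }
    example_names = ["Acme Holdings Ltd", "John Example"]
    for cable_id, name in find_mentions(example_cables, example_names):
        print(cable_id, name)
```

Each (cable, name) pair could then become a relationship in Neo4j between a cable node and the matching Offshore Leaks node. Exact string matching like this will miss spelling variants and transliterations, so it is probably only a first pass.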
I am open to better ways of doing this.
This is my wget command, and it only gets one cable:
wget -r --mirror --no-parent --convert-links https://wikileaks.org/cablegate.html