Gone are the days of saving web pages individually and getting annoyed by the quirky rendering of html pages. Thanks to HTTrack, you can now download the complete website for offline reading. As its homepage tells
- HTTrack allows you to download a website from the Internet to your local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.
- HTTrack arranges the original site’s relative link-structure. Simply open a page of the “mirrored” website in your browser, and you can browse the site from link to link, as if you were viewing it online.
- HTTrack can also update an existing mirrored site, and resume interrupted downloads.
- HTTrack is fully configurable, and has an integrated help system.
- HTTrack supports both Windows and Linux.
Webhttrack is meant for Debian/Ubuntu. To download Webhttrack you need to run
sudo apt-get update sudo apt-get install webhttrack
You can launch the Webhttrack either by running webhttrack or by searching in the Ubuntu dash board. Webhttrack home page gets loaded in the browser once you launched it.
I want to study the SNMP Python scripts of this site. Let me download it using WebHTTrack. First you need to create a project by giving the name. For this example I call it SNMP_Scapy (Scapy is a packet crafting python package.) The default location to which this website gets downloaded is /home/<username>/websites; you can change it if you wish.
Next is to provide the URL for the Webhttrack to download. Note that it will download from the URL and all the directories underneath it.
Before proceeding, you may need to set the proxy if you’re behind one. Open the Preferences and mirror options and click on the proxy link. Use this syntax when entering the proxy URL
http://proxy_ip:proxy_port or http://proxy_username:proxy_password@proxy_ip:proxy_port
With that HTTrack asks to confirm the connection settings. Clicking start button will start the downloading of the website.
You can see the files and directories as they are being downloaded. Depending on the size of the website and the Internet speed, it takes a while to complete.
Once done, you’ll get the site mirroring finished message with a link to the to the downloaded website in the local directory.
Navigate to the directory and you’ll see few folders plus index.html. Open it to see the downloaded site. Click on the hyperlinks and they should work.
You can now copy the entire website directory to another PC and enjoy the full website. Happy browsing.