I want to build some script which will include site cloner, so i wonder if anyone here have experience with that? Which libraries were used or framerwork, i need information. Cheers
Forum Thread: Anyone Have Experience in Cloning Site via Python?
- Hot
- Active
-
Forum Thread: Hack Instagram Account Using BruteForce 203 Replies
29 min ago -
Forum Thread: Hacking Facebook,Twitter,Instagram Account Passwords with BruteForce 161 Replies
2 hrs ago -
Forum Thread: How to Edit Some Specific Part of an Apk File 1 Replies
2 hrs ago -
Forum Thread: I Need Help in Hacking a Gmail Account. 26 Replies
4 hrs ago -
Forum Thread: Kali linux command error. Kindly help 1 Replies
23 hrs ago -
Forum Thread: When to Use Reverse Shell and Bind Shell? 5 Replies
2 days ago -
Forum Thread: Fluxion Not Working 6 Replies
2 days ago -
Forum Thread: Hacking Stream (LIVE) 10 Replies
3 days ago -
Forum Thread: Blind SQL Injection 6 Replies
3 days ago -
Forum Thread: Can C Programming Help Me for Hacking or Pen Testing. 10 Replies
1 wk ago -
Forum Thread: I Want to Hack in to My Friends System Through his wifi router i am controlling 7 Replies
1 wk ago -
Forum Thread: How to Use the Exploits Found in Vega Scan 3 Replies
1 wk ago -
Forum Thread: Can I Send a File to a Computer Using Ip Address 5 Replies
2 wks ago -
Forum Thread: How to Use NGROK in a Reverse_Tcp Attack? 20 Replies
2 wks ago -
Forum Thread: Problem with Hacking Webserver with Armitage 11 Replies
2 wks ago -
Forum Thread: Metasploit - Embedding an Android Payload into a PDF? 7 Replies
2 wks ago -
Forum Thread: Creating an Completely Undetectable Executable in Under 15 Minutes! 35 Replies
2 wks ago -
Forum Thread: How to Hack an Android Device with Only a Ip Adress 43 Replies
2 wks ago -
Forum Thread: Metasploit Use Domain Name as RHOST 4 Replies
2 wks ago -
Forum Thread: Make a Phishing Page for Harvesting Credentials Yourself 4 Replies
2 wks ago
-
How To: Automate Remote SSH Control of Computers with Expect Scripts
-
How To: Hack Android Using Kali (Remotely)
-
How To: Brute-Force Nearly Any Website Login with Hatch
-
How To: Top 10 Things to Do After Installing Kali Linux
-
How To: Phish for Social Media & Other Account Passwords with BlackEye
-
How To: Crack Shadow Hashes After Getting Root on a Linux System
-
How To: Create a Persistent Back Door in Android Using Kali Linux:
-
How To: Crack SSH Private Key Passwords with John the Ripper
-
How To: Crack Password-Protected Microsoft Office Files, Including Word Docs & Excel Spreadsheets
-
How to Hack Wi-Fi: Cracking WPA2 Passwords Using the New PMKID Hashcat Attack
-
How To: Gain SSH Access to Servers by Brute-Forcing Credentials
-
How To: Crack Password-Protected ZIP Files, PDFs & More with Zydra
-
How To: Bypass File Upload Restrictions on Web Apps to Get a Shell
-
How To: Dox Anyone
-
Hack Like a Pro: How to Crack Passwords, Part 4 (Creating a Custom Wordlist with Crunch)
-
How To: Upgrade a Dumb Shell to a Fully Interactive Shell for More Flexibility
-
How to Hack Wi-Fi: Cracking WPA2-PSK Passwords Using Aircrack-Ng
-
Hack Like a Pro: How to Find Directories in Websites Using DirBuster
-
How To: How Hackers Use Your IP Address to Hack Your Computer & How to Stop It
-
How To: Buy the Best Wireless Network Adapter for Wi-Fi Hacking in 2019
2 Responses
That's actually a research project I'm working on, and you can check it out here:
https://github.com/AlexMapley/Bartimeaus/blob/master/spider.py
The series I'm writing right now is actually a build up to this point.
https://null-byte.wonderhowto.com/forum/creating-python-web-crawler-part-1-getting-sites-source-code-0175912/
Although you'd definitely have to tweak this program a little bit, it's designed to go through an entire website and archive all of it's pages. You could definitely use it to clone a website.
To run it from the terminal, run "python spider.py 'http://www.example.com 1"
What it will do is start from the website link you input as argument 1, and archive every single linked webpage with the keyword "example". It will call itself recursively 1 time, or however many times you put in argument 2, opening every link it sees from every page. It will also never open the same link twice.
If you run this on a website, it'll probably take a while (maybe an hour???) but you can definitely clone it.
Thank you, that was very helpful !
Share Your Thoughts