GitHub: https://github.com/josh0xA/darkdump
darkdump
About
Darkdump is a OSINT interface for carrying out deep web investgations written in python in which it allows users to enter a search query in which darkdump provides the ability to scrape .onion sites relating to that query to try to extract emails, metadata, keywords, images, social media etc. Darkdump retrieves sites via Ahmia.fi and scrapes those .onion addresses when connected via the tor network.
Installation
git clone https://github.com/josh0xA/darkdump
cd darkdump
python3 -m pip install -r requirements.txt
python3 darkdump.py --help
Tor Configuration
To use Darkdump effectively, you need to configure Tor to allow your script to control it via the Tor control port. Here's how to set up your torrc
file and verify that Tor is running:
Step 1: Install Tor
If Tor is not already installed on your system, you need to install it. Here's how you can install Tor on various operating systems:
Debian/Kali/Ubuntu: sudo apt install tor
MacOS: brew install tor
Step 2: Configure the Tor torrc File
Locate your torrc file. This file is usually found at /etc/tor/torrc
on Linux and sometimes Mac.
Add the following lines to your torrc to enable the control port and set a control port password:
ControlPort 9051
HashedControlPassword [YourHashedPasswordHere]
Replace [YourHashedPasswordHere]
with a hashed password which can be generated using the tor --hash-password
command: tor --hash-password "my_password"
Step 3: Start Tor Service
Linux: sudo systemctl start tor.service
MacOS: brew services start tor
Example Queries:
python3 darkdump.py -q "hacking" -a 10 --scrape --proxy
- search for 10 links and scrape each sitepython3 darkdump.py -q "free movies" -a 25
- don't scrape, just return 25 links for that query (does not require tor)python3 darkdump.py -q "marketplaces" -a 15 --scrape --proxy -i
- search for 10 links and scrape each site as well as find and store images.
Menu
_ _ _ __
__| | __ _ _ __| | ____| |_ _ _ __ ___ _ __ / /
/ _` |/ _` | '__| |/ / _` | | | | '_ ` _ \| '_ \ / /
| (_| | (_| | | | < (_| | |_| | | | | | | |_) | / /
\__,_|\__,_|_| |_|\_\__,_|\__,_|_| |_| |_| .__/ /_/ v3 by Josh Schiavone
|_|
usage: darkdump.py [-h] [-v] [-q QUERY] [-a AMOUNT] [-p] [-i] [-s]
Darkdump is an interface for scraping the deepweb through Ahmia. Made by yours truly.
options:
-h, --help show this help message and exit
-v, --version returns darkdump's version
-q QUERY, --query QUERY
the keyword or string you want to search on the deepweb
-a AMOUNT, --amount AMOUNT
the amount of results you want to retrieve
-p, --proxy use tor proxy for scraping
-i, --images scrape images and visual content from the site
-s, --scrape scrape the actual site for content and look for keywords
Visual
Ethical Notice
The developer of this program, Josh Schiavone, is not resposible for misuse of this data gathering tool. Do not use darkdump to navigate websites that take part in any activity that is identified as illegal under the laws and regulations of your government. May God bless you all.
License
MIT License
Copyright (c) Josh Schiavone