GitHub: https://github.com/yee-yore/DorkAgent
🤖 LLM-powered agent for automated Google Dorking in bug hunting & pentesting.
Usage
- Git clone
git clone https://github.com/yee-yore/DorkAgent.git
cd DorkAgent
- Run DorkAgent
python dorkagent.py
The program will:
- Auto-install all required packages on first run
- Prompt for API keys and save them to
.envfile automatically
Required API keys:
SERPER_API_KEY- Required (Get from https://serper.dev/)- At least one LLM API key:
OPENAI_API_KEY- For OpenAI models (GPT-4, GPT-5, etc.)ANTHROPIC_API_KEY- For Claude modelsGEMINI_API_KEY- For Gemini models
For more description https://medium.com/@yee-yore/llm-powered-agent-for-automated-google-dorking-dcb14d609dc2
Customize
- The number of google results (
serper_dev_tool.pyinsidesite-packages/crewai_tools/tools/serper_dev_tool/)
class SerperDevTool(BaseTool):
...
args_schema: Type[BaseModel] = SerperDevToolSchema
base_url: str = "https://google.serper.dev"
n_results: int = 10 # min: 10, max: 100
...
- Duration of google search results (
serper_dev_tool.py)
# https://serper.dev/playground
def _make_api_request(self, search_query: str, search_type: str) -> dict:
...
payload = json.dumps({"q": search_query, "num": self.n_results, "tbs": "qdr:m"}) # Past week: "qdr:w", Past month: "qdr:m"
...
- Google dorks (
task())
# Reference https://github.com/TakSec/google-dorks-bug-bounty
- Agents (
agents())
# https://docs.crewai.com/concepts/agents
Update Log
- 2025-12-11: DorkAgent v1.4 - Dynamic model selection via API (choose from available models at runtime), added Pydantic models for structured task output validation, improved code quality with CONVENTIONS.md compliance fixes
- 2025-09-29: Major code refactoring for improved maintainability (@wjdrud2532 PR #3). Split monolithic 800+ line file into modular architecture (config.py, agents.py, tasks.py, utils.py), fixed dependency auto-installation to work before module imports, added timestamps to report filenames (YYMMDD_HHMMSS format), enforced code conventions with standardized import order and removed all inline comments, maintained 100% backward compatibility with no breaking changes
- 2025-08-11: Enhanced security reports with specific information disclosure details, fixed critical notification race condition bug, improved attack vector analysis with actual parameters and payloads, added comprehensive development documentation (CLAUDE.md), restored requirements.txt format
- 2025-05-18: Modified README.md and banner, Added juicy google dorks, Medium article (https://medium.com/@yee-yore/llm-powered-agent-for-automated-google-dorking-dcb14d609dc2)
- 2025-04-17: Removed tasks(old).py, the version prior to prompt engineering; Deleted Google Dork for finding "Confidential" documents (most results were merely informative); Removed Google Dork targeting login panels; Added settings to help avoid LLM provider rate limits; Integrated Gemini Flash 2.0 (free to use and currently considered the best value LLM); Merged tasks.py and agents.py into dorkagent.py for simplified maintenance
- 2025-04-01: Added hybrid LLM option (GPT & Claude); Added dork
intitle:"IIS Windows Server"; Applied prompt engineering to tasks.py; Added default depth consideration for subdomain inputs; Addedrequirements.txtfor Windows/MacOS compatibility