mirror of
https://github.com/frankwxu/mobile-pii-discovery-agent.git
synced 2026-02-20 13:40:41 +00:00
7a41a766ffd1a115318856d60b45d345238d5cb8
LLM-Guided SQL Evidence Extraction
This project implements a lightweight LLM-assisted pipeline for discovering and extracting evidentiary artifacts from SQLite databases commonly found in mobile device extractions.
The system separates discovery and extraction to reduce search space, avoid hallucinated SQL, and preserve explainability.
Features
- LLM-guided SQL planning with deterministic execution
- Discovery to extraction workflow
- Fixed evidence types:
EMAIL,PHONE,USERNAME,PERSON_NAME - Safe SQLite execution with REGEXP support
- UNION / UNION ALL–aware column extraction
- Transparent, inspectable state machine
Setup
pip install langchain langgraph python-dotenv
Languages
Jupyter Notebook
85.1%
Python
12.4%
TeX
2.5%