BabbelPhish
BabbelPhish is an experimental pipeline for fine-tuning an LLM to translate natural language descriptions into Message Query Language (MQL) — the domain-specific language used at Sublime Security for email detection. The project demonstrated that small fine-tuned models can credibly bridge the gap between an analyst’s intent and the DSL needed to express it.
The repo includes:
- A fine-tuning workflow against the BabbelPhish dataset (see the companion HuggingFace dataset)
- A custom tokenizer trained on MQL and a Flask-based comparison webapp (GPT-2 / GPT-3 / GPT-4 / custom)
- Evaluation scripts computing `pass@k` and BLEU against a held-out test set
- Tokenizer-quality metrics (OOV rate, granularity, information loss, type ratio, reversibility)
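
For readers unfamiliar with `pass@k`, here is a minimal sketch of the commonly used unbiased estimator: given `n` generated candidates for a prompt, of which `c` are judged correct, it estimates the probability that at least one of `k` sampled candidates is correct. This is an illustration only; the repo's evaluation scripts may compute the metric differently. The function names `pass_at_k` and `mean_pass_at_k` are hypothetical, and how a generated MQL query is judged "correct" is an assumption left outside the sketch.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one prompt: 1 - C(n - c, k) / C(n, k).

    n: number of candidates generated for the prompt
    c: number of those candidates judged correct (criterion is an assumption)
    k: sample budget, with 1 <= k <= n
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k incorrect candidates: any k-subset contains a correct one.
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over a test set given (n, c) pairs, one per prompt."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For example, `mean_pass_at_k([(10, 3), (10, 0)], k=1)` averages the per-prompt estimates for two test prompts with 10 candidates each.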