Avatar of Bobby Filar

Bobby Filar

Sublime Security

Head of AI at Sublime Security. Research on agentic systems, LLM evaluation, adversarial ML, and AI governance.

  • About
  • Projects
  • Publications
  • Talks & News
  • CV

#benchmarks

Content tagged with "benchmarks"

MQL Benchmark
2026-05-15
#Evaluation #Benchmarks #LLMs

A 30,000-example open-source benchmark for evaluating natural-language → DSL generation, with a public model leaderboard.

View
© 2026 Bobby Filar.
Built with Academic Portfolio Astro