Research#llm📝 BlogAnalyzed: Dec 29, 2025 06:06

CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729

Published:Apr 30, 2025 07:21
1 min read
Practical AI

Analysis

This article from Practical AI discusses CTIBench, a benchmark for evaluating Large Language Models (LLMs) in Cyber Threat Intelligence (CTI). It features an interview with Nidhi Rastogi, an assistant professor at Rochester Institute of Technology. The discussion covers the evolution of AI in cybersecurity, the advantages and challenges of using LLMs in CTI, and the importance of techniques like Retrieval-Augmented Generation (RAG). The article highlights the process of building the benchmark, the tasks it covers, and key findings from benchmarking various LLMs. It also touches upon future research directions, including mitigation techniques, concept drift monitoring, and explainability improvements.

Reference

Nidhi shares the importance of benchmarks in exposing model limitations and blind spots, the challenges of large-scale benchmarking, and the future directions of her AI4Sec Research Lab.