CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729
Analysis
This article from Practical AI discusses CTIBench, a benchmark for evaluating Large Language Models (LLMs) in Cyber Threat Intelligence (CTI). It features an interview with Nidhi Rastogi, an assistant professor at Rochester Institute of Technology. The discussion covers the evolution of AI in cybersecurity, the advantages and challenges of using LLMs in CTI, and the importance of techniques like Retrieval-Augmented Generation (RAG). The article highlights the process of building the benchmark, the tasks it covers, and key findings from benchmarking various LLMs. It also touches upon future research directions, including mitigation techniques, concept drift monitoring, and explainability improvements.
Key Takeaways
- CTIBench is a benchmark for evaluating LLMs in Cyber Threat Intelligence.
- RAG is crucial for keeping LLMs up-to-date with emerging threats.
- The research lab is focusing on mitigation techniques, concept drift monitoring, and explainability.
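To illustrate the RAG idea mentioned above, here is a minimal sketch of retrieving relevant threat reports and prepending them to a prompt so the model answers from current data. Everything here is an assumption for illustration: the `CTI_REPORTS` snippets are invented, and a real system would use dense embeddings and a vector store rather than this toy term-overlap retriever.

```python
from collections import Counter
import math

# Hypothetical mini-corpus of CTI report snippets (invented for illustration).
CTI_REPORTS = [
    "CVE-2024-0001: remote code execution in the Foo web server via crafted headers.",
    "APT group observed using spearphishing attachments to deploy a new loader.",
    "Ransomware campaign exploits unpatched VPN appliances for initial access.",
]

def tokenize(text: str) -> list[str]:
    return [t.strip(".,:;").lower() for t in text.split()]

def score(query: str, doc: str) -> float:
    # Toy term-overlap score, length-normalized; stands in for embedding similarity.
    q, d = Counter(tokenize(query)), Counter(tokenize(doc))
    overlap = sum((q & d).values())
    return overlap / math.sqrt(len(tokenize(doc)) or 1)

def retrieve(query: str, k: int = 1) -> list[str]:
    # Return the k most relevant report snippets for the query.
    return sorted(CTI_REPORTS, key=lambda doc: score(query, doc), reverse=True)[:k]

def build_prompt(query: str) -> str:
    # Prepend retrieved context so the LLM grounds its answer in fresh threat data.
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

prompt = build_prompt("How are attackers gaining initial access via VPN?")
```

The point of the pattern is that the corpus, not the model weights, carries knowledge of emerging threats, so updating coverage means re-indexing new reports rather than retraining the model.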
“Nidhi shares the importance of benchmarks in exposing model limitations and blind spots, the challenges of large-scale benchmarking, and the future directions of her AI4Sec Research Lab.”