CricBench: A Benchmark for LLMs in Cricket Analytics

Research Paper #Large Language Models, Cricket Analytics, Benchmarking, Multilingual NLP 🔬 Research|Analyzed: Jan 3, 2026 23:56•

Published: Dec 26, 2025 05:59

•

1 min read

•ArXiv

Analysis

This paper introduces CricBench, a specialized benchmark for evaluating Large Language Models (LLMs) in the domain of cricket analytics. It addresses the gap in LLM capabilities for handling domain-specific nuances, complex schema variations, and multilingual requirements in sports analytics. The benchmark's creation, including a 'Gold Standard' dataset and multilingual support (English and Hindi), is a key contribution. The evaluation of state-of-the-art models reveals that performance on general benchmarks doesn't translate to success in specialized domains, and code-mixed Hindi queries can perform as well or better than English, challenging assumptions about prompt language.

Key Takeaways

•CricBench is a new benchmark for evaluating LLMs in cricket analytics.
•The benchmark includes a 'Gold Standard' dataset and supports English and Hindi.
•Performance on general benchmarks doesn't guarantee success in specialized domains.
•Code-mixed Hindi queries can perform as well or better than English.

Reference / Citation

View Original

"The open-weights reasoning model DeepSeek R1 achieves state-of-the-art performance (50.6%), surpassing proprietary giants like Claude 3.7 Sonnet (47.7%) and GPT-4o (33.7%), it still exhibits a significant accuracy drop when moving from general benchmarks (BIRD) to CricBench."

ArXivDec 26, 2025 05:59

* Cited for critical analysis under Article 32.

Older

A Communication-Efficient Distributed Algorithm for Learning with Heterogeneous and Structurally Incomplete Multi-Site Data

Newer

Beyond Single Bugs: Benchmarking Large Language Models for Multi-Vulnerability Detection

Related Analysis

Research Paper

CricBench: A Benchmark for LLMs in Cricket Analytics

Analysis

Key Takeaways

Related Analysis

SpaceTimePilot: Generative Video Rendering with Space-Time Control

Randomness Generation in Quantum Chaotic Systems

GaMO: Geometry-aware Diffusion for Sparse-View 3D Reconstruction

📬 Get AI News Delivered

Browse by Category

Trending Topics

📬 Get AI News Delivered

Browse by Category

Trending Topics