2 results
Research #Agent 🔬 Research
Analyzed: Jan 10, 2026 11:23

NL2Repo-Bench: Evaluating Long-Horizon Code Generation Agents

Published: Dec 14, 2025 15:12
1 min read
ArXiv

Analysis

This ArXiv paper introduces NL2Repo-Bench, a benchmark for evaluating coding agents on long-horizon code generation. It assesses how well agents generate complete, complex software repositories.

Reference

NL2Repo-Bench aims to evaluate coding agents.

Research #LLM 🔬 Research
Analyzed: Jan 10, 2026 14:26

OmniStruct: Advancing Text-to-Structure Generation

Published: Nov 23, 2025 08:18
1 min read
ArXiv

Analysis

The OmniStruct paper presents an approach for generating structured data from text across a variety of schemas, which suggests more flexible and more widely applicable text-to-structure models. The research, available on ArXiv, reflects ongoing progress in automating data extraction and knowledge representation.

Reference

The research is available on ArXiv.