Classifying Long Legal Documents with Chunking and Temporal

Paper #llm 🔬 Research | Analyzed: Jan 3, 2026 06:15

Published: Dec 31, 2025 17:48


Analysis

This paper addresses the practical challenges of classifying long legal documents with Transformer-based models. Its core contribution is a method that classifies a document from short, randomly selected chunks of text, which works around the computational limits of running a Transformer over the full text and improves efficiency. The chunking strategy is combined with DeBERTa V3 and an LSTM. The deployment pipeline is built on Temporal, reflecting the need for robust and reliable processing in real-world applications. The paper reports a weighted F-score of 0.898 for the best model and a median processing time of 498 seconds per 100 files for the pipeline running on CPU, which serve as useful benchmarks.
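The random-chunk idea can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name `sample_chunks`, the parameters `chunk_len` and `n_chunks`, their default values, and the whitespace tokenization are all assumptions made for the example. The chunks are returned in document order on the assumption that a sequence model such as an LSTM would consume them in that order.

```python
import random

def sample_chunks(tokens, chunk_len=128, n_chunks=8, seed=None):
    """Pick up to n_chunks random contiguous windows of chunk_len tokens.

    Hypothetical helper: names and defaults are illustrative only.
    Windows may overlap; they are returned in document order.
    """
    rng = random.Random(seed)
    if len(tokens) <= chunk_len:
        # Short documents are returned whole as a single chunk.
        return [list(tokens)]
    max_start = len(tokens) - chunk_len
    k = min(n_chunks, max_start + 1)
    # Sample distinct start offsets, then sort to keep document order.
    starts = sorted(rng.sample(range(max_start + 1), k))
    return [list(tokens[s:s + chunk_len]) for s in starts]

# Toy "long document" of 2000 whitespace tokens (placeholder text).
document = ("whereas the parties agree " * 500).split()
chunks = sample_chunks(document, chunk_len=128, n_chunks=8, seed=0)
print(len(chunks), len(chunks[0]))
```

In a real pipeline each chunk would be tokenized and encoded by the Transformer, so the cost per document depends on the number and length of chunks rather than on the full document length.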

Key Takeaways

• Addresses the challenge of classifying long legal documents.
• Employs a chunking strategy with DeBERTa V3 and an LSTM.
• Uses Temporal for a robust deployment pipeline.
• Achieves a weighted F-score of 0.898.
• Provides processing time benchmarks for CPU deployment.
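For readers unfamiliar with the metric, a weighted F-score is usually the average of per-class F1 scores, weighted by the number of true examples of each class. The sketch below assumes that common definition; the summary does not state how the paper computed it. The class labels and predictions are made up for the example.

```python
from collections import Counter

def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1 (assumed definition)."""
    support = Counter(y_true)  # number of true examples per class
    total = len(y_true)
    score = 0.0
    for label, count in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (count / total) * f1
    return score

# Hypothetical labels, for illustration only.
y_true = ["contract", "contract", "contract", "ruling", "ruling", "statute"]
y_pred = ["contract", "contract", "ruling", "ruling", "ruling", "contract"]
print(round(weighted_f1(y_true, y_pred), 3))  # prints 0.6
```

Because each class is weighted by its support, frequent document types dominate the score, which matters when class sizes are unbalanced.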

Reference / Citation

"The best model had a weighted F-score of 0.898, while the pipeline running on CPU had a processing median time of 498 seconds per 100 files."

ArXiv, Dec 31, 2025 17:48

* Cited for critical analysis under Article 32.

