MLLMs Struggle with Vertical Japanese Text: New Research Reveals Performance Gaps

Research#MLLMs🔬 Research|Analyzed: Jan 26, 2026 11:43
Published: Nov 19, 2025 03:04
1 min read
ArXiv

Analysis

This research highlights a critical challenge for Multimodal Large Language Models (MLLMs) in processing Japanese documents: the models' underperformance on vertically written text. The study demonstrates the need for specialized training data to improve MLLMs' ability to understand this common form of Japanese writing.
Reference / Citation
View Original
"Using these datasets, we demonstrate that the existing MLLMs perform worse on vertically written Japanese text than on horizontally written Japanese text."
A
ArXivNov 19, 2025 03:04
* Cited for critical analysis under Article 32.