Search:
Match:
1 results

Analysis

This paper addresses the challenge of contextual biasing, particularly for named entities and hotwords, in Large Language Model (LLM)-based Automatic Speech Recognition (ASR). It proposes a two-stage framework that integrates hotword retrieval and LLM-ASR adaptation. The significance lies in improving ASR performance, especially in scenarios with large vocabularies and the need to recognize specific keywords (hotwords). The use of reinforcement learning (GRPO) for fine-tuning is also noteworthy.
Reference

The framework achieves substantial keyword error rate (KER) reductions while maintaining sentence accuracy on general ASR benchmarks.