Hardware Acceleration for Neural Networks: A Survey

Published:Dec 30, 2025 00:27
1 min read
ArXiv

Analysis

This survey paper provides a comprehensive overview of hardware acceleration techniques for deep learning, addressing the growing importance of efficient execution due to increasing model sizes and deployment diversity. It's valuable for researchers and practitioners seeking to understand the landscape of hardware accelerators, optimization strategies, and open challenges in the field.

Reference

The survey reviews the technology landscape for hardware acceleration of deep learning, spanning GPUs and tensor-core architectures; domain-specific accelerators (e.g., TPUs/NPUs); FPGA-based designs; ASIC inference engines; and emerging LLM-serving accelerators such as LPUs (language processing units), alongside in-/near-memory computing and neuromorphic/analog approaches.