HERO-Sign: GPU Acceleration for Post-Quantum Signatures
Analysis
Key Takeaways
- •Proposes HERO-Sign, a GPU-accelerated implementation of SPHINCS+.
- •Employs hierarchical tuning and compiler-time optimizations for performance.
- •Introduces a Tree Fusion strategy and adaptive compilation.
- •Achieves significant throughput improvements compared to existing GPU implementations.
- •Reduces kernel launch latency.
“HERO Sign achieves throughput improvements of 1.28-3.13, 1.28-2.92, and 1.24-2.60 under the SPHINCS+ 128f, 192f, and 256f parameter sets on RTX 4090.”