Close

Presentation

FQP: A Fibonacci Quantization Processor with Multiplication-Free Computing and Topological-Order Routing
DescriptionNeural networks demand increasing computational power and memory access due to growing parameter sizes. A solution is low bit-width quantization, but conventional uniform quantization suffers from distribution mismatches, leading to accuracy loss. We introduce Fibonacci Quantization, closely aligning with neural network data distributions using Fibonacci numbers. Fibonacci Quantization Processor (FQP) features two multiplication-free computing units: the Dualistic-Transformation Adder for large numbers multiplication and the Bit-Exclusive Adder for small numbers multiplication. Additionally, Topological-Order Routing optimizes data mapping onto these units. FQP demonstrates either a 0.98% accuracy improvement or 2.17x higher energy efficiency for ResNet50 on ImageNet1k compared to uniform quantization.
Event Type
Research Manuscript
TimeThursday, June 2711:45am - 12:00pm PDT
Location3010, 3rd Floor
Topics
AI
Design
Keywords
AI/ML Architecture Design