Presentation

Towards Redundancy-Free Recommendation Model Training via Reusable-aware Near-Memory Processing

Session

Where Processing-in-Memory Fits Best in the System

Description

The memory-intensive embedding layer remains the performance bottleneck of recommendation models. Prior works have attempted to improve embedding-layer performance by exploiting data locality to cache frequently accessed embedding vectors and their partial sums. However, these solutions rely on a static cache, which is invalidated during embedding training because the embedding vectors are updated frequently. To this end, this paper proposes ReFree, a redundancy-free near-memory processing (NMP) solution for embedding training. Specifically, ReFree identifies reusable data in real time for both forward propagation and backpropagation of embedding layer training, and leverages a lightweight NMP architecture to enable redundancy-free near-memory acceleration of the entire embedding training process. Evaluation results on real-world datasets show that ReFree outperforms state-of-the-art solutions by 10.9x and reduces energy consumption by 5.3x on average.
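The kind of redundancy the description refers to can be illustrated with a small sketch. It is not ReFree's mechanism, which the abstract does not detail. It only shows, for a sum-pooled embedding lookup, how many row reads repeat within one batch and how per-row gradients can be coalesced in the backward pass. All function names here (`count_row_reads`, `pooled_forward`, `coalesced_backward`) are hypothetical, and the embedding table is assumed to be a plain list of equal-length rows.

```python
def count_row_reads(batch):
    """Return (naive_reads, distinct_rows) for a batch of index lists."""
    naive = sum(len(sample) for sample in batch)
    distinct = len({idx for sample in batch for idx in sample})
    return naive, distinct


def pooled_forward(table, batch):
    """Sum-pool embedding rows per sample, fetching each distinct row once.

    Returns the pooled vectors and the number of rows actually fetched.
    """
    dim = len(table[0])
    fetched = {}  # row index -> vector, filled on first access in this batch
    outputs = []
    for sample in batch:
        acc = [0.0] * dim
        for idx in sample:
            if idx not in fetched:
                fetched[idx] = table[idx]  # the only read of the table
            row = fetched[idx]
            for d in range(dim):
                acc[d] += row[d]
        outputs.append(acc)
    return outputs, len(fetched)


def coalesced_backward(batch, grad_outputs, dim):
    """Accumulate gradients per distinct row so each row is updated once.

    With sum pooling, every row used by a sample receives that sample's
    output gradient unchanged.
    """
    grads = {}
    for sample, g in zip(batch, grad_outputs):
        for idx in sample:
            acc = grads.setdefault(idx, [0.0] * dim)
            for d in range(dim):
                acc[d] += g[d]
    return grads
```

For the batch `[[0, 1], [1, 2], [0, 1, 2]]`, a naive lookup issues 7 row reads although only 3 distinct rows are touched. Because training updates those rows after every step, a cache filled this way is only valid within the current batch, which is the limitation of static caching that the description points out.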

Authors

Haifeng Liu

Huazhong University of Science and Technology

Long Zheng

Huazhong University of Science and Technology

Yu Huang

Huazhong University of Science and Technology

Haoyan Huang

Huazhong University of Science and Technology

Xiaofei Liao

Huazhong University of Science and Technology

Jin Hai