Accelerating Recommender model training by leveraging popular choices -- VLDB 2022
☆31Sep 15, 2024Updated last year
Alternatives and similar repositories for Accelerating-RecSys-Training
Users that are interested in Accelerating-RecSys-Training are comparing it to the libraries listed below
Sorting:
- Set of datasets for the deep learning recommendation model (DLRM).☆49Dec 21, 2022Updated 3 years ago
- Sharing the codebase and steps for artifact evaluation for ISCA 2023 paper☆15Feb 20, 2024Updated 2 years ago
- Enabling pure data parallel training of DLRM via caching and prefetching☆17Oct 29, 2021Updated 4 years ago
- [AAAI'23] FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction https://arxiv.org/abs/2304.00902☆10Apr 9, 2023Updated 2 years ago
- ☆71Jan 23, 2021Updated 5 years ago
- Graph accelerator on FPGAs and ASICs☆11Aug 16, 2018Updated 7 years ago
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator☆56Jun 12, 2021Updated 4 years ago
- ☆18May 8, 2021Updated 4 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Apr 13, 2023Updated 2 years ago
- ☆22Jun 4, 2023Updated 2 years ago
- ☆15Apr 26, 2022Updated 3 years ago
- ☆19Jun 1, 2023Updated 2 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆19Mar 5, 2023Updated 2 years ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube☆23Aug 10, 2018Updated 7 years ago
- [KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems☆22Mar 24, 2023Updated 2 years ago
- SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.☆51May 23, 2023Updated 2 years ago
- ☆26Aug 19, 2022Updated 3 years ago
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆29Mar 24, 2023Updated 2 years ago
- ☆31May 31, 2023Updated 2 years ago
- CNN accelerator☆29Jun 11, 2017Updated 8 years ago
- ☆31Feb 22, 2024Updated 2 years ago
- ☆24Apr 20, 2024Updated last year
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Feb 12, 2022Updated 4 years ago
- SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) ar…☆77Jun 29, 2022Updated 3 years ago
- A tracing tool to analyze the I/O behavior of a program.☆12Sep 25, 2019Updated 6 years ago
- Trading algorithm for Bitcoins in USD on quantconnect.com☆13Jan 12, 2018Updated 8 years ago
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Jun 19, 2018Updated 7 years ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆40Sep 10, 2024Updated last year
- http://vlsiarch.eecs.harvard.edu/research/recommendation/☆134Sep 15, 2022Updated 3 years ago
- ☆79Mar 7, 2022Updated 3 years ago
- TLB Benchmarks☆35Sep 11, 2017Updated 8 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 2 years ago
- PUMA Compiler☆30Oct 13, 2025Updated 4 months ago
- ☆11Dec 10, 2015Updated 10 years ago
- This repository contains python code to create, backtest and automate intraday-trading algorithms in financial markets using Machine Lear…☆10Sep 30, 2021Updated 4 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 6 years ago
- Accepted at WWW 25 Industrial Track (oral)☆18Jun 6, 2025Updated 8 months ago
- ☆10Jan 11, 2024Updated 2 years ago
- This code is a version of implement of the essay named Deep Inception Networks: A General End-to-End Framework for Multi-asset Quantitati…☆12Mar 15, 2024Updated last year