distributed-embeddings is a library for building large embedding-based models in TensorFlow 2.
☆47Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for distributed-embeddings
Users that are interested in distributed-embeddings are comparing it to the libraries listed below.
- ☆26Aug 15, 2022Updated 3 years ago
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"☆10Mar 31, 2022Updated 3 years ago
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto…☆94Jun 11, 2024Updated last year
- CUDA Embedding Lookup Kernel Library☆43Feb 9, 2026Updated last month
- A research group at UCSD CSE focused on Advanced Data Analytics: data management and systems for ML/AI and data science.☆11Feb 27, 2026Updated 3 weeks ago
- ☆57Oct 17, 2023Updated 2 years ago
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,053Mar 12, 2026Updated last week
- ☆23Jun 5, 2019Updated 6 years ago
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆160Apr 20, 2024Updated last year
- WholeGraph - large scale Graph Neural Networks☆106Nov 25, 2024Updated last year
- Source code for QuickSel (SIGMOD 2020)☆19Jul 12, 2025Updated 8 months ago
- This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.☆17Jul 24, 2025Updated 7 months ago
- A platform to evaluate techniques used in multicore graph processing.☆37Oct 25, 2018Updated 7 years ago
- CSAPP3e Course Labs Files☆10Oct 9, 2020Updated 5 years ago
- Dev repo for power measurement for the MLPerf™ benchmarks☆28Sep 11, 2025Updated 6 months ago
- Examples for Recommenders - easy to train and deploy on accelerated infrastructure.☆231Updated this week
- ☆10Jul 16, 2016Updated 9 years ago
- Cluster simulator with far memory☆12Apr 28, 2020Updated 5 years ago
- The merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch or JAX☆423Apr 16, 2024Updated last year
- Simple Bert Implementation (TensorFlow 2.0)☆13Aug 9, 2019Updated 6 years ago
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value d…☆14May 19, 2016Updated 9 years ago
- Final Project for Parallel Computing at CMU (15-618/15-418)☆10May 13, 2016Updated 9 years ago
- tiny pytorch implementation of neural style transfer.☆13Jul 6, 2017Updated 8 years ago
- ☆12Sep 11, 2020Updated 5 years ago
- OpenEmbedding is an open source framework for TensorFlow distributed training acceleration.☆33Apr 13, 2023Updated 2 years ago
- ☆17Updated this week
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- ☆22Apr 27, 2023Updated 2 years ago
- Python implementation of DPP sampling☆14Nov 4, 2024Updated last year
- Online Variance Reduction☆15May 9, 2019Updated 6 years ago
- Python tools for NVIDIA Profiler☆21Dec 21, 2017Updated 8 years ago
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆57Nov 18, 2022Updated 3 years ago
- ☆14Mar 10, 2024Updated 2 years ago
- ☆16Jun 25, 2024Updated last year
- GLake: optimizing GPU memory management and IO transmission.☆498Mar 24, 2025Updated last year
- Materials for ECS 201A☆11Oct 23, 2019Updated 6 years ago
- Display reminders due today, add new reminders, and show today's events.☆16Feb 25, 2026Updated 3 weeks ago
- ☆14Sep 27, 2021Updated 4 years ago
- R2Plus1D MXNet Implementation☆11Jul 11, 2018Updated 7 years ago