☆57Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for hugectr_backend
Users that are interested in hugectr_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,058Mar 12, 2026Updated last month
- This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.☆17Jul 24, 2025Updated 8 months ago
- Transformer related optimization, including BERT, GPT☆39Feb 10, 2023Updated 3 years ago
- ☆24Jun 24, 2025Updated 9 months ago
- FIL backend for the Triton Inference Server☆90Apr 8, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Apr 8, 2022Updated 4 years ago
- DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…☆1,174Jan 21, 2025Updated last year
- CUDA Tensor Transpose (cuTT) library☆54Aug 10, 2017Updated 8 years ago
- A simple example of map_in_map usage in libbpf☆10Mar 18, 2020Updated 6 years ago
- An implementation of the MDCC (Multi-Data Center Commit) Protocol featuring Fast Paxos.☆18Mar 16, 2013Updated 13 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆198Updated this week
- 记录阅读各类paper的想法笔记(关注体系结构,机器学习系统,深度学习,计算机视觉)☆25Oct 25, 2019Updated 6 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023☆13May 19, 2023Updated 2 years ago
- ☆24Sep 25, 2025Updated 6 months ago
- CAS Data Engine - Library to serve IOs on uZFS with synchronous replication, snapshots and clones☆19Dec 11, 2023Updated 2 years ago
- Optimizing data-intensive systems in disaggregated data centers☆13Jun 13, 2022Updated 3 years ago
- A flexible, high-performance serving system for machine learning models☆145Nov 24, 2021Updated 4 years ago
- Computes the Henry coefficient of methane in IRMOF-1☆10Oct 5, 2021Updated 4 years ago
- Common source, scripts and utilities for creating Triton backends.☆370Updated this week
- FGNN's artifact evaluation (EuroSys 2022)☆18Apr 25, 2022Updated 3 years ago
- gRPC server over a FAISS index☆19Aug 19, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Mar 4, 2021Updated 5 years ago
- benchmark for linux server☆13Nov 6, 2016Updated 9 years ago
- A PyTorch implementation of Determinantal Point Process Likelihoods for Sequential Recommendation☆12Dec 9, 2024Updated last year
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.☆673Updated this week
- ☆12Sep 3, 2018Updated 7 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆99Nov 19, 2021Updated 4 years ago
- Write pandoc markdown in OverLeaf☆12Sep 28, 2022Updated 3 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 2 years ago
- Driver for the LDBC SNB Interactive workload☆20Apr 6, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Strong FuxiCTR Baseline for News CTR Challenge at RecSys 2024☆20Jul 13, 2024Updated last year
- My notebook.☆12Jun 11, 2019Updated 6 years ago
- implementation of Session-Based Social Recommendation via Dynamic Graph Attention Networks☆10Apr 17, 2020Updated 5 years ago
- A Factored System for Sample-based GNN Training over GPUs☆46Jul 26, 2023Updated 2 years ago
- Source code for Noise-Contrastive Estimation for Multivariate Point Processes (NeurIPS 2020).☆15Nov 3, 2020Updated 5 years ago
- A simple implement of TransE, the ML algorithm published in 2013☆12May 3, 2018Updated 7 years ago
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"☆26Feb 2, 2024Updated 2 years ago