triton-inference-server/hugectr_backend

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/triton-inference-server/hugectr_backend)

triton-inference-server / hugectr_backend

☆57

Alternatives and similar repositories for hugectr_backend

Users that are interested in hugectr_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVIDIA-Merlin / HugeCTR
View on GitHub
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
☆1,067Mar 12, 2026Updated 4 months ago
mlcommons / inference_results_v0.7
View on GitHub
This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.
☆17Jul 24, 2025Updated last year
miziha-zp / BiuG-XMRec-WSDMCup22
View on GitHub
☆18Feb 22, 2022Updated 4 years ago
nvidia-riva / sample-apps
View on GitHub
Sample applications using NVIDIA Riva Skills
☆30Nov 17, 2025Updated 8 months ago
triton-inference-server / fil_backend
View on GitHub
FIL backend for the Triton Inference Server
☆94Jul 17, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
DeepRec-AI / DeepRec
View on GitHub
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…
☆1,197Jan 21, 2025Updated last year
ap-hynninen / cutt
View on GitHub
CUDA Tensor Transpose (cuTT) library
☆55Aug 10, 2017Updated 8 years ago
hechaoli / libbpf_map_in_map
View on GitHub
A simple example of map_in_map usage in libbpf
☆10Mar 18, 2020Updated 6 years ago
NVIDIA-Merlin / HierarchicalKV
View on GitHub
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆208May 22, 2026Updated 2 months ago
meton-robean / PaperNotes
View on GitHub
记录阅读各类paper的想法笔记（关注体系结构，机器学习系统，深度学习，计算机视觉）
☆25Oct 25, 2019Updated 6 years ago
MatanHamilis / one_stencil
View on GitHub
Multiple 1-stencil implementations using nvidia cuda.
☆12Dec 2, 2017Updated 8 years ago
tensorflow / recommenders-addons
View on GitHub
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
☆635Sep 4, 2025Updated 10 months ago
lukedeo / torch-serving
View on GitHub
Simple HTTP serving for PyTorch 🚀
☆10Oct 15, 2020Updated 5 years ago
general-labs / Image-AI
View on GitHub
General purpose AI experiments. Use face detection to identify politics figure, automatic face priority crop, image caption and labelling…
☆10Nov 22, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
NVIDIA-Merlin / NVTabular
View on GitHub
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…
☆1,149May 22, 2026Updated 2 months ago
iqiyi / xgboost-serving
View on GitHub
A flexible, high-performance serving system for machine learning models
☆145Nov 24, 2021Updated 4 years ago
NVIDIA / recsys-examples
View on GitHub
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
☆293Updated this week
NVIDIA-Merlin / Merlin
View on GitHub
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocess…
☆900Updated this week
codejitsu / session4rec
View on GitHub
GRu4Rec in TensorFlow
☆14Apr 11, 2018Updated 8 years ago
eniac / TELEPORT
View on GitHub
Optimizing data-intensive systems in disaggregated data centers
☆13Jun 13, 2022Updated 4 years ago
lodemo / CATANA
View on GitHub
Video face recognition and content creator collaboration detection framework.
☆15Mar 24, 2023Updated 3 years ago
SJTU-IPADS / fgnn-artifacts
View on GitHub
FGNN's artifact evaluation (EuroSys 2022)
☆18Apr 25, 2022Updated 4 years ago
triton-inference-server / backend
View on GitHub
Common source, scripts and utilities for creating Triton backends.
☆377Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jplu / faiss-grpc-server
View on GitHub
gRPC server over a FAISS index
☆19Aug 19, 2021Updated 4 years ago
jackwener / IT-common-disease
View on GitHub
程序员职业病
☆12Apr 26, 2020Updated 6 years ago
Jhy1993 / HGSRec
View on GitHub
☆12Mar 14, 2021Updated 5 years ago
ch-xu / RUM
View on GitHub
☆12Sep 3, 2018Updated 7 years ago
triton-inference-server / pytorch_backend
View on GitHub
The Triton backend for the PyTorch TorchScript models.
☆178Updated this week
tensorflow / networking
View on GitHub
Enhanced networking support for TensorFlow. Maintained by SIG-networking.
☆99Nov 19, 2021Updated 4 years ago
triton-inference-server / python_backend
View on GitHub
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
☆678Updated this week
nvidia-china-sae / WholeGraph
View on GitHub
☆11Mar 4, 2021Updated 5 years ago
jackodirks / cma_malloc
View on GitHub
A Linux kernel module which can be used to allocate memory from the Contiguous Memory Allocator
☆16Oct 31, 2017Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
crazyboycjr / nethint
View on GitHub
The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"
☆26Feb 2, 2024Updated 2 years ago
chensi01 / DGRec
View on GitHub
implementation of Session-Based Social Recommendation via Dynamic Graph Attention Networks
☆10Apr 17, 2020Updated 6 years ago
xxxliu95 / RA_FA_Cardiac
View on GitHub
☆10Sep 18, 2020Updated 5 years ago
SJTU-IPADS / gnnlab
View on GitHub
A Factored System for Sample-based GNN Training over GPUs
☆46Jul 26, 2023Updated 2 years ago
OpenMPDK / DSS
View on GitHub
Disaggregated Storage Solution
☆42Jan 15, 2025Updated last year
hongyuanmei / nce-mpp
View on GitHub
Source code for Noise-Contrastive Estimation for Multivariate Point Processes (NeurIPS 2020).
☆15Nov 3, 2020Updated 5 years ago
eunomia-bpf / nccl-eBPF
View on GitHub
☆20Jul 7, 2026Updated 2 weeks ago