☆71Feb 10, 2025Updated last year
Alternatives and similar repositories for libfabric-efa-demo
Users that are interested in libfabric-efa-demo are comparing it to the libraries listed below.
- ☆13May 30, 2025Updated 11 months ago
- ☆10Feb 17, 2026Updated 2 months ago
- Custom recipes for NVIDIA Nsight Systems post-collection analysis.☆16Nov 7, 2025Updated 6 months ago
- A plugin that lets EC2 developers use libfabric as a network provider while running NCCL applications.☆211Updated this week
- ☆16Apr 7, 2026Updated 3 weeks ago
- Managed collective communication service☆24Sep 2, 2024Updated last year
- ☆80Jan 5, 2025Updated last year
- Open Fabric Interfaces☆785Updated this week
- Debug print operator for cudagraph debugging☆15Aug 2, 2024Updated last year
- Perplexity GPU Kernels☆570Nov 7, 2025Updated 6 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- NVIDIA Inference Xfer Library (NIXL)☆1,011Updated this week
- Lustre Repository with MS patches☆17Updated this week
- Build and run container environment for LFRic☆10Jan 8, 2024Updated 2 years ago
- Kubernetes CSI Driver for serving OCI model artifacts☆25Apr 29, 2026Updated last week
- PyTorch UCC plugin☆23Jul 8, 2021Updated 4 years ago
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆70Apr 25, 2026Updated last week
- Perplexity open source garden for inference technology☆404Dec 25, 2025Updated 4 months ago
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆80Jul 25, 2023Updated 2 years ago
- Lustre diagnostic tools for running Lustre in Azure☆10Apr 17, 2024Updated 2 years ago
- ☆165Dec 27, 2024Updated last year
- Japanese Entity Linker.☆12Jul 25, 2021Updated 4 years ago
- An application of CNN for crack detection using Caffe☆11Aug 11, 2020Updated 5 years ago
- Cray Lustre is HPE's curated Lustre distro for HPE ClusterStor, Cray EX, and other HPE/Cray clients☆18Updated this week
- Implementation of M4 in Python☆10Dec 4, 2022Updated 3 years ago
- NCCL Profiling Kit☆153Jul 1, 2024Updated last year
- A low-latency & high-throughput serving engine for LLMs☆496Jan 8, 2026Updated 3 months ago
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.☆71Mar 20, 2025Updated last year
- Use metric learning to cluster images and run similar image queries☆17Jun 13, 2017Updated 8 years ago
- Convert Travis.yml to GitHub Actions workflows.☆12Aug 20, 2022Updated 3 years ago
- Linux tree for ntrdma driver development.☆11Jun 29, 2017Updated 8 years ago
- Expert Specialization MoE Solution based on CUTLASS☆26Apr 14, 2026Updated 3 weeks ago
- Academic Operating System from Scratch☆14May 2, 2016Updated 10 years ago
- torchcomms: a modern PyTorch communications API☆359Updated this week
- Framework to reduce autotune overhead to zero for well known deployments.☆99Sep 19, 2025Updated 7 months ago
- ☆16Apr 23, 2026Updated last week
- CS61C Spring 2015 HW1 Starter Code☆10Feb 4, 2015Updated 11 years ago