☆79 Sep 15, 2025 Updated 7 months ago
Alternatives and similar repositories for TrEnv-X
Users that are interested in TrEnv-X are comparing it to the libraries listed below.
- An Attention Superoptimizer ☆22 Jan 20, 2025 Updated last year
- hot page accounting and migration ☆26 Apr 23, 2019 Updated 6 years ago
- An experimental communicating attention kernel based on DeepEP. ☆35 Jul 29, 2025 Updated 8 months ago
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training ☆31 May 2, 2025 Updated 11 months ago
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit ☆95 Apr 6, 2026 Updated last week
- ☆28 Aug 14, 2024 Updated last year
- High Performance KV Cache Store for LLM ☆53 Apr 6, 2026 Updated last week
- Eurosys22' - Rolis: a software approach to efficiently replicating multi-core transactions ☆17 Feb 28, 2024 Updated 2 years ago
- ☆57 Feb 24, 2026 Updated last month
- https://bbuf.github.io/gpu-glossary-zh/ ☆26 Nov 7, 2025 Updated 5 months ago
- Efficient GPU communication over multiple NICs. ☆27 Nov 20, 2025 Updated 4 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ☆25 Aug 8, 2024 Updated last year
- WaferLLM: Large Language Model Inference at Wafer Scale ☆97 Apr 4, 2026 Updated last week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable ☆213 Sep 21, 2024 Updated last year
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation… ☆134 Nov 10, 2025 Updated 5 months ago
- ☆28 Jun 22, 2025 Updated 9 months ago
- ☆42 Mar 23, 2026 Updated 3 weeks ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development" ☆45 Mar 31, 2026 Updated 2 weeks ago
- Automatic resource configuration for serverless workflows. ☆21 Mar 24, 2024 Updated 2 years ago
- ☆23 Oct 10, 2025 Updated 6 months ago
- Verlog: A Multi-turn RL framework for LLM agents ☆72 Mar 27, 2026 Updated 2 weeks ago
- Might be a graph storage engine. (WIP) ☆13 May 14, 2023 Updated 2 years ago
- A Filesystem Semi-Microkernel. ☆46 Oct 24, 2023 Updated 2 years ago
- PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system. ☆32 Apr 1, 2026 Updated 2 weeks ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆13 Nov 23, 2024 Updated last year
- ☆28 Jul 29, 2025 Updated 8 months ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques. ☆11 Sep 18, 2024 Updated last year
- Virtual Memory Abstraction for Serverless Architectures ☆49 Mar 18, 2022 Updated 4 years ago
- NVIDIA Inference Xfer Library (NIXL) ☆970 Updated this week
- ☆57 May 14, 2024 Updated last year
- ☆13 May 22, 2023 Updated 2 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆81 Oct 15, 2025 Updated 6 months ago
- Compiler for Dynamic Neural Networks ☆45 Nov 13, 2023 Updated 2 years ago
- The source of LMSYS website and blogs ☆83 Updated this week
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning ☆81 Jan 16, 2026 Updated 2 months ago
- ☆12 Mar 26, 2024 Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche… ☆107 Dec 24, 2022 Updated 3 years ago
- High performance Transformer implementation in C++. ☆154 Jan 18, 2025 Updated last year
- A Streaming-Native Serving Engine for TTS/STS Models ☆66 Updated this week