☆74Sep 15, 2025Updated 5 months ago
Alternatives and similar repositories for TrEnv-X
Users that are interested in TrEnv-X are comparing it to the libraries listed below
Sorting:
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- ☆28Updated this week
- hot page accounting and migration☆26Apr 23, 2019Updated 6 years ago
- ☆28Aug 14, 2024Updated last year
- An experimental communicating attention kernel based on DeepEP.☆35Jul 29, 2025Updated 7 months ago
- https://bbuf.github.io/gpu-glossary-zh/☆26Nov 7, 2025Updated 3 months ago
- ☆28Jun 22, 2025Updated 8 months ago
- Might be a graph storage engine. (WIP)☆13May 14, 2023Updated 2 years ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆25Updated this week
- ☆53Feb 24, 2026Updated last week
- Eurosys22' - Rolis: a software approach to efficiently replicating multi-core transactions☆17Feb 28, 2024Updated 2 years ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆210Sep 21, 2024Updated last year
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆127Nov 10, 2025Updated 3 months ago
- ☆18Aug 5, 2025Updated 7 months ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆90Jan 7, 2026Updated last month
- Efficient GPU communication over multiple NICs.☆26Nov 20, 2025Updated 3 months ago
- A Streaming-Native Serving Engine for TTS/STS Models☆56Feb 22, 2026Updated last week
- A Filesystem Semi-Microkernel.☆46Oct 24, 2023Updated 2 years ago
- PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.☆29Feb 3, 2026Updated last month
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- Virtual Memory Abstraction for Serverless Architectures☆49Mar 18, 2022Updated 3 years ago
- ☆20Nov 18, 2023Updated 2 years ago
- ☆51Mar 13, 2024Updated last year
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆50Mar 3, 2024Updated 2 years ago
- ☆34Jun 22, 2024Updated last year
- Automatic resource configuration for serverless workflows.☆21Mar 24, 2024Updated last year
- ☆44Updated this week
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆92Jan 26, 2026Updated last month
- An experimental parallel training platform☆56Mar 25, 2024Updated last year
- ☆57May 14, 2024Updated last year
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 9 months ago
- 💻 SETA: Scaling Environments for Terminal Agents☆67Feb 16, 2026Updated 2 weeks ago
- NVIDIA Inference Xfer Library (NIXL)☆898Updated this week
- [HotStorage'24 Best Paper] Can Modern LLMs Tune and Configure LSM-based Key-Value Stores?☆27Nov 27, 2024Updated last year
- A benchmark suite for evaluating FaaS scheduler.☆23Nov 5, 2022Updated 3 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆51Oct 11, 2025Updated 4 months ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆77Oct 15, 2025Updated 4 months ago
- Open ABI and FFI for Machine Learning Systems☆355Updated this week
- ☆28Sep 17, 2024Updated last year