☆65Jul 14, 2025Updated 9 months ago
Alternatives and similar repositories for AutoTriton
Users that are interested in AutoTriton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators☆125Jun 14, 2025Updated 10 months ago
- ☆12Jun 13, 2025Updated 10 months ago
- ☆27Jul 6, 2024Updated last year
- Generating Efficient AI-Centric Kernels☆92Apr 28, 2026Updated last week
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆16Jan 6, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- ☆11Jun 11, 2025Updated 10 months ago
- ☆20Jan 14, 2022Updated 4 years ago
- [DAC2024] Explainable Fuzzy Neural Network with Multi-Fidelity Reinforcement Learning for Micro-Architecture Design Space Exploration☆10Oct 31, 2024Updated last year
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆40Apr 17, 2026Updated 2 weeks ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- Official implementation of TBA for async LLM post-training.☆30Nov 5, 2025Updated 5 months ago
- 上海交通大学软件学院本科计算机图形学课程代码仓库☆14Oct 3, 2025Updated 7 months ago
- GPU-enabled Hardware Fuzzer using Genetic Algorithm☆20Jul 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Mar 2, 2026Updated 2 months ago
- The original Shared Recurrent Memory Transformer implementation☆35Jul 11, 2025Updated 9 months ago
- 上海交通大学软件学院课程《应用系统体系架构》(SE3353)笔记☆12Feb 2, 2024Updated 2 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- ☆28Aug 19, 2025Updated 8 months ago
- SJTU SE3331 CSE (a distributed file system with Raft and MapReduce)☆10Jan 14, 2024Updated 2 years ago
- Utility that parses stack sizes section from elf objects and displays the preallocated stack size of each function.☆14Jan 15, 2020Updated 6 years ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆33Feb 26, 2025Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Apr 25, 2025Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated 2 weeks ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- A torch compile backend for multi-targets☆49Apr 2, 2026Updated last month
- [ACL2023] Source code for Decouple knowledge from paramters for plug-and-play language modeling☆20Sep 18, 2023Updated 2 years ago
- Implementation of various equivariant models in JAX☆19Apr 12, 2024Updated 2 years ago
- The official implementation of "DOTS: Decoupling Operation and Topology in Differentiable Architecture Search"☆20Apr 19, 2021Updated 5 years ago
- ☆46Sep 27, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆23Apr 2, 2025Updated last year
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 9 months ago
- ☆13May 7, 2024Updated last year
- ☆40Jul 15, 2025Updated 9 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated 10 months ago
- ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep l…☆37Oct 24, 2025Updated 6 months ago