The dataset and baseline code for ASC23 LLM inference optimization challenge.
☆34Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for ASC24-LLM-inference-optimization
Users that are interested in ASC24-LLM-inference-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenCAEPoro for ASC 2024☆39Dec 21, 2023Updated 2 years ago
- Ongoing research training transformer models at scale☆18Apr 3, 2026Updated last week
- SEU-HPC | 东南大学超算平台☆26Jan 19, 2025Updated last year
- ✂️ Trim sequencing adapters from NGS data automatically☆14Sep 5, 2025Updated 7 months ago
- 此仓库是我们小组在《计算机游戏开发》课程(深圳大学)的大作业,是一个模仿《slay the spire》的卡牌游戏☆10Jun 28, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆86Nov 12, 2025Updated 5 months ago
- 🧪 Ultrafast bisulfite☆38Apr 23, 2024Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- ☆14Oct 17, 2024Updated last year
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆36Feb 9, 2026Updated 2 months ago
- Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation☆40Mar 30, 2026Updated 2 weeks ago
- ☆57Feb 24, 2026Updated last month
- ☆14Aug 14, 2024Updated last year
- System-on-chip design for NOP in NSCSCC 2023.☆13Aug 21, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆22Jun 13, 2025Updated 10 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- This is the repository of the paper: Roman Numeral Analysis with Graph Neural Networks☆13Nov 6, 2024Updated last year
- A set of learned index papers w/o notes☆17Apr 25, 2024Updated last year
- 南昌大学超算队官方网站☆19Aug 9, 2024Updated last year
- 上海交通大学软件学院课程云操作系统设计与实践(SE3356)笔记☆17Sep 5, 2022Updated 3 years ago
- ☆13Feb 8, 2025Updated last year
- 存储天空盒图片。☆15Jul 13, 2021Updated 4 years ago
- ☆18Nov 19, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Jul 24, 2023Updated 2 years ago
- 南方科技大学数字逻辑课程资料: notes, assignments and project☆15Mar 13, 2023Updated 3 years ago
- ☆18Jul 11, 2024Updated last year
- ☆20Nov 19, 2024Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 6 months ago
- A small GUI Library for Minecraft☆15Oct 19, 2014Updated 11 years ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆29Dec 18, 2024Updated last year
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- DiffSinger Editor developed by OpenVPI☆36Oct 21, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Tile-based language built for AI computation across all scales☆141Mar 27, 2026Updated 2 weeks ago
- Storage Performance Development Kit☆11Apr 6, 2026Updated last week
- ☆22Oct 10, 2025Updated 6 months ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- ☆31Dec 31, 2025Updated 3 months ago
- Nebula: Deep Neural Network Benchmarks in C++☆13Jan 2, 2025Updated last year
- ☆23Dec 17, 2024Updated last year