The dataset and baseline code for ASC23 LLM inference optimization challenge.
☆34Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for ASC24-LLM-inference-optimization
Users that are interested in ASC24-LLM-inference-optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenCAEPoro for ASC 2024☆39Dec 21, 2023Updated 2 years ago
- Ongoing research training transformer models at scale☆18Apr 9, 2026Updated last month
- SEU-HPC | 东南大学超算平台☆26Jan 19, 2025Updated last year
- ✂️ Trim sequencing adapters from NGS data automatically☆14Sep 5, 2025Updated 8 months ago
- 此仓库是我们小组在《计算机游戏开发》课程(深圳大学)的大作业,是一个模仿《slay the spire》的卡牌游戏☆10Jun 28, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆89Nov 12, 2025Updated 5 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- 浙江工业大学,Internet编程(Javaweb课程设计),软件测试管理系统☆15Dec 2, 2020Updated 5 years ago
- 🧪 Ultrafast bisulfite☆38Apr 23, 2024Updated 2 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆39Feb 9, 2026Updated 2 months ago
- ☆14Aug 14, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- System-on-chip design for NOP in NSCSCC 2023.☆13Aug 21, 2023Updated 2 years ago
- ☆13Feb 1, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 10 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 2 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆23Jun 13, 2025Updated 10 months ago
- A set of learned index papers w/o notes☆17Apr 25, 2024Updated 2 years ago
- 南昌大学超算队官方网站☆19Aug 9, 2024Updated last year
- 上海交通大学软件学院课程云操作系统设计与实践(SE3356)笔记☆17Sep 5, 2022Updated 3 years ago
- ☆17Jun 11, 2025Updated 10 months ago
- ☆13Feb 8, 2025Updated last year
- 存储天空盒图片。☆15Jul 13, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- a simple API to use CUPTI☆10Aug 19, 2025Updated 8 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆22Updated this week
- ☆18Nov 19, 2021Updated 4 years ago
- ☆16Jul 24, 2023Updated 2 years ago
- 欢迎参加中文讽刺计算评测任务!☆14Nov 4, 2024Updated last year
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆35Apr 7, 2026Updated last month
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 6 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆29Dec 18, 2024Updated last year
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A paper review list for computer architecture and systems research, maintained by the LEMONADE group at Peking University.☆17Apr 23, 2026Updated 2 weeks ago
- DiffSinger Editor developed by OpenVPI☆36Oct 21, 2025Updated 6 months ago
- deprecated, use https://github.com/octohelm/piper instead.☆14Sep 3, 2024Updated last year
- ☆24May 9, 2025Updated last year
- Tile-based language built for AI computation across all scales☆146Updated this week
- 能自己部署的微信机器人,使用免费的大模型API☆23Nov 26, 2024Updated last year
- Helper script to install GNS3 on Arch☆10Feb 7, 2023Updated 3 years ago