The dataset and baseline code for the ASC23 LLM inference optimization challenge.
☆34Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for ASC24-LLM-inference-optimization
Users that are interested in ASC24-LLM-inference-optimization are comparing it to the libraries listed below.
- OpenCAEPoro for ASC 2024☆39Dec 21, 2023Updated 2 years ago
- Ongoing research training transformer models at scale☆18Jun 11, 2026Updated last week
- SEU-HPC | Southeast University supercomputing platform☆26Jan 19, 2025Updated last year
- ✂️ Trim sequencing adapters from NGS data automatically☆14Sep 5, 2025Updated 9 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆94Nov 12, 2025Updated 7 months ago
- ☆18May 14, 2025Updated last year
- 🧪 Ultrafast bisulfite☆38Apr 23, 2024Updated 2 years ago
- ☆14Oct 17, 2024Updated last year
- Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation☆41May 25, 2026Updated 3 weeks ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆43Feb 9, 2026Updated 4 months ago
- ☆57Feb 24, 2026Updated 3 months ago
- Simple console ascii chart (lines and bars) - for node, browser and terminal, no dependencies.☆38Apr 3, 2026Updated 2 months ago
- ☆11May 17, 2024Updated 2 years ago
- This project was inspired by the unclecode/crawl4ai repository. It provided valuable insights and ideas that helped shape the development…☆16Dec 25, 2025Updated 5 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- System-on-chip design for NOP in NSCSCC 2023.☆13Aug 21, 2023Updated 2 years ago
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 11 months ago
- Rain is a statistics-based workload generation toolkit that uses parameterized and empirical distributions to model the different classes…☆35Nov 2, 2016Updated 9 years ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆23Jun 13, 2025Updated last year
- A set of learned index papers w/o notes☆18Apr 25, 2024Updated 2 years ago
- Official website of the Nanchang University supercomputing team☆19Aug 9, 2024Updated last year
- ☆13Feb 8, 2025Updated last year
- Stores skybox images.☆15Jul 13, 2021Updated 4 years ago
- Open-source toolkit for training, Priming, and serving next generation Hybrid architectures☆72Jun 9, 2026Updated last week
- A classic 5-stage rv32i(incomplete) toy implementation based on powerful SpinalHDL☆10Jul 5, 2021Updated 4 years ago
- Unofficial implementation of WebFormer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 3 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆27Updated this week
- ☆18Nov 19, 2021Updated 4 years ago
- ☆16Jul 24, 2023Updated 2 years ago
- Welcome to the Chinese sarcasm computation evaluation task!☆14Nov 4, 2024Updated last year
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 2 months ago
- 2022 Southern University of Science and Technology (SUSTech) CS315 Computer Security course reports, full-score solutions☆17Dec 27, 2022Updated 3 years ago
- ☆18Jul 11, 2024Updated last year
- A paper review list for computer architecture and systems research, maintained by the LEMONADE group at Peking University.☆21Jun 11, 2026Updated last week
- Storage Performance Development Kit☆12Jun 9, 2026Updated last week
- psdoom-ng is a First Person Shooter operating system process killer based on psDooM and Chocolate Doom.☆33Jun 10, 2025Updated last year
- ☆31Aug 16, 2024Updated last year