☆29Mar 13, 2026Updated last week
Alternatives and similar repositories for BenchMAX
Users that are interested in BenchMAX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用torch.distributed实现DP/TP/PP☆13Dec 28, 2023Updated 2 years ago
- ☆17Aug 28, 2025Updated 6 months ago
- ☆22Dec 11, 2024Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆25Oct 17, 2025Updated 5 months ago
- ACL24☆11Jun 7, 2024Updated last year
- ACL Rolling Review website☆11Updated this week
- ☆32Feb 8, 2025Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- EACL 2021☆11May 4, 2021Updated 4 years ago
- Reverse engineered ChatGPT API☆10Feb 14, 2023Updated 3 years ago
- Gene Neural Network (GNN)☆11Oct 5, 2019Updated 6 years ago
- Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project☆25Feb 22, 2025Updated last year
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 3 years ago
- Global Greedy Dependency Parsing☆10Mar 16, 2021Updated 5 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- ☆34Apr 1, 2025Updated 11 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆34Apr 5, 2025Updated 11 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 6 months ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆113Jan 30, 2026Updated last month
- TensorFlow 中文文档☆11Jul 6, 2019Updated 6 years ago
- Awesome list for High Performance Computing / Parallel Computing resources.☆12Sep 20, 2017Updated 8 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- GenExam: A Multidisciplinary Text-to-Image Exam☆63Feb 27, 2026Updated 3 weeks ago
- ☆20Sep 11, 2025Updated 6 months ago
- a standalone pitch extractor☆13Oct 19, 2017Updated 8 years ago
- Volcengine TOS C++ SDK☆11Feb 28, 2026Updated 3 weeks ago
- Collection of different types of transformers for learning purposes☆12Jan 30, 2020Updated 6 years ago
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- Blogs that I'm actively following.☆14Sep 17, 2023Updated 2 years ago
- A top-down text-level discourse parser.☆17Jun 26, 2023Updated 2 years ago
- Empowering LLM Agents for Real-World Computer System Optimization☆17Sep 10, 2025Updated 6 months ago
- Research Artifact For Our Submission To VLDB☆10Oct 27, 2021Updated 4 years ago
- Fast and memory-efficient exact attention☆21Mar 13, 2026Updated last week
- Encode and decode pairs of surrogate characters in Python 3☆10Mar 9, 2022Updated 4 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 6 months ago