[DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing
☆211Dec 11, 2025Updated 6 months ago
Alternatives and similar repositories for AvengersPro
Users that are interested in AvengersPro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆68Apr 6, 2026Updated 2 months ago
- memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B☆21May 26, 2024Updated 2 years ago
- ☆128Apr 19, 2026Updated last month
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- Build your agent from 200,000+ skills via skill RETRIEVAL & ORCHESTRATION☆424Mar 7, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆58Apr 8, 2026Updated 2 months ago
- The code of RouterDC☆75Apr 14, 2025Updated last year
- ☆56Mar 18, 2026Updated 3 months ago
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 11 months ago
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆32Aug 7, 2024Updated last year
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆62Mar 25, 2026Updated 2 months ago
- ☆30May 24, 2025Updated last year
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…☆94Updated this week
- AIBuildAI Inc☆284Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- Simple python rasterizer tool implemented by OpenGL and C++☆15Nov 10, 2025Updated 7 months ago
- The first high school physics Olympiad benchmark for evaluating (M)LLMs with step-level grading and human-level comparison.☆25Dec 19, 2025Updated 5 months ago
- Reproducible and flexible LLM evaluations for scientific reasoning.☆28Jul 23, 2025Updated 10 months ago
- This is the open-source code for TokenCarve.☆25Jan 23, 2026Updated 4 months ago
- ☆48Sep 13, 2025Updated 9 months ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆14Jan 2, 2024Updated 2 years ago
- scDNAm-GPT: A Foundation Model for Analyzing Single-Cell Whole-Genome Bisulfite Sequencing☆19Nov 30, 2025Updated 6 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆322Oct 2, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆63Aug 2, 2024Updated last year
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆32Feb 19, 2025Updated last year
- this is for fun, ain't it grand!☆21Sep 18, 2025Updated 9 months ago
- ☆17Nov 3, 2024Updated last year
- ☆337May 31, 2025Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 4 months ago
- Marketplace ML experiment - training without backprop☆28Sep 9, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 学习Rust的书籍资料☆28May 5, 2026Updated last month
- ☆121Apr 27, 2025Updated last year
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆115Jun 3, 2026Updated 2 weeks ago
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆21Mar 5, 2025Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆129Feb 4, 2026Updated 4 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆258Feb 4, 2026Updated 4 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆65Apr 11, 2026Updated 2 months ago