[ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"
☆26Feb 19, 2025Updated last year
Alternatives and similar repositories for IRT-Router
Users that are interested in IRT-Router are comparing it to the libraries listed below
Sorting:
- ☆18Dec 30, 2025Updated 2 months ago
- ☆10Mar 8, 2025Updated 11 months ago
- Continuous Pipelined Speculative Decoding☆16Jan 4, 2026Updated last month
- Secure Inference Resilient Against Malicious Clients☆15May 3, 2022Updated 3 years ago
- ☆12Jan 9, 2026Updated last month
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Exploring how optimizations for GEMMs work☆28Jan 1, 2026Updated 2 months ago
- 2023/12/22 电三 420 每周会议技术分享:「容器」的 slides 和附件☆10Dec 22, 2023Updated 2 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Code implementation for paper AbsenceBench: Language Models Can't Tell What's Missing☆17Oct 23, 2025Updated 4 months ago
- This CG provides a safe space to assess use cases, modularization (role, scope, outcomes), existing and emerging AI architectures, progre…☆21Oct 9, 2025Updated 4 months ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Sep 2, 2021Updated 4 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆49Jan 26, 2026Updated last month
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated 10 months ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Code for NDSS '25 paper "Passive Inference Attacks on Split Learning via Adversarial Regularization"☆13Sep 16, 2024Updated last year
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆23Sep 21, 2025Updated 5 months ago
- ☆11Aug 10, 2021Updated 4 years ago
- ☆17Jun 18, 2025Updated 8 months ago
- ☆12Jul 12, 2020Updated 5 years ago
- 2022 USTC 011705 (OSH) Course Project of Runikraft Group☆13Jul 22, 2022Updated 3 years ago
- ☆14Aug 19, 2024Updated last year
- ☆11Sep 28, 2023Updated 2 years ago
- ☆12Mar 27, 2024Updated last year
- Toolkit for Universal Retrieval, such as text retrieval, item recommendation, image retrieval, etc.☆17Sep 15, 2025Updated 5 months ago
- Generate massive fake datasets for your datalake, fast. By SOMA☆20Oct 30, 2025Updated 4 months ago
- Multi-GPU CUDA based scheduler.☆13Jul 20, 2017Updated 8 years ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated last week
- A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimizat…☆35Nov 20, 2025Updated 3 months ago
- ☆15Aug 15, 2024Updated last year
- TempoPFN: Zero-shot Time Series Forecasting (accepted at EurIPS 2025 AI for Tabular Data Workshop)☆35Nov 10, 2025Updated 3 months ago
- Benchmark of robust self-supervised learning (RobustSSL) methods & Code for AutoLoRa (ICLR 2024).☆19Dec 10, 2025Updated 2 months ago
- Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification☆16Jan 8, 2024Updated 2 years ago
- ☆14Jul 24, 2024Updated last year
- This is the official implementation of NNSplitter (ICML'23)☆12Jun 11, 2024Updated last year
- ☆22Jan 29, 2026Updated last month
- ☆14Jan 12, 2022Updated 4 years ago
- ☆11Feb 5, 2026Updated 3 weeks ago