[ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"
☆34Feb 19, 2025Updated last year
Alternatives and similar repositories for IRT-Router
Users that are interested in IRT-Router are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Feb 21, 2023Updated 3 years ago
- Hardware implementation of a Fixed Point Recursive Forward and Inverse FFT algorithm☆17Mar 3, 2018Updated 8 years ago
- NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.☆14Dec 19, 2024Updated last year
- Implementation to VirtualTaobao☆13Jan 17, 2020Updated 6 years ago
- FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning☆17Jun 2, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Mar 8, 2025Updated last year
- ☆15Dec 22, 2022Updated 3 years ago
- ☆32May 30, 2025Updated last year
- Teacher - student distillation using DeepSpeed☆20Oct 7, 2022Updated 3 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆30Jun 5, 2025Updated last year
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- ☆20Jun 10, 2026Updated 3 weeks ago
- The simulator for education☆16Mar 16, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated 2 years ago
- ☆15Jun 18, 2024Updated 2 years ago
- ☆16Aug 19, 2024Updated last year
- ☆15Nov 26, 2024Updated last year
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆63Apr 8, 2026Updated 2 months ago
- This CG provides a safe space to assess use cases, modularization (role, scope, outcomes), existing and emerging AI architectures, progre…☆31Oct 9, 2025Updated 8 months ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago
- Continuous Pipelined Speculative Decoding☆20May 25, 2026Updated last month
- ☆11Jul 12, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆22Jun 7, 2024Updated 2 years ago
- ☆14Jan 12, 2022Updated 4 years ago
- ☆24Apr 9, 2024Updated 2 years ago
- Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)☆10Jun 5, 2024Updated 2 years ago
- SAGA: A Security Architecture for Governing AI Agentic Systems☆24May 18, 2026Updated last month
- Code for the paper "Age of Information Analysis in Edge Computing Servers"☆22Feb 12, 2024Updated 2 years ago
- Adversarial Attack Zoo and Victim Model Zoo for general Pixel-to-Pixel Tasks☆16May 26, 2020Updated 6 years ago
- ☆12Mar 27, 2024Updated 2 years ago
- This is the repository for codes in paper "ShaderPerFormer: Platform-independent Context-aware Shader Performance Predictor"☆12May 16, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Mel-frequency cepstrum core in FPGA☆21Jun 30, 2021Updated 5 years ago
- Official PyTorch implemetation of paper "X-Adv: Physical Adversarial Object Attacks against X-ray Prohibited Item Detection".☆16Feb 21, 2023Updated 3 years ago
- Code for NDSS '25 paper "Passive Inference Attacks on Split Learning via Adversarial Regularization"☆13Sep 16, 2024Updated last year
- ☆20Feb 9, 2020Updated 6 years ago
- ☆15Aug 15, 2024Updated last year
- ☆33Aug 30, 2025Updated 10 months ago
- INFOCOM 2024: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference☆34Oct 13, 2024Updated last year