[ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs
☆45Apr 21, 2026Updated last week
Alternatives and similar repositories for dr-llm
Users that are interested in dr-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-Agent LLM Evaluation Docs: https://maseval.readthedocs.io/☆31Apr 16, 2026Updated 2 weeks ago
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Validating image classification benchmark results on ViTs and ResNets (v2)☆13Nov 3, 2022Updated 3 years ago
- a website for accessing many models through api(deepseek、Qwen、Hunyuan etc.)☆16Jul 12, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official repository for the paper "Salient Mask-Guided Vision Transformer for Fine-Grained Classification" (VISIGRAPP '23)☆21Mar 6, 2023Updated 3 years ago
- 2022 秋季学期清华大学电子系数据与算法课程 OJ 参考解答☆10Jun 18, 2023Updated 2 years ago
- ☆40Mar 25, 2026Updated last month
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆35Dec 16, 2025Updated 4 months ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 5 years ago
- ☆11Dec 8, 2024Updated last year
- ☆18Jul 24, 2023Updated 2 years ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆39Oct 8, 2025Updated 6 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated last year
- 2022龙芯杯个人赛三等奖作品☆14Oct 11, 2023Updated 2 years ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- 📚📚📚📚📚📚📚📚📚 Reading everything☆15Mar 11, 2026Updated last month
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆72Nov 14, 2024Updated last year
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆15Dec 16, 2025Updated 4 months ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆45Aug 7, 2025Updated 8 months ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆11Oct 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".☆25Jul 10, 2023Updated 2 years ago
- Launch programs on multiple hosts. (多机启动程序)☆14Jul 4, 2023Updated 2 years ago
- NSCSCC “龙芯杯” 2024 个人赛 LoongArch 赛道三等奖☆16Aug 17, 2024Updated last year
- ☆15Oct 20, 2023Updated 2 years ago
- Implementation of Contrastive Predictive Coding for Natural Language☆10Sep 16, 2020Updated 5 years ago
- ☆19Jul 24, 2023Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Jan 12, 2024Updated 2 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆19Jun 29, 2025Updated 10 months ago
- 2023龙芯杯mips赛道作品☆14Dec 23, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- 清华大学第八届人工智能挑战 赛电子系赛道(原电子系第 26 届队式程序设计大赛 teamstyle26)☆16Apr 14, 2026Updated 2 weeks ago
- VoiRS is a cutting-edge Text-to-Speech (TTS), Voice Recognition, Sound framework that unifies high-performance crates from the cool-japan…☆31Updated this week
- https://liuzeming01.github.io/XDailyDialog/☆15Jun 25, 2023Updated 2 years ago
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Sep 24, 2024Updated last year
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …☆34Jan 8, 2023Updated 3 years ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆84May 18, 2024Updated last year