[ICLR 2026 ๐ฅ] Dr.LLM: Dynamic Layer Routing in LLMs
โ53Apr 24, 2026Updated 2 months ago
Alternatives and similar repositories for dr-llm
Users that are interested in dr-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).โ15Dec 13, 2024Updated last year
- โ11May 9, 2023Updated 3 years ago
- [NAACL'25 ๐ SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expertโฆโ16Feb 4, 2025Updated last year
- Validating image classification benchmark results on ViTs and ResNets (v2)โ13Nov 3, 2022Updated 3 years ago
- 2022 ็งๅญฃๅญฆๆๆธ ๅๅคงๅญฆ็ตๅญ็ณปๆฐๆฎไธ็ฎๆณ่ฏพ็จ OJ ๅ่่งฃ็ญโ10Jun 18, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.โ35Dec 16, 2025Updated 6 months ago
- All my experiments with the various transformers and various transformer frameworks availableโ14Apr 30, 2021Updated 5 years ago
- โ18Jul 24, 2023Updated 2 years ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)โ42Oct 8, 2025Updated 8 months ago
- Pre-processing DBpedia datasets to load into Dgraphโ13Mar 6, 2022Updated 4 years ago
- 2022้พ่ฏๆฏไธชไบบ่ตไธ็ญๅฅไฝๅโ14Oct 11, 2023Updated 2 years ago
- โ19May 16, 2024Updated 2 years ago
- โ18Mar 15, 2021Updated 5 years ago
- โ10Nov 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- โ24Jan 26, 2026Updated 5 months ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"โ23Feb 16, 2025Updated last year
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuitsโ46Jan 8, 2026Updated 5 months ago
- Token-free Language Modeling with ByGPT5 & Friends!โ12Jul 18, 2025Updated 11 months ago
- ๐๐๐๐๐๐๐๐๐ Reading everythingโ16Mar 11, 2026Updated 3 months ago
- A Large-Scale Dataset for Paraphrased Reading Comprehensionโ15Jul 16, 2023Updated 2 years ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"โ16Dec 16, 2025Updated 6 months ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"โ11Sep 20, 2024Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compressionโ47Aug 7, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Contrastive Predictive Coding for Natural Languageโ10Sep 16, 2020Updated 5 years ago
- โ19Jul 24, 2023Updated 2 years ago
- 2023้พ่ฏๆฏmips่ต้ไฝๅโ14Dec 23, 2023Updated 2 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokobanโ21Jun 29, 2025Updated last year
- Diacritization of Arabic textsโ11Apr 13, 2016Updated 10 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository cโฆโ14May 19, 2020Updated 6 years ago
- Better coding experience for Flaskโ16Oct 21, 2025Updated 8 months ago
- My personal notes for Facebook's Secure and Private AI Scholarship Course 2019 on Udacityโ10Jun 12, 2019Updated 7 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videosโ12Sep 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- โ46Sep 30, 2025Updated 9 months ago
- โ12Oct 4, 2023Updated 2 years ago
- NSCSCC 2023 The Second Prize. TEAM PUA FROM HDU.โ14Mar 29, 2025Updated last year
- Scripts to finetune the official implementation of OpenAI's Whisper modelโ25Apr 14, 2026Updated 2 months ago
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, buโฆโ13Sep 1, 2024Updated last year
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.โ21Apr 11, 2025Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection frโฆโ12Dec 19, 2025Updated 6 months ago