[ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs
☆44 · Oct 15, 2025 · Updated 5 months ago
Alternatives and similar repositories for dr-llm
Users that are interested in dr-llm are comparing it to the libraries listed below
- ☆11 · May 9, 2023 · Updated 2 years ago
- ☆12 · Aug 2, 2022 · Updated 3 years ago
- supplement material for BlackHat2020 talk: Multiple Bugs in Multi-Party Computation: Breaking Cryptocurrency's Strongest Wallets ☆12 · Aug 13, 2020 · Updated 5 years ago
- A connector to Rainbow Bridge that allows sending $NEAR to Ethereum as an ERC-20 token (eNEAR) ☆10 · Mar 29, 2025 · Updated 11 months ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca. ☆34 · Dec 16, 2025 · Updated 3 months ago
- ☆11 · Dec 8, 2024 · Updated last year
- All my experiments with the various transformers and various transformer frameworks available ☆14 · Apr 30, 2021 · Updated 4 years ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025 ☆17 · Jan 12, 2026 · Updated 2 months ago
- ☆12 · Dec 29, 2023 · Updated 2 years ago
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?" ☆12 · May 26, 2024 · Updated last year
- Code used to create the Linked WikiText-2 dataset ☆16 · May 22, 2023 · Updated 2 years ago
- Pre-processing DBpedia datasets to load into Dgraph ☆13 · Mar 6, 2022 · Updated 4 years ago
- ☆28 · May 27, 2024 · Updated last year
- ☆10 · Jul 21, 2023 · Updated 2 years ago
- ☆24 · Jan 26, 2026 · Updated last month
- ☆10 · Nov 6, 2024 · Updated last year
- Token-free Language Modeling with ByGPT5 & Friends! ☆12 · Jul 18, 2025 · Updated 8 months ago
- 📚📚📚📚📚📚📚📚📚 Reading everything ☆15 · Mar 11, 2026 · Updated last week
- Tunisian Arabish Corpus ☆12 · Mar 12, 2024 · Updated 2 years ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆69 · Nov 14, 2024 · Updated last year
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models" ☆15 · Dec 16, 2025 · Updated 3 months ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization" ☆11 · Sep 20, 2024 · Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression ☆34 · Aug 7, 2025 · Updated 7 months ago
- Curated list of Moroccans publishing in the most prestigious AI conferences ☆10 · Oct 14, 2024 · Updated last year
- ☆19 · Jul 24, 2023 · Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning" ☆17 · Jan 12, 2024 · Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c… ☆14 · May 19, 2020 · Updated 5 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆15 · Feb 1, 2025 · Updated last year
- A repo of useful MLX skills. ☆77 · Jan 25, 2026 · Updated last month
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" … ☆34 · Jan 8, 2023 · Updated 3 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos ☆12 · Sep 24, 2024 · Updated last year
- snarkpack for arkwork ☆22 · Jun 11, 2023 · Updated 2 years ago
- ☆16 · May 27, 2024 · Updated last year
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu… ☆13 · Sep 1, 2024 · Updated last year
- Scripts to finetune the official implementation of OpenAI's Whisper model ☆24 · Jul 6, 2025 · Updated 8 months ago
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task. ☆21 · Apr 11, 2025 · Updated 11 months ago
- Reading comprehension on the Holy Qur'an ☆10 · Oct 15, 2025 · Updated 5 months ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr… ☆12 · Dec 19, 2025 · Updated 3 months ago
- Interactive clap ☆18 · Dec 10, 2025 · Updated 3 months ago