Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29 · Jun 25, 2025 · Updated 9 months ago
Alternatives and similar repositories for llm_effective_depth
Users who are interested in llm_effective_depth are comparing it to the libraries listed below.
- for EE1520 NCKU ☆14 · May 1, 2025 · Updated 11 months ago
- https://footprints.baulab.info ☆18 · Oct 4, 2024 · Updated last year
- Serve LLMs on NCSA hardware. Support the best FOSS models, and the long tail on HuggingFace Hub. ☆14 · May 8, 2024 · Updated last year
- Invariant Representation learning ☆20 · Jun 26, 2024 · Updated last year
- ☆19 · Sep 16, 2025 · Updated 7 months ago
- ☆12 · Feb 28, 2025 · Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆20 · Mar 31, 2025 · Updated last year
- Research project management and automation. ☆39 · Apr 6, 2026 · Updated last week
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example ☆13 · Jun 3, 2024 · Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures ☆33 · Jan 29, 2026 · Updated 2 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models ☆42 · Mar 23, 2026 · Updated 3 weeks ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks" ☆18 · Nov 24, 2023 · Updated 2 years ago
- Interpretable Diffusion Via Information Decomposition ☆29 · Jul 18, 2024 · Updated last year
- Training Small Language Model ☆28 · Dec 26, 2023 · Updated 2 years ago
- ☆28 · Nov 10, 2025 · Updated 5 months ago
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag… ☆33 · Oct 20, 2023 · Updated 2 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely ☆44 · Updated this week
- Large-scale uncertainty benchmark in deep learning. ☆64 · May 10, 2025 · Updated 11 months ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD ☆33 · Jul 1, 2025 · Updated 9 months ago
- A handy plugin for copying requests/responses directly from Burp, some extra magic included. ☆13 · Oct 15, 2021 · Updated 4 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆30 · Oct 27, 2025 · Updated 5 months ago
- ☆10 · Jul 28, 2021 · Updated 4 years ago
- exploring whether LLMs perform case-based or rule-based reasoning ☆30 · Mar 2, 2024 · Updated 2 years ago
- ☆89 · Updated this week
- - ☆11 · Nov 21, 2020 · Updated 5 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 2 months ago
- PeRL: Parameter-Efficient Reinforcement Learning ☆74 · Apr 6, 2026 · Updated last week
- PyTorch implementation of HashedNets ☆38 · Apr 21, 2023 · Updated 2 years ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon ☆43 · Mar 31, 2026 · Updated 2 weeks ago
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models ☆118 · Nov 25, 2025 · Updated 4 months ago
- easy-phi ☆25 · Feb 4, 2015 · Updated 11 years ago
- ☆47 · Apr 9, 2025 · Updated last year
- ☆82 · Mar 3, 2026 · Updated last month
- Lets build a Deep Learning Framework! ☆25 · Mar 12, 2026 · Updated last month
- An example platform daemon in Rust; written for Mastering Embedded Linux ☆12 · May 8, 2020 · Updated 5 years ago
- 🪄 Interpreto is an interpretability toolbox for LLMs ☆165 · Apr 6, 2026 · Updated last week
- High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and S… ☆42 · Updated this week
- PolEval 2021 Task 1 ☆15 · Jun 28, 2022 · Updated 3 years ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025] ☆86 · Mar 11, 2026 · Updated last month