Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29Jun 25, 2025Updated 11 months ago
Alternatives and similar repositories for llm_effective_depth
Users that are interested in llm_effective_depth are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- for EE1520 NCKU☆14May 1, 2025Updated last year
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- Serve LLMs on NCSA hardware. Support the best FOSS models, and the long tail on HuggingFace Hub.☆14May 8, 2024Updated 2 years ago
- Invariant Representation learning☆20Jun 26, 2024Updated last year
- ☆20Apr 26, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆12Feb 28, 2025Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆21Mar 31, 2025Updated last year
- Accelerated research project management for humans (and AI agents).☆48Updated this week
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example☆13Jun 3, 2024Updated 2 years ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆33Jan 29, 2026Updated 4 months ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆20Nov 24, 2023Updated 2 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆49Apr 23, 2026Updated last month
- Interpretable Diffusion Via Information Decomposition☆29Jul 18, 2024Updated last year
- Training Small Language Model☆29Dec 26, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Jun 2, 2026Updated last week
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag…☆33Oct 20, 2023Updated 2 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely☆45Updated this week
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 11 months ago
- Large-scale uncertainty benchmark in deep learning.☆67May 10, 2025Updated last year
- A handy plugin for copying requests/responses directly from Burp, some extra magic included.☆13Oct 15, 2021Updated 4 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆31Oct 27, 2025Updated 7 months ago
- ☆10Jul 28, 2021Updated 4 years ago
- exploring whether LLMs perform case-based or rule-based reasoning☆31Mar 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- -☆11Nov 21, 2020Updated 5 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- ☆108Jun 6, 2026Updated last week
- PyTorch implementation of HashedNets☆38Apr 21, 2023Updated 3 years ago
- PeRL: Parameter-Efficient Reinforcement Learning☆79May 20, 2026Updated 3 weeks ago
- Tensara's GPU programming problems☆20Apr 23, 2026Updated last month
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated 2 months ago
- ☆35May 25, 2026Updated 3 weeks ago
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models☆123Nov 25, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- easy-phi☆25Feb 4, 2015Updated 11 years ago
- ☆47Apr 9, 2025Updated last year
- ☆83Mar 3, 2026Updated 3 months ago
- Lets build a Deep Learning Framework!☆27Mar 12, 2026Updated 3 months ago
- An example platform daemon in Rust; written for Mastering Embedded Linux☆13May 8, 2020Updated 6 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]☆95Mar 11, 2026Updated 3 months ago