Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29 · Jun 25, 2025 · Updated 9 months ago
Alternatives and similar repositories for llm_effective_depth
Users that are interested in llm_effective_depth are comparing it to the libraries listed below.
- for EE1520 NCKU ☆13 · May 1, 2025 · Updated 10 months ago
- https://footprints.baulab.info ☆18 · Oct 4, 2024 · Updated last year
- Serve LLMs on NCSA hardware. Support the best FOSS models, and the long tail on HuggingFace Hub. ☆14 · May 8, 2024 · Updated last year
- Invariant Representation learning ☆20 · Jun 26, 2024 · Updated last year
- ☆19 · Sep 16, 2025 · Updated 6 months ago
- ☆12 · Feb 28, 2025 · Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆19 · Mar 31, 2025 · Updated 11 months ago
- A composition and automation layer for research workflows. ☆38 · Mar 18, 2026 · Updated last week
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example ☆13 · Jun 3, 2024 · Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures ☆31 · Jan 29, 2026 · Updated last month
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models ☆40 · Mar 20, 2026 · Updated last week
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks" ☆18 · Nov 24, 2023 · Updated 2 years ago
- Training Small Language Model ☆28 · Dec 26, 2023 · Updated 2 years ago
- Interpretable Diffusion Via Information Decomposition ☆29 · Jul 18, 2024 · Updated last year
- ☆28 · Nov 10, 2025 · Updated 4 months ago
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag… ☆33 · Oct 20, 2023 · Updated 2 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely ☆43 · Updated this week
- Large-scale uncertainty benchmark in deep learning. ☆64 · May 10, 2025 · Updated 10 months ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD ☆33 · Jul 1, 2025 · Updated 8 months ago
- A handy plugin for copying requests/responses directly from Burp, some extra magic included. ☆13 · Oct 15, 2021 · Updated 4 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆30 · Oct 27, 2025 · Updated 4 months ago
- ☆10 · Jul 28, 2021 · Updated 4 years ago
- exploring whether LLMs perform case-based or rule-based reasoning ☆30 · Mar 2, 2024 · Updated 2 years ago
- ☆87 · Mar 17, 2026 · Updated last week
- - ☆11 · Nov 21, 2020 · Updated 5 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆18 · Feb 9, 2026 · Updated last month
- High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and S… ☆27 · Updated this week
- PeRL: Parameter-Efficient Reinforcement Learning ☆73 · Mar 10, 2026 · Updated 2 weeks ago
- PyTorch implementation of HashedNets ☆38 · Apr 21, 2023 · Updated 2 years ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon ☆42 · Mar 3, 2026 · Updated 3 weeks ago
- OpenCode GUI extension for VSCode ☆24 · Mar 11, 2026 · Updated 2 weeks ago
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models ☆118 · Nov 25, 2025 · Updated 4 months ago
- easy-phi ☆25 · Feb 4, 2015 · Updated 11 years ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025] ☆85 · Mar 11, 2026 · Updated 2 weeks ago
- ☆47 · Apr 9, 2025 · Updated 11 months ago
- 🪄 Interpreto is an interpretability toolbox for LLMs ☆161 · Updated this week
- An example platform daemon in Rust; written for Mastering Embedded Linux ☆12 · May 8, 2020 · Updated 5 years ago
- PolEval 2021 Task 1 ☆15 · Jun 28, 2022 · Updated 3 years ago
- Automated tools for creating streamlined Windows 11 images with CI/CD support. Builds Tiny11 and Tiny11 Core ISOs with GitHub Actions wor… ☆28 · Mar 15, 2026 · Updated last week