Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29Jun 25, 2025Updated 11 months ago
Alternatives and similar repositories for llm_effective_depth
Users that are interested in llm_effective_depth are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- for EE1520 NCKU☆14May 1, 2025Updated last year
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- Serve LLMs on NCSA hardware. Support the best FOSS models, and the long tail on HuggingFace Hub.☆14May 8, 2024Updated 2 years ago
- Invariant Representation learning☆20Jun 26, 2024Updated last year
- ☆19Apr 26, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Feb 28, 2025Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆21Mar 31, 2025Updated last year
- That's a nice result you got there, but how did you calculate it?☆45Updated this week
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example☆13Jun 3, 2024Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆33Jan 29, 2026Updated 3 months ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆20Nov 24, 2023Updated 2 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆48Apr 23, 2026Updated last month
- Interpretable Diffusion Via Information Decomposition☆29Jul 18, 2024Updated last year
- Training Small Language Model☆29Dec 26, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆27Nov 10, 2025Updated 6 months ago
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag…☆33Oct 20, 2023Updated 2 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely☆44Updated this week
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 10 months ago
- Large-scale uncertainty benchmark in deep learning.☆66May 10, 2025Updated last year
- A handy plugin for copying requests/responses directly from Burp, some extra magic included.☆13Oct 15, 2021Updated 4 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆31Oct 27, 2025Updated 6 months ago
- ☆10Jul 28, 2021Updated 4 years ago
- exploring whether LLMs perform case-based or rule-based reasoning☆31Mar 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆106May 19, 2026Updated last week
- -☆11Nov 21, 2020Updated 5 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- PyTorch implementation of HashedNets☆38Apr 21, 2023Updated 3 years ago
- PeRL: Parameter-Efficient Reinforcement Learning☆80Updated this week
- Tensara's GPU programming problems☆20Apr 23, 2026Updated last month
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated last month
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models☆123Nov 25, 2025Updated 6 months ago
- easy-phi☆25Feb 4, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆47Apr 9, 2025Updated last year
- ☆83Mar 3, 2026Updated 2 months ago
- Lets build a Deep Learning Framework!☆27Mar 12, 2026Updated 2 months ago
- An example platform daemon in Rust; written for Mastering Embedded Linux☆13May 8, 2020Updated 6 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]☆93Mar 11, 2026Updated 2 months ago
- ☆14Jul 5, 2025Updated 10 months ago