Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"
☆20 Feb 21, 2025 Updated last year
Alternatives and similar repositories for DarwinLM
Users interested in DarwinLM are comparing it to the libraries listed below.
- The official implementation of Bi-Mamba ☆16 Oct 22, 2025 Updated 6 months ago
- ☆19 Jan 3, 2025 Updated last year
- [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple. ☆27 Apr 21, 2025 Updated last year
- ☆13 Nov 29, 2024 Updated last year
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers ☆15 Feb 7, 2025 Updated last year
- ☆30 Jun 11, 2023 Updated 2 years ago
- [ACL2026 Main] AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts ☆77 Jan 23, 2026 Updated 3 months ago
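- Several classic examples in AI, ML, DA, DV, and related areas, including traffic jam simulation (NagelSchreckenberg), the Monte Carlo queuing problem, face recognition (RecognitionFace), and image inference with a genetic algorithm (IconGenetic) ☆10 Oct 14, 2018 Updated 7 years ago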
- ☆16 Oct 18, 2024 Updated last year
- ☆31 Feb 8, 2026 Updated 2 months ago
- vortex particles for simulating smoke in 2d ☆16 Dec 13, 2021 Updated 4 years ago
- ☆14 Apr 14, 2025 Updated last year
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries". ☆12 Jan 9, 2025 Updated last year
- ☆13 Jan 22, 2025 Updated last year
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation ☆51 Aug 24, 2025 Updated 8 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery ☆20 Sep 24, 2025 Updated 7 months ago
- My personal site, using Wowchemy ☆13 Updated this week
- Official implementation of the paper: "A deeper look at depth pruning of LLMs" ☆15 Jul 24, 2024 Updated last year
- This is the official repo for "Differentiable Model Scaling using Differentiable Topk" ☆12 May 16, 2024 Updated last year
- [TVCG & VR'25] LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization ☆11 Apr 16, 2026 Updated 2 weeks ago
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows ☆15 Oct 4, 2024 Updated last year
- PCA-SVD-Autoencoder-Fourier-Wavelet-Transformation-for-denoising ☆22 Feb 16, 2022 Updated 4 years ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models ☆75 Jan 6, 2024 Updated 2 years ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆20 Mar 31, 2025 Updated last year
- Control LLM ☆23 Apr 6, 2025 Updated last year
- The official implementation of "DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range" ☆24 Aug 25, 2021 Updated 4 years ago
- [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models ☆59 Aug 9, 2024 Updated last year
- https://footprints.baulab.info ☆18 Oct 4, 2024 Updated last year
- Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity" ☆81 Jul 7, 2025 Updated 9 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆34 Feb 5, 2023 Updated 3 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆15 May 10, 2024 Updated last year
- Lab assignments for the 2021 Parallel Programming course taught by Zheng Qilong at USTC ☆11 Jan 15, 2022 Updated 4 years ago
- Implementation code for ACL2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing ☆15 Apr 20, 2024 Updated 2 years ago
- [ACL2024] Exploring the Potential of Large Language Models in Computational Argumentation ☆18 Aug 21, 2024 Updated last year
- Landmark Deep Equilibrium Model (LDEQ), applied to videos with a Recurrence without Recurrence (RwR) paradigm ☆43 Jun 18, 2023 Updated 2 years ago
- ☆33 Oct 13, 2025 Updated 6 months ago
- ☆14 Jan 5, 2022 Updated 4 years ago
- A thesis template compliant with King's College London and UCL rules ☆19 Dec 14, 2025 Updated 4 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based … ☆11 Mar 18, 2023 Updated 3 years ago