☆56Nov 6, 2024Updated last year
Alternatives and similar repositories for DAM
Users that are interested in DAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆146Aug 20, 2025Updated 10 months ago
- Distill thinking dataset more compactly and accurately!☆38Jun 6, 2025Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- Genetics for Language Models☆18Jul 1, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 4 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 8 months ago
- ☆137Aug 19, 2024Updated last year
- This is the open-source code for TokenCarve.☆25Jan 23, 2026Updated 5 months ago
- ☆40Jun 19, 2024Updated 2 years ago
- PyTorch implementation for MRL☆23Feb 22, 2024Updated 2 years ago
- ☆21Jul 25, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An Open Source Toolkit For LLM Distillation☆973May 12, 2026Updated last month
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆50Oct 1, 2025Updated 9 months ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- A dashboard for exploring timm learning rate schedulers☆20Nov 22, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- Open source LLM arena created by the French Government☆76Updated this week
- ☆162Dec 2, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆37Jul 12, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Dynamic Shell Command MCP Server☆41Feb 27, 2025Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆267Apr 23, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Replicating O1 inference-time scaling laws☆94Dec 1, 2024Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Pytorch code for NeurIPS 2025 paper "Accurate and Efficient Low-Rank Model Merging in Core Space"☆41Feb 2, 2026Updated 4 months ago
- ☆15May 17, 2024Updated 2 years ago
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆70Updated this week
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 9 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆133Dec 3, 2024Updated last year
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- ☆14Sep 7, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆16Jun 26, 2025Updated last year