☆22 · Jun 5, 2025 · Updated 10 months ago
Alternatives and similar repositories for SQuat
Users that are interested in SQuat are comparing it to the libraries listed below.
- Synthetic Data Generation Toolkit for LLMs ☆129 · Updated this week
- Code for paper DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction ☆13 · Jan 12, 2024 · Updated 2 years ago
- Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025] ☆35 · Feb 4, 2026 · Updated 2 months ago
- ☆52 · Mar 17, 2025 · Updated last year
- ☆21 · Mar 12, 2025 · Updated last year
- ☆41 · Jul 1, 2025 · Updated 9 months ago
- ☆41 · May 24, 2024 · Updated last year
- Implementation for the paper 'Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport' (ICL… ☆19 · Jan 1, 2025 · Updated last year
- ☆57 · Apr 9, 2026 · Updated last week
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference ☆36 · Oct 29, 2025 · Updated 5 months ago
- ☆76 · Jul 28, 2025 · Updated 8 months ago
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities ☆75 · Dec 21, 2025 · Updated 3 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training" ☆11 · Oct 27, 2025 · Updated 5 months ago
- ☆59 · Jan 28, 2025 · Updated last year
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking ☆38 · Jan 20, 2026 · Updated 2 months ago
- ☆25 · Oct 3, 2025 · Updated 6 months ago
- Make reasoning models scalable ☆49 · May 31, 2025 · Updated 10 months ago
- Source code for a LoRA-based continual relation extraction method. ☆14 · Sep 25, 2023 · Updated 2 years ago
- Officially unofficial PyTorch code for the NIPS paper 'Natural-Parameter Networks: A Class of Probabilistic Neural Networks' ☆11 · Sep 28, 2021 · Updated 4 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 · Aug 28, 2025 · Updated 7 months ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation ☆33 · Mar 24, 2026 · Updated 3 weeks ago
- Official implementation for Text Generation Beyond Discrete Token Sampling ☆24 · Aug 11, 2025 · Updated 8 months ago
- [ICML 2023] Taxonomy-Structured Domain Adaptation ☆12 · Oct 6, 2023 · Updated 2 years ago
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele… ☆11 · Mar 3, 2024 · Updated 2 years ago
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 8 months ago
- Pytorch implementation of iCaRL with some extras. ☆15 · Nov 29, 2020 · Updated 5 years ago
- ☆15 · Apr 26, 2025 · Updated 11 months ago
- ☆13 · Apr 1, 2026 · Updated 2 weeks ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs. ☆21 · Jan 24, 2025 · Updated last year
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller" ☆18 · Aug 13, 2024 · Updated last year
- ☆53 · Oct 29, 2024 · Updated last year
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i… ☆17 · Mar 18, 2024 · Updated 2 years ago
- Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning ☆19 · Jan 27, 2024 · Updated 2 years ago
- ☆50 · Apr 29, 2020 · Updated 5 years ago
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers ☆17 · Mar 20, 2024 · Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆16 · Jan 3, 2022 · Updated 4 years ago
- ☆10 · Oct 24, 2024 · Updated last year
- ☆24 · Jul 27, 2022 · Updated 3 years ago
- ☆37 · Mar 17, 2025 · Updated last year