☆82Nov 11, 2024Updated last year
Alternatives and similar repositories for LoQT
Users that are interested in LoQT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆17Apr 24, 2025Updated last year
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆107Jul 1, 2024Updated last year
- ☆17Dec 7, 2025Updated 4 months ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Mar 29, 2025Updated last year
- ☆13Apr 27, 2026Updated last week
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆17Mar 23, 2025Updated last year
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- ☆56Jul 7, 2025Updated 9 months ago
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆29Aug 6, 2025Updated 9 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 9 months ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML2025] LoRA fine-tune directly on the INT4 models.☆40Nov 25, 2024Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆60Dec 1, 2024Updated last year
- ☆15Nov 7, 2024Updated last year
- ☆19Feb 23, 2026Updated 2 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- ☆39Aug 27, 2024Updated last year
- Official implementation of Decoupled MeanFlow☆41Oct 28, 2025Updated 6 months ago
- ☆52Nov 5, 2024Updated last year
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆25Dec 15, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆18Apr 17, 2026Updated 2 weeks ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 6 months ago
- ☆22Dec 1, 2021Updated 4 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 3 months ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆57Apr 6, 2025Updated last year
- DMax: Aggressive Parallel Decoding for dLLMs☆112Apr 20, 2026Updated 2 weeks ago
- ☆25Oct 31, 2024Updated last year
- ☆53Oct 29, 2024Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆90Apr 20, 2026Updated 2 weeks ago
- ☆31Aug 27, 2024Updated last year
- SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)☆36Nov 28, 2025Updated 5 months ago
- Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)☆19Nov 22, 2024Updated last year
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆148Apr 8, 2025Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year