☆82Nov 11, 2024Updated last year
Alternatives and similar repositories for LoQT
Users that are interested in LoQT are comparing it to the libraries listed below.
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆16Apr 24, 2025Updated 11 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 3 months ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization☆18Sep 17, 2025Updated 6 months ago
- ☆17Dec 7, 2025Updated 4 months ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- ☆27Mar 29, 2025Updated last year
- ☆13Apr 1, 2026Updated 2 weeks ago
- ☆33Nov 11, 2024Updated last year
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- ☆56Jul 7, 2025Updated 9 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 9 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆89Mar 27, 2026Updated 2 weeks ago
- The heart of The Pulsar App: fast, secure and shared inference with a modern UI☆59Dec 1, 2024Updated last year
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- ENRICH: multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetry☆11Mar 13, 2023Updated 3 years ago
- ☆19Feb 23, 2026Updated last month
- ☆39Aug 27, 2024Updated last year
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison☆18Dec 15, 2023Updated 2 years ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 3 months ago
- Code repo for the paper "SpinQuant LLM quantization with learned rotations"☆387Feb 14, 2025Updated last year
- This is the official implementation of the Concept Discovery Models paper.☆15Aug 27, 2023Updated 2 years ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆25Dec 15, 2025Updated 4 months ago
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆18Updated this week
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 5 months ago
- ☆22Dec 1, 2021Updated 4 years ago
- ☆14Dec 6, 2023Updated 2 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy☆15Jul 19, 2021Updated 4 years ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆56Apr 6, 2025Updated last year
- DMax: Aggressive Parallel Decoding for dLLMs☆85Updated this week
- ☆25Oct 31, 2024Updated last year
- ☆53Oct 29, 2024Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 8 months ago
- Remove generated stories with stray unicode characters☆12Jan 3, 2024Updated 2 years ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆90Updated this week
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- ☆31Aug 27, 2024Updated last year
- SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)☆35Nov 28, 2025Updated 4 months ago
- Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)☆19Nov 22, 2024Updated last year