☆82Nov 11, 2024Updated last year
Alternatives and similar repositories for LoQT
Users that are interested in LoQT are comparing it to the libraries listed below
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆106Jul 1, 2024Updated last year
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆16Apr 24, 2025Updated 10 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 2 months ago
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Aug 6, 2025Updated 6 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- The heart of The Pulsar App: fast, secure and shared inference with a modern UI☆59Dec 1, 2024Updated last year
- Python package for Geometric / Clifford Algebra with PyTorch.☆14Jan 25, 2026Updated last month
- ☆30Aug 27, 2024Updated last year
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Feb 21, 2026Updated last week
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆23Dec 15, 2025Updated 2 months ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 7 months ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization☆16Sep 17, 2025Updated 5 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated last month
- ENRICH: multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetry☆11Mar 13, 2023Updated 2 years ago
- ☆17Apr 22, 2024Updated last year
- ☆13Feb 28, 2024Updated 2 years ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- ☆55Jul 7, 2025Updated 7 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- ☆32Nov 11, 2024Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated 11 months ago
- ☆15Nov 7, 2024Updated last year
- ☆13Jan 15, 2025Updated last year
- ☆16Dec 18, 2023Updated 2 years ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- ☆24Jun 18, 2025Updated 8 months ago
- ☆16Jan 2, 2024Updated 2 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- ☆14Dec 6, 2023Updated 2 years ago
- Your universal AI text processor, powered by local and cloud LLMs. Edit, refactor, and transform text in any application on Windows, macO…☆72Nov 9, 2025Updated 3 months ago
- Code repo for the paper "SpinQuant: LLM quantization with learned rotations"☆373Feb 14, 2025Updated last year
- ☆18Apr 18, 2025Updated 10 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆79Jul 29, 2025Updated 7 months ago
- ☆20Oct 13, 2024Updated last year
- ☆52Nov 5, 2024Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Jul 10, 2025Updated 7 months ago
- SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)☆32Nov 28, 2025Updated 3 months ago