tiiuae/onebitllms

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tiiuae/onebitllms)

tiiuae / onebitllms

Lightweight toolkit package to train and fine-tune 1.58bit Language models

☆146

Alternatives and similar repositories for onebitllms

Users that are interested in onebitllms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ngxson / ggml-easy
View on GitHub
Thin wrapper around GGML to make life easier
☆48Updated this week
tiiuae / Falcon-H1
View on GitHub
All information and news with respect to Falcon-H1 series
☆122Oct 9, 2025Updated 9 months ago
webis-de / rank-distillm
View on GitHub
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking
☆25Apr 4, 2025Updated last year
hotchpotch / yasem
View on GitHub
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
☆13May 22, 2025Updated last year
callbacked / vela
View on GitHub
An LLM Client for the PS Vita
☆13Jun 23, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ModelCloud / Evalution
View on GitHub
Evalution: evolve your LLMs with better evals.
☆16Updated this week
schneiderkamplab / bitlinear
View on GitHub
BitLinear implementation
☆38Jul 8, 2026Updated 3 weeks ago
ielab / Starbucks
View on GitHub
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆25Jun 30, 2025Updated last year
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
C0-Design / MemoryFormer
View on GitHub
An implementation is provided here for the NeurIPS2024 paper "MemoryFormer : Minimize Transformer Computation by Removing Fully-Connected…
☆16Mar 24, 2026Updated 4 months ago
IST-DASLab / gptq-gguf-toolkit
View on GitHub
Efficient non-uniform quantization with GPTQ for GGUF
☆64Sep 17, 2025Updated 10 months ago
thooton / aspen
View on GitHub
Personal voice assistant, with voice interruption and Twilio support
☆18Feb 24, 2025Updated last year
recombee / CompresSAE
View on GitHub
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆39Nov 21, 2025Updated 8 months ago
Knowledgator / FlashDeBERTa
View on GitHub
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆90Feb 10, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
chanind / linear-relational
View on GitHub
Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch
☆11Aug 7, 2024Updated last year
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,768May 26, 2026Updated 2 months ago
SDLAML / disco
View on GitHub
☆16Dec 11, 2025Updated 7 months ago
EPFLiGHT / FullyOpenMeditron
View on GitHub
We release Open Meditron, a fully open, clinician-audited medical training corpus and evaluation protocol that closes the open-vs-closed …
☆15May 15, 2026Updated 2 months ago
MinishLab / tokenlearn
View on GitHub
Pre-train Static Word Embeddings
☆109Jun 9, 2026Updated last month
dropbox / hqq
View on GitHub
Official implementation of Half-Quadratic Quantization (HQQ)
☆948Feb 26, 2026Updated 5 months ago
jfkback / hypencoder-paper
View on GitHub
Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"
☆41Sep 20, 2025Updated 10 months ago
frankxwang / dpo-prefix-sharing
View on GitHub
DPO, but faster 🚀
☆52Dec 6, 2024Updated last year
Pleias / Pleias-RAG-Library
View on GitHub
Python library to use Pleias-RAG models
☆72Jul 1, 2026Updated 3 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
IST-DASLab / MicroAdam
View on GitHub
This repository contains code for the MicroAdam paper.
☆21Dec 14, 2024Updated last year
arcee-ai / DistillKit
View on GitHub
An Open Source Toolkit For LLM Distillation
☆992May 12, 2026Updated 2 months ago
samchaineau / llm_slerp_generation
View on GitHub
Repo hosting codes and materials related to speeding LLMs' inference using token merging.
☆37Oct 9, 2025Updated 9 months ago
tdrussell / qlora-pipe
View on GitHub
A pipeline parallel training script for LLMs.
☆167Apr 30, 2025Updated last year
astramind-ai / BitMat
View on GitHub
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆155Oct 15, 2024Updated last year
benchen4395 / KuaiSearch
View on GitHub
The codebase and database of KuaiSearch: A Large-Scale E-Commerce Search Dataset for Recall, Ranking, and Relevance
☆24Updated this week
google-ai-edge / ai-edge-quantizer
View on GitHub
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆185Updated this week
feifeibear / DPSKV3MFU
View on GitHub
Estimate MFU for DeepSeekV3
☆26Jan 5, 2025Updated last year
urchade / EnriCo
View on GitHub
EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction
☆26May 22, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
reka-ai / rekaquant
View on GitHub
☆63Jul 10, 2025Updated last year
tliby / UniFork
View on GitHub
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆48Aug 26, 2025Updated 11 months ago
warner-benjamin / optimi
View on GitHub
Fast, Modern, and Low Precision PyTorch Optimizers
☆129May 16, 2026Updated 2 months ago
VITA-Group / Q-GaLore
View on GitHub
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆206Jul 17, 2024Updated 2 years ago
the-seeds / cardinal
View on GitHub
Build LLM Application with Local Documents
☆20Jun 13, 2025Updated last year
vast-ai / vast-pyworker
View on GitHub
☆12May 20, 2025Updated last year
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150Feb 25, 2026Updated 5 months ago