Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆118May 19, 2025Updated 10 months ago
Alternatives and similar repositories for onebitllms
Users that are interested in onebitllms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- Create 3D files in the CLI with Small Language Model☆44Oct 15, 2025Updated 5 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 4 months ago
- BitLinear implementation☆35Jan 1, 2026Updated 2 months ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- ☆53Jul 18, 2024Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 6 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆128Dec 29, 2025Updated 2 months ago
- ModernBERT model optimized for Apple Neural Engine.☆31Jan 10, 2025Updated last year
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆31Mar 3, 2026Updated 3 weeks ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- A pipeline parallel training script for LLMs.☆166Apr 30, 2025Updated 10 months ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction☆26May 22, 2024Updated last year
- ☆10Aug 14, 2023Updated 2 years ago
- ☆16Jul 29, 2025Updated 7 months ago
- ☆21Oct 2, 2024Updated last year
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 8 months ago
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 9 months ago
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Minimalistic large language model 3D-parallelism training☆2,617Feb 19, 2026Updated last month
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated 11 months ago
- Model implementation for the contextual embeddings project☆43Jun 2, 2025Updated 9 months ago
- ☆11Sep 7, 2024Updated last year
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆61Jan 12, 2026Updated 2 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 10 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆338Dec 18, 2025Updated 3 months ago
- ☆10Oct 2, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- rl from zero pretrain, can it be done? yes.☆290Sep 28, 2025Updated 5 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- ☆56Aug 6, 2025Updated 7 months ago
- ☆15Apr 2, 2025Updated 11 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago