Lightweight toolkit package to train and fine-tune 1.58-bit language models
★125 · May 19, 2025 · Updated 10 months ago
Alternatives and similar repositories for onebitllms
Users who are interested in onebitllms are comparing it to the libraries listed below.
- Starbucks: Improved Training for 2D Matryoshka Embeddings ★23 · Jun 30, 2025 · Updated 9 months ago
- A little course on Reinforcement Learning Environments for evaluating and training Language Models ★67 · Updated this week
- Create 3D files in the CLI with Small Language Model ★44 · Oct 15, 2025 · Updated 5 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ★35 · Nov 21, 2025 · Updated 4 months ago
- BitLinear implementation ★35 · Jan 1, 2026 · Updated 3 months ago
- FlexiTokens ★19 · Dec 27, 2025 · Updated 3 months ago
- Latent Large Language Models ★19 · Aug 24, 2024 · Updated last year
- Personal voice assistant, with voice interruption and Twilio support ★18 · Feb 24, 2025 · Updated last year
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ★10 · Aug 7, 2024 · Updated last year
- ★53 · Jul 18, 2024 · Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ★35 · Sep 20, 2025 · Updated 6 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers ★130 · Dec 29, 2025 · Updated 3 months ago
- All information and news with respect to Falcon-H1 series ★114 · Oct 9, 2025 · Updated 6 months ago
- Implementation of BitNet-1.58 instruct tuning ★27 · Apr 14, 2024 · Updated 2 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan… ★31 · Mar 22, 2026 · Updated 3 weeks ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ★155 · Oct 15, 2024 · Updated last year
- Quantize transformers to any learned arbitrary 4-bit numeric format ★52 · Apr 8, 2026 · Updated last week
- A pipeline parallel training script for LLMs. ★166 · Apr 30, 2025 · Updated 11 months ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction ★26 · May 22, 2024 · Updated last year
- ★10 · Aug 14, 2023 · Updated 2 years ago
- ★21 · Oct 2, 2024 · Updated last year
- My NER Experiments with ModernBERT and Ettin ★27 · Jul 17, 2025 · Updated 8 months ago
- Build LLM Application with Local Documents ★19 · Jun 13, 2025 · Updated 10 months ago
- Open source RAG with Llama Index for Japanese LLM in low resource setting ★10 · May 12, 2025 · Updated 11 months ago
- A RAG that can scale ★11 · May 28, 2024 · Updated last year
- Minimalistic large language model 3D-parallelism training ★2,644 · Apr 7, 2026 · Updated last week
- A fully autonomous agent that accesses the browser and performs tasks. ★18 · Apr 25, 2025 · Updated 11 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ★14 · Mar 20, 2024 · Updated 2 years ago
- ★12 · Sep 7, 2024 · Updated last year
- Exploring limitations of LLM-as-a-judge ★20 · Aug 17, 2024 · Updated last year
- Model implementation for the contextual embeddings project ★47 · Jun 2, 2025 · Updated 10 months ago
- A real-time shared memory layer for multi-agent LLM systems. ★61 · Jan 12, 2026 · Updated 3 months ago
- The Fastest Way to Fine-Tune LLMs Locally ★339 · Dec 18, 2025 · Updated 3 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ★18 · May 23, 2025 · Updated 10 months ago
- ★10 · Oct 2, 2024 · Updated last year
- Simple examples using Argilla tools to build AI ★57 · Nov 18, 2024 · Updated last year
- ★10 · Oct 20, 2023 · Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp ★12 · Feb 8, 2025 · Updated last year
- Official implementation of UnifiedReward & UnifiedReward-Think ★18 · Jun 18, 2025 · Updated 9 months ago