rafacelente/bllama

rafacelente / bllama

1.58-bit LLaMa model

☆84

Alternatives and similar repositories for bllama

Users who are interested in bllama are comparing it to the repositories listed below.

anthonymartin / RKDO-recursive-kl-divergence-optimization
☆16 · Jun 4, 2025 · Updated last year
kotak-ai / 1.58BitNet
Experimental BitNet Implementation
☆73 · Nov 27, 2025 · Updated 7 months ago
mgerstgrasser / tacheles
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
☆28 · Jun 7, 2024 · Updated 2 years ago
znfgnu / easy-agent
Simple agent framework using Ollama tool calling
☆10 · Aug 27, 2024 · Updated last year
astramind-ai / BitMat
An efficient implementation of the method proposed in "The Era of 1-bit LLMs"
☆154 · Oct 15, 2024 · Updated last year
iboB / git-lfs-download
Download full or partial git-lfs repos without temporarily using 2x disk space
☆32 · Oct 13, 2023 · Updated 2 years ago
SaschaHeyer / Real-Time-Deep-Learning-Vector-Similarity-Search
☆12 · Feb 23, 2023 · Updated 3 years ago
myrakrusemark / llm-gpt4-browser
☆18 · Feb 22, 2024 · Updated 2 years ago
dalnefre / kernel_abe
John Shutt's "Kernel" language implemented on ABE (C) runtime.
☆13 · Sep 3, 2018 · Updated 7 years ago
broskicodes / slms
Experimenting with small language models
☆75 · Jan 16, 2024 · Updated 2 years ago
garrisonhess / llama2.c
Inference Llama 2 in one file of pure C
☆14 · Jul 24, 2023 · Updated 2 years ago
Cornell-RelaxML / qtip
☆180 · Jun 22, 2025 · Updated last year
rombodawg / Easy_training
☆51 · Feb 19, 2025 · Updated last year
GreenBitAI / low_bit_llama
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
☆110 · Jan 11, 2024 · Updated 2 years ago
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆85 · Oct 20, 2024 · Updated last year
thad0ctor / KrunchWrapper
☆18 · Jul 1, 2025 · Updated last year
Beomi / BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…
☆316 · Mar 17, 2024 · Updated 2 years ago
Cornell-RelaxML / quip-sharp
☆600 · Oct 29, 2024 · Updated last year
nyunAI / PruneGPT
☆50 · May 31, 2024 · Updated 2 years ago
Oxen-AI / Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
☆135 · Nov 16, 2024 · Updated last year
kleinlee / MiniQwen
☆14 · Dec 6, 2023 · Updated 2 years ago
NJUNLP / MoE-LPR
☆22 · Dec 11, 2024 · Updated last year
kyegomez / BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
☆1,938 · Updated this week
rawsh / mirrorllm
various experiments for scaling inference time compute with small reasoning models
☆17 · Jan 16, 2025 · Updated last year
idoh / fast_mamba.np
A pure and fast NumPy implementation of Mamba with cache support.
☆18 · Jun 16, 2024 · Updated 2 years ago
OpenGVLab / EfficientQAT
[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
☆342 · Apr 10, 2026 · Updated 3 months ago
vsraptor / aide
LLM shell and document interrogator
☆14 · Jul 24, 2023 · Updated 2 years ago
lernapparat / torchhacks
Hacks for PyTorch
☆19 · Apr 18, 2023 · Updated 3 years ago
RichardKelley / hflm
A simple library for working with Hugging Face models.
☆14 · Dec 30, 2024 · Updated last year
raven38 / image_edit
Demos of neural image editing
☆11 · Mar 15, 2021 · Updated 5 years ago
PlugOvr-ai / PlugOvr
AI Assistant
☆21 · Feb 21, 2026 · Updated 5 months ago
turboderp-org / exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,586 · Mar 4, 2026 · Updated 4 months ago
Hajime-Y / BitNet-b158
☆20 · Apr 29, 2024 · Updated 2 years ago
mathlingua / mathlore-content
A repository of mathematical knowledge written in the Mathlingua language.
☆17 · Nov 21, 2024 · Updated last year
firstbatchxyz / function-calling-eval
The DPAB-α Benchmark
☆32 · Jan 15, 2025 · Updated last year
shinomakoi / AI-Messenger
A QT GUI for large language models
☆40 · Dec 27, 2023 · Updated 2 years ago
ChiScraper / ChiScraper
Your personal ArXiv Feed
☆23 · Dec 18, 2024 · Updated last year
sandipchitale / kubernetes-file-system-explorer
Kubernetes Pod File System Explorer
☆12 · Feb 12, 2024 · Updated 2 years ago
SJTU-IPADS / Bamboo
Bamboo-7B Large Language Model
☆95 · Mar 28, 2024 · Updated 2 years ago