Oxen-AI/BitNet-1.58-Instruct

Implementation of BitNet-1.58 instruct tuning

☆32

Alternatives and similar repositories for BitNet-1.58-Instruct

Users who are interested in BitNet-1.58-Instruct are comparing it to the repositories listed below.

cg123 / bitnet
Modeling code for a BitNet b1.58 Llama-style model.
☆25 · Apr 30, 2024 · Updated 2 years ago
thu-ml / TetraJet-MXFP4Training
PyTorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" for DeiT model pre-training
☆40 · May 4, 2026 · Updated 2 months ago
amanchadha / stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
☆10 · May 20, 2020 · Updated 6 years ago
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21 · Apr 9, 2025 · Updated last year
see-- / pneumothorax-segmentation
https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation
☆11 · Sep 11, 2019 · Updated 6 years ago
RenzeLou / AAAR-1.0
The source code for running LLMs on the AAAR-1.0 benchmark.
☆20 · Apr 5, 2025 · Updated last year
jiamingkong / rwkv_reward
Training a reward model for RLHF using RWKV.
☆15 · Jun 5, 2023 · Updated 3 years ago
magicffourier / VQ-NeRV
This repository implements the training, testing and evaluation code for the "VQ-NeRV: A Vector Quantised Neural Representation for Video…
☆10 · Feb 19, 2024 · Updated 2 years ago
amazon-science / controllable-readability-summarization
Generating Summaries with Controllable Readability Levels (EMNLP 2023)
☆15 · Jul 2, 2026 · Updated 2 weeks ago
samuelepapa / neural-field-arena
☆12 · Jan 9, 2024 · Updated 2 years ago
usdot-fhwa-stol / carma-streets
CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…
☆11 · Jun 23, 2026 · Updated 3 weeks ago
AIAnytime / Create-Vector-Store-from-Scratch
Create Vector Store from Scratch in pure Python.
☆13 · Dec 15, 2023 · Updated 2 years ago
chris-512 / InceptionV3-SSD
TensorFlow implementation of InceptionV3-SSD
☆17 · Jun 20, 2018 · Updated 8 years ago
williamFalcon / Predicting-floor-level-for-911-Calls-with-Neural-Networks-and-Smartphone-Sensor-Data
Code + data for predicting floor location from smartphone sensor data
☆11 · Mar 16, 2018 · Updated 8 years ago
nmandic78 / AI-VoiceAssistant
A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …
☆19 · Jun 28, 2026 · Updated 3 weeks ago
lukasmatta / cbfjs
Cross-browser fingerprinting library that generates a fingerprint of a device. It is written in JavaScript.
☆13 · Mar 13, 2019 · Updated 7 years ago
hazdzz / tiger
A Tight-fisted Optimizer (Tiger), implemented in PyTorch.
☆12 · Jun 26, 2024 · Updated 2 years ago
SecondShiftEngineer / EthernetIP
This project is based on an archived implementation of Ethernet/IP from CodePlex. I am in the process of testing this.
☆22 · Mar 7, 2018 · Updated 8 years ago
bipashasen / INR-V-VideoGenerationSpace
The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks
☆15 · Mar 31, 2023 · Updated 3 years ago
pavlosdais / ai-berkeley
My solutions to projects 1, 2 & 3 of Berkeley's AI course
☆14 · Mar 3, 2023 · Updated 3 years ago
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆62 · Apr 8, 2024 · Updated 2 years ago
qiuzh20 / RMoE
Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)
☆33 · Aug 4, 2024 · Updated last year
Hon-Wong / ByteVideoLLM
[ICCV 2025] Dynamic-VLM
☆28 · Dec 16, 2024 · Updated last year
belkakari / implicit-kan
Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)
☆15 · May 16, 2024 · Updated 2 years ago
jiahansu / GPUAR
A CUDA implementation of Arithmetic Coding
☆18 · Jan 21, 2025 · Updated last year
ytyz1307zzh / PLUG
Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"
☆13 · Aug 13, 2025 · Updated 11 months ago
PoseLib / posebench
☆13 · Mar 18, 2026 · Updated 4 months ago
kts / gzip-knn
Reimplementation of paper using gzip + knn for text classification
☆18 · Aug 1, 2023 · Updated 2 years ago
ashishpatel26 / Audio-Masking-Methods
Audio Masking Methods
☆12 · Nov 15, 2019 · Updated 6 years ago
levipereira / yolo_e2e
Implementation of End-to-End YOLO Models
☆10 · Dec 30, 2025 · Updated 6 months ago
OpenPerceptionX / Openpilot-Deepdive
Our insights into Openpilot, from a deep-dive project on it
☆10 · Jun 1, 2023 · Updated 3 years ago
nebuly-ai / nebuly-ai
☆17 · Jan 11, 2023 · Updated 3 years ago
orashi / EBGAN_pytorch
Implementation of EBGAN in PyTorch
☆17 · Apr 3, 2017 · Updated 9 years ago
UVA-DSA / openpilot-CARLA
This repository includes a simulation platform for the advanced driver assistance system openpilot and the urban driving simulator CARLA.
☆14 · Jun 24, 2022 · Updated 4 years ago
Aleph-Alpha-Research / trigrams
☆60 · Nov 18, 2025 · Updated 8 months ago
suito555 / bitnet158b
Implementation of BitNet1.58b
☆15 · Jul 9, 2024 · Updated 2 years ago
IntendedConsequence / vadc
Uses the excellent Silero VAD with the onnxruntime C API for fast detection of audio segments with speech
☆16 · Sep 20, 2024 · Updated last year
angelocatalani / kaleidoscope
Kaleidoscope is a toy programming language built from scratch using the LLVM libraries.
☆25 · Jun 23, 2025 · Updated last year
manifesto-ai / core
Semantic layer for deterministic domain state
☆18 · Updated this week