Experiments with BitNet inference on CPU
☆55Apr 1, 2024Updated last year
Alternatives and similar repositories for bitnet_cpu
Users that are interested in bitnet_cpu are comparing it to the libraries listed below
Sorting:
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago
- ☆14Jan 31, 2023Updated 3 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure JavaScript(HTML)☆36May 20, 2025Updated 10 months ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆98Mar 1, 2024Updated 2 years ago
- ☆22Dec 12, 2024Updated last year
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- ☆580Oct 29, 2024Updated last year
- TensorflowTTS in Tensorflow.js☆18Aug 11, 2021Updated 4 years ago
- qwen2 and llama3 cpp implementation☆49Jun 7, 2024Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆30Mar 9, 2026Updated last week
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 3 weeks ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 7 months ago
- Colby Hall's C++ Standard Library☆11Jan 13, 2020Updated 6 years ago
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 4 months ago
- Hand-Rolled GPU communications library☆87Nov 25, 2025Updated 3 months ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Use AI to edit your documents in real-time. Provide feedback and let the AI do all the work.☆29Jul 24, 2024Updated last year
- An experiment with modern C++, suffix trees, and Ukkonen's algorithm for suffix tree construction.☆12Mar 15, 2019Updated 7 years ago
- ☆16Apr 2, 2025Updated 11 months ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- A simple sample that shows what you need to package an F# app as a flatpak☆10Jul 5, 2023Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- Lowpass FIR filter implemented in C using Portaudio☆12Mar 17, 2020Updated 6 years ago
- Train your own small bitnet model☆78Oct 20, 2024Updated last year
- Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".☆34Oct 27, 2020Updated 5 years ago
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- Express.js ported to a Service Worker context☆18Mar 6, 2025Updated last year
- A collection of all my single-header libraries.☆17Dec 22, 2023Updated 2 years ago
- Algorithms that work on generic C arrays☆11Feb 13, 2017Updated 9 years ago
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 2 years ago
- End to End Machine Learning Pipeline with scikit learn☆12Mar 10, 2021Updated 5 years ago
- Inplace complex-valued ANSI C fast fourier transform☆11Nov 7, 2016Updated 9 years ago
- BFloat16 Fused Adam Operator for PyTorch☆17Nov 16, 2024Updated last year