Experiments with BitNet inference on CPU
☆57Apr 1, 2024Updated 2 years ago
Alternatives and similar repositories for bitnet_cpu
Users that are interested in bitnet_cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- KWS demo based on CTC prefix beam search.☆18Oct 21, 2023Updated 2 years ago
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- ☆22Dec 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- using microphone☆17Sep 2, 2021Updated 4 years ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆192Apr 19, 2024Updated 2 years ago
- Implementation of BitNet-1.58 instruct tuning☆29Apr 14, 2024Updated 2 years ago
- Minimal ZX Spectrum for Ulx3s ECP5 board☆12May 7, 2020Updated 6 years ago
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 2 months ago
- Went online decode demo☆31Apr 28, 2021Updated 5 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 9 months ago
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A very basic 160 4bpp snes superfx demo☆16Apr 17, 2026Updated last month
- ☆15Jun 30, 2025Updated 10 months ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Crossassembler for changing x86 into 6502 assembly code. Other CPU's like Z80 will be added in the future.☆15Oct 9, 2018Updated 7 years ago
- Scripts to create cartoons of 3D genomes☆12Feb 27, 2024Updated 2 years ago
- AI voicebox on Raspberry Pi☆13Jan 27, 2026Updated 3 months ago
- Use AI to edit your documents in real-time. Provide feedback and let the AI do all the work.☆30Jul 24, 2024Updated last year
- Thin wrapper around GGML to make life easier☆45Nov 5, 2025Updated 6 months ago
- MPEG-TS section (PSI/SI etc.) archiver☆18Jun 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization☆171Nov 26, 2025Updated 5 months ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- ☆10Dec 17, 2022Updated 3 years ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- A simple sample that shows what you need to package an F# app as a flatpak☆10Jul 5, 2023Updated 2 years ago
- Lowpass FIR filter implemented in C using Portaudio☆12Mar 17, 2020Updated 6 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- Train your own small bitnet model☆80Oct 20, 2024Updated last year
- Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".☆34Oct 27, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- ☆18Apr 2, 2025Updated last year
- A graph based approach to type inference written in F#☆22Apr 22, 2026Updated 3 weeks ago
- End to End Machine Learning Pipeline with scikit learn☆12Mar 10, 2021Updated 5 years ago
- Inference slice of marian for bergamot's tiny11 models. Faster to compile, and wield. Fewer model-archs than bergamot-translator.☆14Oct 24, 2024Updated last year
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- ☆12Feb 4, 2024Updated 2 years ago