Experiments with BitNet inference on CPU
☆56Apr 1, 2024Updated 2 years ago
Alternatives and similar repositories for bitnet_cpu
Users that are interested in bitnet_cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure JavaScript(HTML)☆36May 20, 2025Updated last year
- A C++ implementation of tinyllama inference on CPU.☆17Feb 28, 2024Updated 2 years ago
- The course work repo for UoSurrey EEEM071 (2023 Spring)☆11May 9, 2023Updated 3 years ago
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆10Jul 17, 2023Updated 2 years ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 3 years ago
- ☆22Dec 12, 2024Updated last year
- using microphone☆17Sep 2, 2021Updated 4 years ago
- Llama2 inference in one TypeScript file☆20May 29, 2025Updated last year
- ☆599Oct 29, 2024Updated last year
- qwen2 and llama3 cpp implementation☆50Jun 7, 2024Updated 2 years ago
- Collection of autoregressive model implementation☆85Jun 10, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Colby Hall's C++ Standard Library☆11Jan 13, 2020Updated 6 years ago
- Open sourced result for The Agent Company☆21Updated this week
- 🐧🐦 Generate HTML pages for Twitter statuses.☆14Jul 22, 2018Updated 7 years ago
- This research project aims at studying and finding a suitable method to implement audio bandwidth extension to bandlimited audio files.☆27Jan 24, 2018Updated 8 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- Use AI to edit your documents in real-time. Provide feedback and let the AI do all the work.☆30Jul 24, 2024Updated last year
- ☆32Oct 28, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization☆175Nov 26, 2025Updated 7 months ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- Lowpass FIR filter implemented in C using Portaudio☆12Mar 17, 2020Updated 6 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated 2 years ago
- Multiple Producers / Multiple Consumers Message Passing Pool☆19Feb 20, 2015Updated 11 years ago
- Train your own small bitnet model☆84Oct 20, 2024Updated last year
- A one-stop font atlas generator☆14Jan 27, 2018Updated 8 years ago
- Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".☆34Oct 27, 2020Updated 5 years ago
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- A collection of all my single-header libraries.☆17Dec 22, 2023Updated 2 years ago
- Express.js ported to a Service Worker context☆17Mar 6, 2025Updated last year
- ☆19Apr 2, 2025Updated last year
- Algorithms that work on generic C arrays☆11Feb 13, 2017Updated 9 years ago
- A graph based approach to type inference written in F#☆22Apr 22, 2026Updated 2 months ago
- End to End Machine Learning Pipeline with scikit learn☆12Mar 10, 2021Updated 5 years ago