kurhula / microsoft_BitNetLinks
Official inference framework for 1-bit LLMs
☆17Updated last year
Alternatives and similar repositories for microsoft_BitNet
Users that are interested in microsoft_BitNet are comparing it to the libraries listed below
Sorting:
- ☆11Updated 2 years ago
- Use safetensors with ONNX 🤗☆84Updated 3 weeks ago
- A converter and basic tester for rwkv onnx☆43Updated 2 years ago
- ☆22Updated 2 years ago
- qwen2 and llama3 cpp implementation☆49Updated last year
- Self-trained Large Language Models based on Meta LLaMa☆30Updated 2 years ago
- Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.☆61Updated 2 months ago
- xllamacpp - a Python wrapper of llama.cpp☆72Updated last week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆35Updated 3 years ago
- ☆14Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆15Updated 3 years ago
- ☆125Updated 2 years ago
- Inference Llama 2 in one file of pure C++☆87Updated 2 years ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆42Updated 7 months ago
- llama.cpp fork used by GPT4All☆55Updated 11 months ago
- run chatglm3-6b in BM1684X☆39Updated last year
- 🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022☆19Updated last year
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆50Updated last year
- BlinkDL's RWKV-v4 running in the browser☆48Updated 2 years ago
- Probabilistic question-asking system: the program asks, the users answer. The minimal goal of the program is to identify what the user ne…☆73Updated 3 years ago
- 基于RWKV模 型的角色扮演,实际上是个改的妈都不认识的 RWKV_Role_Playing☆17Updated 2 years ago
- C++ version of ailia models repository☆24Updated last month
- An open source embedding vector similarity search engine powered by Faiss, NMSLIB and Annoy☆22Updated 2 years ago
- 基于WebSocket协议实现实时弹幕信息爬取与信息通信。通过MaxKB容器训练直播互动模型,具备智能互动能力,通过微调预训练的语言模型来适应特定的直播场景需求,提升数字人的交互体验。基于TTS和Wav2lip开发语音克隆和唇形同步算法,通过预训练数字人模型的方式压缩生成时…☆13Updated last year
- Recording models☆12Updated 2 years ago
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 5 years ago
- A chat UI for Llama.cpp☆15Updated 2 months ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Updated 3 years ago
- minichatgpt - To Train ChatGPT In 5 Minutes☆169Updated 2 years ago