Train your own small bitnet model
☆80Oct 20, 2024Updated last year
Alternatives and similar repositories for tinyllama-bitnet
Users that are interested in tinyllama-bitnet are comparing it to the libraries listed below.
- BitLinear implementation☆36May 4, 2026Updated 3 weeks ago
- Milk-V Duo. Access to the Internet through a USB RNDIS connection to the host machine☆16Jan 11, 2024Updated 2 years ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 7 months ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated 2 years ago
- ☆32Mar 30, 2023Updated 3 years ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 3 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆13May 30, 2025Updated 11 months ago
- 1.58-bit LLaMa model☆83Apr 3, 2024Updated 2 years ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- JavaScript implementation of 9p with JSON wire protocol.☆14Jul 25, 2025Updated 10 months ago
- Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and…☆18Oct 11, 2025Updated 7 months ago
- ☆16Dec 16, 2024Updated last year
- Inference Llama 2 in one file of pure JavaScript(HTML)☆36May 20, 2025Updated last year
- ☆13Feb 17, 2025Updated last year
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆1,933Apr 27, 2026Updated 3 weeks ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆167Aug 11, 2025Updated 9 months ago
- ☆33Jun 29, 2025Updated 10 months ago
- The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"☆14Aug 30, 2024Updated last year
- ALC project 2. Creating a "mini-netflix"☆10Jan 7, 2023Updated 3 years ago
- new optimizer☆20Aug 4, 2024Updated last year
- A Python script that saves your CrewAI agents crew output to a Notion Database☆15Feb 17, 2024Updated 2 years ago
- Minimal implementation of VCRec (2024) for collapse prevention.☆18Jan 28, 2025Updated last year
- ☆30Feb 27, 2024Updated 2 years ago
- ☆16Aug 1, 2024Updated last year
- Mainly documents the complete er-nerf deployment process, from zero to one☆44Aug 28, 2024Updated last year
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- A simple updated colab doc that will allow you to run the Ooba Booga Text-Generation-Webui for free with just a few lines of code.☆24Sep 30, 2024Updated last year
- GraphRag vs Embeddings☆16Jul 14, 2024Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated 2 years ago
- Generates control vectors for use with llama.cpp in GGUF format.☆41Mar 19, 2025Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 5 months ago
- maestro-compatible e2e test runner for React Native☆93May 17, 2026Updated last week
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- Let AI agents run research experiments + train small language models (in your browser!)☆61Mar 16, 2026Updated 2 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆60Feb 25, 2026Updated 3 months ago
- Stop messing around with finicky sampling parameters and just use DRµGS!☆364Jun 1, 2024Updated last year