Train your own small bitnet model
☆79Oct 20, 2024Updated last year
Alternatives and similar repositories for tinyllama-bitnet
Users that are interested in tinyllama-bitnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BitLinear implementation☆35Jan 1, 2026Updated 4 months ago
- Experimental BitNet Implementation☆74Nov 27, 2025Updated 5 months ago
- Milk-V Duo. Access to Internet throw USB RNDIS connection to host machine☆16Jan 11, 2024Updated 2 years ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 6 months ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 3 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆13May 30, 2025Updated 11 months ago
- ☆16Dec 16, 2024Updated last year
- Binius circuits web demos☆14Dec 15, 2024Updated last year
- Inference Llama 2 in one file of pure JavaScript(HTML)☆36May 20, 2025Updated 11 months ago
- ☆13Feb 17, 2025Updated last year
- Port of Facebook's LLaMA model in C/C++☆13Mar 19, 2023Updated 3 years ago
- ☆58Mar 30, 2026Updated last month
- Cute layout visualization☆38Jan 18, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16May 27, 2025Updated 11 months ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆1,924Apr 27, 2026Updated last week
- ☆32Jun 29, 2025Updated 10 months ago
- new optimizer☆20Aug 4, 2024Updated last year
- A Python script that saves your CrewAI agents crew output to a Notion Database☆15Feb 17, 2024Updated 2 years ago
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- Some preliminary explorations of Mamba's context scaling.☆13Dec 18, 2024Updated last year
- ☆21Feb 5, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 7 months ago
- MiniLM (BERT) embeddings from scratch☆20Aug 14, 2025Updated 8 months ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- An recognition oriented deep learning framework for biometric sample quality assessment☆12Aug 24, 2023Updated 2 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- GraphRag vs Embeddings☆16Jul 14, 2024Updated last year
- ☆14Dec 3, 2023Updated 2 years ago
- Giza Platform CLI☆19Sep 10, 2024Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆65Sep 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated 2 years ago
- ☆11Jun 14, 2019Updated 6 years ago
- Smart contracts, tools, and skills for AI agents that transact on Starknet☆79Updated this week
- ☆17Aug 7, 2024Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 5 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆35Aug 14, 2024Updated last year