Implementation of BitNet-1.58 instruct tuning
☆31Apr 14, 2024Updated 2 years ago
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Apr 8, 2026Updated 2 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆32Aug 4, 2024Updated last year
- Distributed Optimization Infra for learning CLIP models☆31Oct 3, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 9 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆20Apr 5, 2025Updated last year
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- ☆11Jun 14, 2019Updated 7 years ago
- AIME API Server - Scalable AI Model Inference API Server☆15Sep 19, 2025Updated 9 months ago
- Using Demucs in comfyUI, make Music Source Separation☆12Dec 12, 2025Updated 6 months ago
- Realtime Face detection demo using YOLO v2 and OpenCV DNN module☆17Mar 10, 2018Updated 8 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool☆14Nov 4, 2018Updated 7 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- extract chords from an audio file (using ohollo/chord-extractor & Chordino)☆15May 24, 2026Updated last month
- ☆11Dec 9, 2020Updated 5 years ago
- Experiments with BitNet inference on CPU☆56Apr 1, 2024Updated 2 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Jun 23, 2026Updated last week
- Language modeling with linear-cost context☆118Sep 25, 2025Updated 9 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆49Apr 10, 2026Updated 2 months ago
- ☆17Jan 30, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Create Vector Store from Scratch in pure Python.☆13Dec 15, 2023Updated 2 years ago
- Tensorflow implementation of InceptionV3-SSD☆17Jun 20, 2018Updated 8 years ago
- Pack of scripts providing customizable YouTube Music Videos generation.☆12Oct 10, 2023Updated 2 years ago
- ☆37Jul 4, 2025Updated 11 months ago
- System Architecture of an EdTech Platform powered by Deep Learning (NCF) Recommendations system, ETL Data pipelines and GenAI for queries☆20Jun 13, 2026Updated 2 weeks ago
- The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks☆15Mar 31, 2023Updated 3 years ago
- ☆18Jan 7, 2019Updated 7 years ago
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆48Oct 29, 2025Updated 8 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A CUDA implementation of Arithmetic Coding☆18Jan 21, 2025Updated last year
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)☆15May 16, 2024Updated 2 years ago
- Qt GUI for LLM assisted co-writing☆12Jul 28, 2024Updated last year
- ☆17Oct 18, 2022Updated 3 years ago
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆30Mar 1, 2026Updated 4 months ago
- Download all versions of Winamp Here☆23Oct 21, 2018Updated 7 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Oct 30, 2022Updated 3 years ago