Implementation of BitNet-1.58 instruct tuning
☆27Apr 14, 2024Updated 2 years ago
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆39Jun 20, 2025Updated 10 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Apr 8, 2026Updated 3 weeks ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆30Aug 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Distributed Optimization Infra for learning CLIP models☆30Oct 3, 2024Updated last year
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 7 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated 10 months ago
- AIME API Server - Scalable AI Model Inference API Server☆15Sep 19, 2025Updated 7 months ago
- An Earley parser in C#☆10Sep 18, 2010Updated 15 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- ☆12Jan 9, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Various video readers for PyTorch models training and a benchmark☆12Updated this week
- Here is my implementation of Center Loss with Keras☆11May 2, 2018Updated 7 years ago
- This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm. It contains the code to reproduce the results…☆14Jun 19, 2023Updated 2 years ago
- extract chords from an audio file (using ohollo/chord-extractor & Chordino)☆13Mar 30, 2026Updated last month
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Apr 9, 2026Updated 3 weeks ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆48Apr 10, 2026Updated 3 weeks ago
- ☆17Jan 30, 2024Updated 2 years ago
- An alternative implementation of the SQLite database engine using C#☆14Oct 23, 2009Updated 16 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tensorflow implementation of InceptionV3-SSD☆17Jun 20, 2018Updated 7 years ago
- Implement reinforcement learning(RL) based on parameterized quantum circuits with quantum computing cloud Quafu.☆11Oct 19, 2023Updated 2 years ago
- ☆18Jan 7, 2019Updated 7 years ago
- The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks☆15Mar 31, 2023Updated 3 years ago
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆47Oct 29, 2025Updated 6 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- A CUDA implementation of Arithmetic Coding☆18Jan 21, 2025Updated last year
- ☆17Oct 18, 2022Updated 3 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python library written in Rust for creating/transporting/parsing AI characters between different frontends (TavernAI, SillyTavern, TextGe…☆21Nov 14, 2025Updated 5 months ago
- Balloon Tower Defense 5 (Browser Game, 2011, Ninja Kiwi) Playable Clone written in Python. In the game, players attempt to prevent balloo…☆12Dec 10, 2022Updated 3 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆25Jan 4, 2023Updated 3 years ago
- Standalone basic request server implementation☆12Sep 1, 2021Updated 4 years ago
- Website for ML course at MIPT☆10Sep 6, 2021Updated 4 years ago
- Reimplentation of paper using gzip + knn for text classification☆18Aug 1, 2023Updated 2 years ago
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 3 years ago