Implementation of BitNet-1.58 instruct tuning
☆30Apr 14, 2024Updated 2 years ago
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Apr 8, 2026Updated 2 months ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆31Aug 4, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated last year
- ☆11Jun 14, 2019Updated 6 years ago
- AIME API Server - Scalable AI Model Inference API Server☆15Sep 19, 2025Updated 8 months ago
- ☆12Jan 9, 2024Updated 2 years ago
- A python implementation of the ABC sofware metric.☆11Jan 2, 2026Updated 5 months ago
- Various video readers for PyTorch models training and a benchmark☆12Jun 1, 2026Updated last week
- Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool☆14Nov 4, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- Open source multimodal OpenLKA dataset☆18Feb 13, 2026Updated 3 months ago
- ☆11Dec 9, 2020Updated 5 years ago
- Language modeling with linear-cost context☆119Sep 25, 2025Updated 8 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆48Apr 10, 2026Updated 2 months ago
- ☆17Jan 30, 2024Updated 2 years ago
- Create Vector Store from Scratch in pure Python.☆13Dec 15, 2023Updated 2 years ago
- Pack of scripts providing customizable YouTube Music Videos generation.☆12Oct 10, 2023Updated 2 years ago
- Implement reinforcement learning(RL) based on parameterized quantum circuits with quantum computing cloud Quafu.☆11Oct 19, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- System Architecture of an EdTech Platform powered by Deep Learning (NCF) Recommendations system, ETL Data pipelines and GenAI for queries☆20Updated this week
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆62Apr 8, 2024Updated 2 years ago
- Agent Skills for the Zig Programming Language☆37Apr 28, 2026Updated last month
- Custom comfyui https://github.com/comfyanonymous/ComfyUI Nodes for interacting with Ollama https://ollama.com/ using the Instructor http…☆12Aug 20, 2024Updated last year
- Train vector quantized CLIP models using pytorch lightning☆20Jul 14, 2024Updated last year
- A fully cuda implementation of DCNv2(deformable convolution) forward. Without dependent of cuTorch(THC).☆10Dec 9, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Align large amount of images based on words. Great for the "trending" effect.☆28Apr 15, 2026Updated last month
- fan-hosted mirror of sources☆18Sep 2, 2013Updated 12 years ago
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆30Mar 1, 2026Updated 3 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 8 months ago
- Download all versions of Winamp Here☆23Oct 21, 2018Updated 7 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆25Jan 4, 2023Updated 3 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year