Implementation of BitNet-1.58 instruct tuning
☆27Apr 14, 2024Updated last year
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below
Sorting:
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆37Jun 20, 2025Updated 9 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆29Aug 4, 2024Updated last year
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- VIP cheatsheets for Stanford's CS 229 Machine Learning☆10May 20, 2020Updated 5 years ago
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 5 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- Squanchy is a brand new, easy to learn, general purpose, multi-paradigm, compiled programming language. The language is written from scra…☆15Dec 7, 2019Updated 6 years ago
- This repository implements the training, testing and evaluation code for the "VQ-NeRV: A Vector Quantised Neural Representation for Video…☆10Feb 19, 2024Updated 2 years ago
- ☆12Jan 9, 2024Updated 2 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Mar 10, 2026Updated last week
- Language modeling with linear-cost context☆117Sep 25, 2025Updated 5 months ago
- Tensorflow implementation of InceptionV3-SSD☆17Jun 20, 2018Updated 7 years ago
- Pack of scripts providing customizable YouTube Music Videos generation.☆12Oct 10, 2023Updated 2 years ago
- Implement reinforcement learning(RL) based on parameterized quantum circuits with quantum computing cloud Quafu.☆11Oct 19, 2023Updated 2 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆46Oct 29, 2025Updated 4 months ago
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)☆15May 16, 2024Updated last year
- Gaussian Splatting for Robotic Simulation☆22Nov 7, 2025Updated 4 months ago
- ☆17Oct 18, 2022Updated 3 years ago
- A fully cuda implementation of DCNv2(deformable convolution) forward. Without dependent of cuTorch(THC).☆10Dec 9, 2019Updated 6 years ago
- GPS software using open street maps. Draw tracks, waypoints. Can find actual position.☆11Jun 1, 2011Updated 14 years ago
- ComfyUI workflows☆11Sep 19, 2024Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 6 months ago
- ⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️☆23Oct 8, 2023Updated 2 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆24Jan 4, 2023Updated 3 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Oct 30, 2022Updated 3 years ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 7 months ago
- Standalone basic request server implementation☆12Sep 1, 2021Updated 4 years ago
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 3 years ago
- ☆28Jan 24, 2017Updated 9 years ago
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆14Oct 24, 2024Updated last year
- Autoware V2X module with Zenoh☆12Mar 2, 2026Updated 2 weeks ago
- Implementation of End-to-End YOLO Models☆10Dec 30, 2025Updated 2 months ago
- ☆17Jan 11, 2023Updated 3 years ago