Implementation of BitNet-1.58 instruct tuning
☆27Apr 14, 2024Updated last year
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆39Jun 20, 2025Updated 9 months ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Dec 8, 2025Updated 4 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Updated this week
- Agentic Keyframe Search for Video Question Answering☆17Apr 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆30Aug 4, 2024Updated last year
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 6 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- ☆11Jun 14, 2019Updated 6 years ago
- Realtime Face detection demo using YOLO v2 and OpenCV DNN module☆17Mar 10, 2018Updated 8 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Various video readers for PyTorch models training and a benchmark☆12Updated this week
- Here is my implementation of Center Loss with Keras☆11May 2, 2018Updated 7 years ago
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- Language modeling with linear-cost context☆117Sep 25, 2025Updated 6 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆44Apr 4, 2026Updated last week
- ☆17Jan 30, 2024Updated 2 years ago
- Tensorflow implementation of InceptionV3-SSD☆17Jun 20, 2018Updated 7 years ago
- Cross-browser fingerprinting library that generates fingerprint of a device. It is written in JavaScript.☆13Mar 13, 2019Updated 7 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Keras implementation of `Decoupled Neural Interfaces using Synthetic Gradients`☆12Oct 19, 2018Updated 7 years ago
- ☆18Jan 7, 2019Updated 7 years ago
- The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks☆15Mar 31, 2023Updated 3 years ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)☆15May 16, 2024Updated last year
- GPS software using open street maps. Draw tracks, waypoints. Can find actual position.☆11Jun 1, 2011Updated 14 years ago
- [ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆31Mar 1, 2026Updated last month
- 2D laser datasets☆15Jan 4, 2019Updated 7 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G…☆24Jan 4, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆13Mar 18, 2026Updated 3 weeks ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 7 months ago
- ☆17Mar 8, 2023Updated 3 years ago
- Website for ML course at MIPT☆10Sep 6, 2021Updated 4 years ago
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 3 years ago
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆14Oct 24, 2024Updated last year