Implementation of BitNet-1.58 instruct tuning
☆27Apr 14, 2024Updated last year
Alternatives and similar repositories for BitNet-1.58-Instruct
Users that are interested in BitNet-1.58-Instruct are comparing it to the libraries listed below
Sorting:
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 6 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆29Aug 4, 2024Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 10 months ago
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆36Jun 20, 2025Updated 8 months ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Aug 21, 2025Updated 6 months ago
- [ICLR 2026] Official code of "Segment any Events with Language"☆35Feb 7, 2026Updated 3 weeks ago
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 3 years ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- An open source AI health assistant☆48May 29, 2024Updated last year
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- A python implementation of the ABC sofware metric.☆11Jan 2, 2026Updated last month
- Create a Scale-able Full Stack Education Platform with React-Tailwind, MongoDB & Nodejs☆10Nov 23, 2025Updated 3 months ago
- ☆31Updated this week
- Shiftly is an Android app which provides an easy and interactive way for both employers and employees to manage scheduling at work.☆11Sep 24, 2019Updated 6 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos☆32Jan 13, 2026Updated last month
- GBM implementation on Legate☆14Jan 28, 2026Updated last month
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone.☆17Jan 5, 2026Updated last month
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆16Feb 24, 2025Updated last year
- Create Vector Store from Scratch in pure Python.☆14Dec 15, 2023Updated 2 years ago
- ☆11Jul 17, 2023Updated 2 years ago
- GPS software using open street maps. Draw tracks, waypoints. Can find actual position.☆11Jun 1, 2011Updated 14 years ago
- Language modeling with linear-cost context☆116Sep 25, 2025Updated 5 months ago
- extract chords from an audio file (using ohollo/chord-extractor & Chordino)☆12Mar 23, 2025Updated 11 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆35Oct 16, 2025Updated 4 months ago
- ☆12May 23, 2024Updated last year
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- ☆25Oct 13, 2025Updated 4 months ago
- Counterfactual Explanation Based on Gradual Construction for Deep Networks Pytorch☆11Apr 7, 2021Updated 4 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Here is my implementation of Center Loss with Keras☆11May 2, 2018Updated 7 years ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆22Oct 8, 2025Updated 4 months ago
- Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool☆14Nov 4, 2018Updated 7 years ago
- Custom comfyui https://github.com/comfyanonymous/ComfyUI Nodes for interacting with Ollama https://ollama.com/ using the Instructor http…☆12Aug 20, 2024Updated last year
- Array APIs to write ONNX Graphs☆11Jan 18, 2026Updated last month