SamsungSAILMontreal / ByteCraftLinks
☆30Updated 2 months ago
Alternatives and similar repositories for ByteCraft
Users that are interested in ByteCraft are comparing it to the libraries listed below
Sorting:
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 11 months ago
- Training hybrid models for dummies.☆23Updated 5 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆95Updated 3 months ago
- GoldFinch and other hybrid transformer components☆10Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆53Updated 3 weeks ago
- ☆21Updated 3 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Rust bindings for CTranslate2☆14Updated 2 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 8 months ago
- Latent Large Language Models☆18Updated 10 months ago
- Very minimal (and stateless) agent framework☆44Updated 5 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆29Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 7 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆20Updated 2 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ☆63Updated last month
- ☆38Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 7 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆36Updated 2 months ago
- AirLLM 70B inference with single 4GB GPU☆13Updated last week
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- 👷 Build compute kernels☆68Updated this week
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆44Updated last month
- ☆48Updated 11 months ago
- Simple high-throughput inference library☆119Updated last month
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 10 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆27Updated last month
- Lego for GRPO☆28Updated 3 weeks ago