SamsungSAILMontreal / ByteCraft
☆27Updated 2 weeks ago
Alternatives and similar repositories for ByteCraft:
Users that are interested in ByteCraft are comparing it to the libraries listed below
- Training hybrid models for dummies.☆20Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- Latent Large Language Models☆18Updated 8 months ago
- aesthetic tensor visualiser☆15Updated this week
- ☆19Updated last month
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 3 weeks ago
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated 2 months ago
- ☆37Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- alternative way to calculating self attention☆18Updated 11 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆43Updated 8 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆48Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 5 months ago
- ☆21Updated last month
- ☆11Updated this week
- ☆38Updated 9 months ago
- Alpha-Zero Connect Four NN trained via self play☆16Updated last month
- ☆46Updated 9 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆17Updated last year
- RWKV-7: Surpassing GPT☆83Updated 5 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆18Updated 3 weeks ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆80Updated last month
- Repository to create traveling waves integrate special information through time☆50Updated last month