Open-Superintelligence-Lab / blueberry-llmLinks
☆25Updated 2 weeks ago
Alternatives and similar repositories for blueberry-llm
Users that are interested in blueberry-llm are comparing it to the libraries listed below
Sorting:
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 4 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆37Updated last week
- ☆177Updated 2 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆159Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- ☆46Updated 6 months ago
- ☆53Updated 2 months ago
- Marketplace ML experiment - training without backprop☆25Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆125Updated last month
- ☆124Updated 9 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆177Updated 2 months ago
- ☆68Updated 4 months ago
- ☆24Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- V is an AI Personal Trainer, built with NVIDIA and LangChain tools.☆14Updated last year
- ☆155Updated 5 months ago
- Verifiers for LLM Reinforcement Learning☆75Updated last month
- Train your own SOTA deductive reasoning model☆107Updated 7 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 8 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆254Updated last month
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆24Updated 3 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆37Updated 10 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- working implimention of deepseek MLA☆44Updated 9 months ago
- ☆86Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- ☆86Updated last week
- ☆119Updated last year