Open-Superintelligence-Lab / blueberry-llm
☆41 Updated last week
Alternatives and similar repositories for blueberry-llm
Users who are interested in blueberry-llm are comparing it to the libraries listed below.
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 6 months ago
- ☆25 Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated last year
- ☆180 Updated 3 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆37 Updated last month
- Learn the building blocks of how to build gpt-oss from scratch ☆104 Updated last month
- One click templates for inferencing Language Models ☆218 Updated 3 months ago
- Verifiers for LLM Reinforcement Learning ☆78 Updated 2 months ago
- ☆55 Updated 4 months ago
- V is an AI Personal Trainer, built with NVIDIA and LangChain tools. ☆14 Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆275 Updated 4 months ago
- ☆86 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 3 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆478 Updated 2 months ago
- ☆68 Updated 5 months ago
- An LLM cookbook, for building your own from scratch, all the way from gathering data to training a model ☆164 Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆165 Updated 2 months ago
- ☆104 Updated 4 months ago
- ☆46 Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆82 Updated 8 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit ☆41 Updated last month
- Hub for researchers exploring VLMs and Multimodal Learning:) ☆57 Updated this week
- RAG example using DSPy, Gradio, FastAPI ☆86 Updated last year
- Improving AI Systems with Self-Defense Mechanisms ☆22 Updated 8 months ago
- ☆36 Updated 3 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https… ☆209 Updated 3 weeks ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python ☆73 Updated 7 months ago
- Various installation guides for Large Language Models ☆76 Updated 6 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆113 Updated last year
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆16 Updated 7 months ago