Mayankpratapsingh022 / DeepSeek-from-ScratchLinks
☆51Updated last month
Alternatives and similar repositories for DeepSeek-from-Scratch
Users that are interested in DeepSeek-from-Scratch are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆106Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆267Updated last month
- Verifiers for LLM Reinforcement Learning☆75Updated 3 weeks ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆47Updated 2 months ago
- ☆154Updated 4 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆105Updated last month
- ☆292Updated 3 weeks ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆152Updated 2 weeks ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆170Updated last week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 8 months ago
- ☆66Updated this week
- ☆94Updated 5 months ago
- ☆86Updated 11 months ago
- Fine tune Gemma 3 on an object detection task☆78Updated last month
- ☆128Updated last month
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated 3 months ago
- purpose of this repo is to Implement LLMOPs as shared in Deeplearning AI course☆32Updated last week
- Coding an LLM and its building blocks from scratch.☆89Updated 5 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆410Updated last week
- ☆181Updated 6 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 7 months ago
- ☆44Updated 4 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆255Updated this week
- ☆46Updated 5 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆29Updated last week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆69Updated 5 months ago
- Simple examples using Argilla tools to build AI☆55Updated 9 months ago
- ☆68Updated 3 months ago
- Context Engineering Course with DSPy☆169Updated last month
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆136Updated 2 months ago