Mayankpratapsingh022 / DeepSeek-from-ScratchLinks
☆53Updated 3 months ago
Alternatives and similar repositories for DeepSeek-from-Scratch
Users that are interested in DeepSeek-from-Scratch are comparing it to the libraries listed below
Sorting:
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆77Updated last month
- Collection of impressive LLM apps with a focus on the financial sector☆140Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- ☆182Updated 8 months ago
- ☆300Updated 3 months ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆204Updated 2 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆264Updated 3 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆163Updated 2 months ago
- Context Engineering Course with DSPy☆200Updated 3 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆136Updated 3 months ago
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆83Updated last month
- Salesforce Enterprise Deep Research☆732Updated last week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆386Updated last month
- Code base of 'PAPER2WEB: LET’S MAKE YOUR PAPER ALIVE!' integrating the other work package throughout the entire "paper2present" process…☆186Updated 2 weeks ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆367Updated 2 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆134Updated last month
- ☆211Updated 5 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆73Updated 2 months ago
- An interface library for RL post training with environments.☆628Updated this week
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆182Updated this week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 5 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆38Updated 3 weeks ago
- ☆96Updated 7 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 9 months ago
- ☆89Updated 7 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆263Updated last month
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆73Updated 7 months ago
- ☆86Updated last year
- Train Large Language Models on MLX.☆205Updated last month