Mayankpratapsingh022 / DeepSeek-from-Scratch
☆54 Updated 2 months ago
Alternatives and similar repositories for DeepSeek-from-Scratch
Users who are interested in DeepSeek-from-Scratch are comparing it to the repositories listed below.
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆268 Updated 2 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆434 Updated 3 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆155 Updated last month
- Verifiers for LLM Reinforcement Learning ☆75 Updated 2 weeks ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆129 Updated this week
- ☆95 Updated 6 months ago
- Collection of impressive LLM apps with a focus on the financial sector ☆134 Updated last month
- An Automatic Prompt Optimization Framework for Large Language Models ☆119 Updated last month
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input. ☆575 Updated this week
- ☆155 Updated 5 months ago
- Simple examples using Argilla tools to build AI ☆55 Updated 10 months ago
- ☆182 Updated 7 months ago
- The purpose of this repo is to implement LLMOps as shared in the DeepLearning.AI course ☆33 Updated this week
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆244 Updated 2 weeks ago
- CodeScientist: An automated scientific discovery system for code-based experiments ☆289 Updated 3 months ago
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything. ☆140 Updated 3 months ago
- ☆296 Updated last month
- Implementation of a GPT-4o like Multimodal from Scratch using Python ☆71 Updated 5 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆175 Updated last month
- Coding an LLM and its building blocks from scratch. ☆94 Updated 5 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe … ☆259 Updated last week
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆31 Updated this week
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch ☆166 Updated 3 weeks ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆36 Updated 4 months ago
- Fine tune Gemma 3 on an object detection task ☆84 Updated 2 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol ☆365 Updated 3 weeks ago
- ☆206 Updated 3 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated 9 months ago
- ☆89 Updated 5 months ago
- RunAgent simplifies serverless deployment of your AI agents. With a powerful CLI, multi-language SDK support, built-in agent invocation &… ☆324 Updated this week