NotShrirang / tinygpt
A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation trained on whimsical stories.
☆15 · Updated last week
Alternatives and similar repositories for tinygpt
Users who are interested in tinygpt are comparing it to the repositories listed below.
- Simple examples using Argilla tools to build AI · ☆56 · Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours · ☆66 · Updated last year
- ☆57 · Updated 10 months ago
- ☆101 · Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) · ☆91 · Updated 10 months ago
- ☆55 · Updated 3 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French · ☆40 · Updated last month
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning · ☆37 · Updated 6 months ago
- purpose of this repo is to Implement LLMOPs as shared in Deeplearning AI course · ☆35 · Updated last week
- LLM reads a paper and produce a working prototype · ☆60 · Updated 8 months ago
- ☆45 · Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna · ☆59 · Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… · ☆78 · Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. · ☆40 · Updated 8 months ago
- Learn the building blocks of how to build gpt-oss from scratch · ☆105 · Updated 2 months ago
- ☆86 · Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? · ☆84 · Updated 8 months ago
- Tutorial for DSPy · ☆25 · Updated last year
- ☆40 · Updated 11 months ago
- Testing paligemma2 finetuning on reasoning dataset · ☆18 · Updated 11 months ago
- ☆68 · Updated 6 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset · ☆30 · Updated 8 months ago
- ☆67 · Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models · ☆115 · Updated 8 months ago
- ☆101 · Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… · ☆24 · Updated last year
- This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview · ☆97 · Updated last year
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i… · ☆19 · Updated 10 months ago
- Very minimal (and stateless) agent framework · ☆44 · Updated 10 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" · ☆100 · Updated 3 months ago