NotShrirang / tinygpt
A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation trained on whimsical stories.
☆12 Updated last month
Alternatives and similar repositories for tinygpt
Users who are interested in tinygpt are comparing it to the libraries listed below.
- Purpose of this repo is to implement LLMOps as shared in the DeepLearning.AI course ☆35 Updated last week
- ☆57 Updated 8 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆39 Updated 6 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 Updated 9 months ago
- ☆45 Updated 5 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆34 Updated last week
- Simple examples using Argilla tools to build AI ☆56 Updated 11 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 5 months ago
- ☆95 Updated 6 months ago
- ☆86 Updated last year
- ☆54 Updated last month
- LLM reads a paper and produces a working prototype ☆57 Updated 6 months ago
- ☆68 Updated 4 months ago
- ☆40 Updated 10 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆77 Updated 7 months ago
- Dynamic Metadata based RAG Framework ☆75 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 8 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 Updated last year
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆52 Updated 2 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated 5 months ago
- ☆54 Updated last week
- Training setup for Langchain's Open Deep Research ☆65 Updated last month
- Query Expansion for Better Query Embedding using LLMs ☆58 Updated 8 months ago
- ☆19 Updated 7 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆59 Updated this week
- ☆80 Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆20 Updated 10 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆101 Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 Updated last year