NotShrirang / tinygptLinks
π A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation trained on whimsical stories.
β15Updated last month
Alternatives and similar repositories for tinygpt
Users that are interested in tinygpt are comparing it to the libraries listed below
Sorting:
- Simple examples using Argilla tools to build AIβ57Updated last year
- β55Updated 4 months ago
- β57Updated 11 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β24Updated 2 years ago
- Testing paligemma2 finetuning on reasoning datasetβ18Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ59Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hoursβ66Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.β41Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β82Updated last year
- β73Updated 6 months ago
- β104Updated 9 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220kβ14Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)β91Updated 11 months ago
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex.β101Updated last year
- β87Updated last year
- An agent to generate stunning images :)β23Updated 7 months ago
- β24Updated 6 months ago
- β39Updated last year
- LLM reads a paper and produce a working prototypeβ60Updated 9 months ago
- Dynamic Metadata based RAG Frameworkβ78Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMsβ96Updated 8 months ago
- entropix style sampling + GUIβ27Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the projectβ41Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for Frenchβ41Updated 3 months ago
- β101Updated last year
- β62Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?β88Updated 10 months ago
- β45Updated 8 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuningβ37Updated 8 months ago
- β26Updated last year