Fine-tune LLMs with GRPO algorithm tutorial
☆14Mar 4, 2025Updated last year
Alternatives and similar repositories for Fine-tune-LLMS-with-grpo
Users that are interested in Fine-tune-LLMS-with-grpo are comparing it to the libraries listed below
Sorting:
- ☆19Jul 19, 2024Updated last year
- Best Movie App with Ionic 4 using The Movie DB API☆16May 24, 2019Updated 6 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- UE4_TPE (Thrid Person Exercise) with Darksouls☆11Jul 4, 2019Updated 6 years ago
- This project integrates VideoSDK, OpenAI Realtime APIs to create an AI Translator Agent. Below are the setup instructions.☆14Dec 20, 2025Updated 2 months ago
- Here is a collection of cool applications that I've built with AssemblyAI☆35Aug 20, 2024Updated last year
- ☆13Apr 23, 2025Updated 10 months ago
- ☆12Mar 3, 2025Updated last year
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated last year
- ☆11Nov 11, 2022Updated 3 years ago
- Hugo Boilerplate with Laravel Mix + Tailwind CSS☆11May 1, 2019Updated 6 years ago
- ☆12Apr 24, 2024Updated last year
- ☆12Jul 28, 2024Updated last year
- LiveKit + Python AI voice agent☆10Feb 21, 2025Updated last year
- ☆10Oct 14, 2020Updated 5 years ago
- Open source markdown editor☆11Sep 4, 2020Updated 5 years ago
- ☆13Jan 14, 2025Updated last year
- ☆20Sep 5, 2025Updated 6 months ago
- ☆12Jan 19, 2024Updated 2 years ago
- Actor based version of Three is a Crowd☆12Jun 24, 2016Updated 9 years ago
- A local browser automation agent based on Microsoft Fara-7B model optimized for LM Studio inference.☆26Nov 25, 2025Updated 3 months ago
- Use Gemma3:4b model on Ollama to make a fully functional streamlit OCR App using Vibe Coding with Cursor Code Editor☆17Mar 17, 2025Updated 11 months ago
- Crops the Steam Desktop overlay in SteamVR to the content of the primary display. This probably isn't useful anymore unless when forcing …☆11Jun 8, 2019Updated 6 years ago
- CORS Proxy which can take multiple url requests at a time☆10Sep 26, 2021Updated 4 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- 📄 Nano JSX Template using Isomorphic JSX.☆13Oct 7, 2022Updated 3 years ago
- DSPy Experiments☆10Aug 28, 2025Updated 6 months ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- ☆12May 20, 2025Updated 9 months ago
- Code for ACL2018 paper "Learn How to Actively Learn: An Imitation Learning Approach"☆10Mar 8, 2019Updated 7 years ago
- All the content of my youtube channel : https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated 9 months ago
- Agentic Artifacts is a tool for generating and managing CodeSandbox artifacts using AI. It leverages the power of Ai to create React comp…☆13Jun 24, 2024Updated last year
- Managed Service Extensibility Framework☆11Jan 4, 2020Updated 6 years ago
- PeerPlayer plays various video/audio files from .torrent file.☆14Nov 8, 2016Updated 9 years ago
- ☆13Jan 8, 2021Updated 5 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- Layerwise Relevance Visualization in Convolutional Text Graph Classifiers☆12Jun 2, 2021Updated 4 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- AI Explans AI demo with a RAG application built using LangFlow and StreamLit☆14Apr 30, 2024Updated last year