HarleyCoops / OneShotGRPO
One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.
☆21Updated 2 weeks ago
Alternatives and similar repositories for OneShotGRPO:
Users that are interested in OneShotGRPO are comparing it to the libraries listed below
- ☆76Updated 9 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- ☆61Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆76Updated this week
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated 2 months ago
- ☆87Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆115Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆52Updated last week
- Simple GRPO scripts and configurations.☆58Updated last month
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 11 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆263Updated 3 months ago
- ☆88Updated last year
- ☆29Updated last year
- RAG example using DSPy, Gradio, FastAPI☆75Updated 11 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆117Updated 3 weeks ago
- A list of AI memory projects☆84Updated 2 months ago
- Build a Recommendation System Agent using LATS Agent Approach☆28Updated last month
- ☆45Updated 11 months ago
- ☆61Updated 5 months ago
- AI agent with RAG+ReAct on Indian Constitution & BNS☆60Updated 5 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆119Updated last year
- ☆85Updated 2 months ago
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- ☆120Updated 2 weeks ago
- LLM reads a paper and produce a working prototype☆51Updated last week
- ☆140Updated last month