RLDiary / Wordle-GRPOLinks
A $100 Agent - Reinforcement tuning a language model to play the game of Wordle
☆16Updated 6 months ago
Alternatives and similar repositories for Wordle-GRPO
Users that are interested in Wordle-GRPO are comparing it to the libraries listed below
Sorting:
- ☆52Updated 8 months ago
- Collection of resources for RL and Reasoning☆27Updated last year
- ☆80Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆108Updated 4 months ago
- Automating enterprise workflows with multimodal agents☆115Updated last year
- ☆79Updated last year
- PyTorch implementation for MRL☆21Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 10 months ago
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFace☆47Updated 5 months ago
- ☆214Updated 2 weeks ago
- ☆85Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆160Updated 4 months ago
- Python library to use Pleias-RAG models☆68Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated 2 years ago
- ☆259Updated 2 months ago
- Use 0plot to automatically build matplotlib plots using ChatGPT.☆19Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Training LLMs to reason and analyze data with notebooks☆61Updated 5 months ago
- ☆147Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆177Updated this week
- Data for the Chat With Your Data benchmark.☆148Updated 2 years ago
- Sythetic data generation and normalization functions powered by LLMs☆58Updated last year
- ☆31Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Updated last year
- Repository for our "RAG in Practice (2025)" event!☆17Updated 10 months ago
- ☆47Updated 2 years ago
- ☆210Updated 7 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆80Updated 9 months ago
- cheap & easy LLM experiments for amateurs (alpha)☆25Updated 2 months ago