Pleias / RL-ReasoningLinks
Collection of resources for RL and Reasoning
☆26Updated 7 months ago
Alternatives and similar repositories for RL-Reasoning
Users that are interested in RL-Reasoning are comparing it to the libraries listed below
Sorting:
- Python library to use Pleias-RAG models☆62Updated 4 months ago
- ☆135Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- An introduction to LLM Sampling☆79Updated 9 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆59Updated 7 months ago
- ☆145Updated last year
- ☆54Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- Simple examples using Argilla tools to build AI☆55Updated 10 months ago
- ☆49Updated 7 months ago
- ☆68Updated 3 months ago
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 5 months ago
- ☆95Updated 5 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 5 months ago
- Simple UI for debugging correlations of text embeddings☆291Updated 3 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- ☆80Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆89Updated 11 months ago
- The first dense retrieval model that can be prompted like an LM☆87Updated 4 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆52Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆111Updated last year
- Generalist and Lightweight Model for Text Classification☆159Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago