Pleias / RL-ReasoningLinks
Collection of resources for RL and Reasoning
☆27Updated 11 months ago
Alternatives and similar repositories for RL-Reasoning
Users that are interested in RL-Reasoning are comparing it to the libraries listed below
Sorting:
- ☆105Updated 10 months ago
- ☆140Updated 5 months ago
- ☆147Updated last year
- ☆53Updated 11 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- code for training & evaluating Contextual Document Embedding models☆202Updated 8 months ago
- An introduction to LLM Sampling☆79Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆61Updated 11 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- Train your own SOTA deductive reasoning model☆107Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- ☆210Updated 7 months ago
- Generalist and Lightweight Model for Text Classification☆168Updated 2 weeks ago
- Python library to use Pleias-RAG models☆68Updated 8 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated this week
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- Set of scripts to finetune LLMs☆38Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- Simple examples using Argilla tools to build AI☆57Updated last year
- ☆48Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated 3 weeks ago
- Let's build better datasets, together!☆269Updated last year
- ☆91Updated last month
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆84Updated last year
- ☆80Updated last year
- ☆55Updated last year