Pleias / RL-Reasoning
Collection of resources for RL and Reasoning
☆26 Updated 6 months ago
Alternatives and similar repositories for RL-Reasoning
Users who are interested in RL-Reasoning are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- Python library to use Pleias-RAG models ☆61 Updated 3 months ago
- ☆129 Updated 4 months ago
- An introduction to LLM Sampling ☆79 Updated 7 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆58 Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆100 Updated 3 months ago
- Simple UI for debugging correlations of text embeddings ☆288 Updated 2 months ago
- ☆145 Updated last year
- ☆93 Updated 4 months ago
- Train your own SOTA deductive reasoning model ☆104 Updated 5 months ago
- code for training & evaluating Contextual Document Embedding models ☆196 Updated 2 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- ☆49 Updated 6 months ago
- ☆65 Updated 2 months ago
- A RAG that can scale 🧑🏻‍💻 ☆11 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆65 Updated last year
- Generalist and Lightweight Model for Text Classification ☆148 Updated last month
- ☆118 Updated 11 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆232 Updated 9 months ago
- awesome synthetic (text) datasets ☆291 Updated last month
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆76 Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆73 Updated 8 months ago
- ☆211 Updated last month
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆111 Updated 4 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 6 months ago
- ☆53 Updated 9 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- ☆76 Updated this week
- Pre-train Static Word Embeddings ☆85 Updated 2 months ago