Pleias / RL-ReasoningLinks
Collection of resources for RL and Reasoning
☆26Updated 8 months ago
Alternatives and similar repositories for RL-Reasoning
Users that are interested in RL-Reasoning are comparing it to the libraries listed below
Sorting:
- Simple UI for debugging correlations of text embeddings☆296Updated 5 months ago
- Python library to use Pleias-RAG models☆64Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- ☆136Updated 2 months ago
- An introduction to LLM Sampling☆79Updated 10 months ago
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- ☆50Updated 8 months ago
- ☆146Updated last year
- Train your own SOTA deductive reasoning model☆109Updated 7 months ago
- code for training & evaluating Contextual Document Embedding models☆199Updated 5 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
- ☆80Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆61Updated 8 months ago
- ☆96Updated 7 months ago
- ☆119Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 6 months ago
- ☆210Updated 4 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Let's build better datasets, together!☆263Updated 10 months ago
- The first dense retrieval model that can be prompted like an LM☆89Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆68Updated last year
- ☆68Updated 5 months ago
- Pre-train Static Word Embeddings☆87Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 11 months ago