Pleias / RL-ReasoningLinks
Collection of resources for RL and Reasoning
☆26Updated 6 months ago
Alternatives and similar repositories for RL-Reasoning
Users that are interested in RL-Reasoning are comparing it to the libraries listed below
Sorting:
- ☆134Updated last week
- ☆145Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- An introduction to LLM Sampling☆79Updated 8 months ago
- Simple UI for debugging correlations of text embeddings☆290Updated 3 months ago
- Python library to use Pleias-RAG models☆61Updated 4 months ago
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- code for training & evaluating Contextual Document Embedding models☆197Updated 3 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 8 months ago
- ☆49Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 11 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- ☆94Updated 5 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆79Updated last year
- ☆54Updated 9 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆59Updated 6 months ago
- Pre-train Static Word Embeddings☆84Updated 2 months ago
- awesome synthetic (text) datasets☆295Updated last month
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- ☆118Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆238Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago
- ☆66Updated 3 months ago
- ☆80Updated last year
- A RAG that can scale 🧑🏻💻☆11Updated last year
- Let's build better datasets, together!☆262Updated 8 months ago