☆217 · Feb 22, 2025 · Updated last year
Alternatives and similar repositories for Reinforcement-Learning-Enhanced-LLMs-A-Survey
Users who are interested in Reinforcement-Learning-Enhanced-LLMs-A-Survey are comparing it to the libraries listed below.
- BatchedWallet is a simple example implementation of an ERC-4337-compliant smart wallet. ☆17 · Dec 11, 2023 · Updated 2 years ago
- Performance Profiling Tool For React Native ☆71 · Jun 8, 2025 · Updated 11 months ago
- The Multi-Platform Haveno App for Monero ☆60 · Sep 13, 2025 · Updated 8 months ago
- A handy lib for smooth interaction with large language models (LLMs) and crafting AI apps. ☆107 · Apr 29, 2026 · Updated last month
- Rad UI is an open-source, headless UI component library for building modern, fast, performant, accessible React applications ☆58 · May 21, 2026 · Updated last week
- commit for project ☆15 · Mar 6, 2025 · Updated last year
- Dynamiq is an orchestration framework for agentic AI and LLM applications ☆1,052 · Updated this week
- AgentTrace is a lightweight observability library to trace and evaluate agentic systems. ☆60 · Apr 1, 2025 · Updated last year
- Manage AWS IAM Identity Center permission sets and account assignments with Terraform. ☆35 · May 19, 2026 · Updated last week
- ☆10 · May 1, 2025 · Updated last year
- API Docs ☆12 · Mar 30, 2025 · Updated last year
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D ☆15 · Nov 9, 2023 · Updated 2 years ago
- A React (React.js) library of Lightweight-charts components written in Typescript ☆111 · May 18, 2026 · Updated last week
- [ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem ☆22 · Apr 7, 2026 · Updated last month
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL … ☆11 · May 22, 2023 · Updated 3 years ago
- ☆11 · Jul 17, 2021 · Updated 4 years ago
- Evaluation Pipeline for medical tasks. ☆12 · Apr 8, 2026 · Updated last month
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted… ☆16 · Oct 18, 2023 · Updated 2 years ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition". ☆12 · Oct 18, 2021 · Updated 4 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio… ☆14 · Jan 25, 2024 · Updated 2 years ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆232 · Aug 10, 2025 · Updated 9 months ago
- Artifact for TOSEM Submission: GiantRepair ☆13 · Jun 26, 2024 · Updated last year
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021) ☆24 · Mar 18, 2021 · Updated 5 years ago
- An implementation of Tare. ☆12 · Feb 23, 2024 · Updated 2 years ago
- Harbin Institute of Technology Database course, Spring 2019 ☆12 · Nov 21, 2019 · Updated 6 years ago
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models ☆30 · Dec 15, 2025 · Updated 5 months ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model" ☆22 · Jun 28, 2024 · Updated last year
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆25 · Sep 26, 2024 · Updated last year
- A collection of instruction data and scripts for machine translation. ☆20 · Sep 23, 2023 · Updated 2 years ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen… ☆25 · Feb 14, 2025 · Updated last year
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020. ☆23 · Aug 20, 2021 · Updated 4 years ago
- LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi… ☆39 · May 2, 2024 · Updated 2 years ago
- ☆32 · Jul 8, 2024 · Updated last year
- A comprehensive paper list of Reasoning over Tables. ☆30 · Nov 6, 2022 · Updated 3 years ago
- ☆33 · Dec 29, 2024 · Updated last year
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆18 · Apr 5, 2025 · Updated last year
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat… ☆45 · Jun 11, 2025 · Updated 11 months ago
- The First International Workshop on Large Language Model for Code 2024 (Co-Located with ICSE 2024) ☆17 · Oct 4, 2024 · Updated last year
- Course materials for Harbin Institute of Technology's Software Architecture and Middleware course, Spring 2022 ☆18 · Dec 18, 2022 · Updated 3 years ago