Reinforcement Learning in LLM and NLP.
☆63Dec 31, 2025Updated 3 months ago
Alternatives and similar repositories for rl-llm-nlp
Users that are interested in rl-llm-nlp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 12, 2025Updated 11 months ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- (Accepted By EMNLP2022 main long)Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding☆15Oct 29, 2022Updated 3 years ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 2 years ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Documentation at☆14Mar 27, 2025Updated last year
- Prediction of glycopeptide fragment mass spectra by deep learning☆11Feb 20, 2024Updated 2 years ago
- ☆10Feb 4, 2025Updated last year
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 7 months ago
- ☆33Sep 14, 2025Updated 7 months ago
- ☆10Mar 2, 2021Updated 5 years ago
- 2024广西数字开放创新应用大赛,多模态新闻谣言分类☆19Jan 18, 2025Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆13Mar 24, 2025Updated last year
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆25Oct 3, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Apr 1, 2026Updated 2 weeks ago
- ☆18Nov 22, 2025Updated 4 months ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 7 months ago
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 3 months ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 3 months ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- Some shell scripts related to Thermo Fisher raw format conversion using msconvert☆15Feb 10, 2026Updated 2 months ago
- Official implementation of the paper "Sparse Feature Factorization for Recommender Systems with Knowledge Graphs"☆22Oct 13, 2022Updated 3 years ago
- A Survey of Direct Preference Optimization (DPO)☆93Jul 4, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20May 14, 2025Updated 11 months ago
- Official repository of the ACL 2024 paper "Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Age…☆20May 28, 2024Updated last year
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆21Aug 18, 2024Updated last year
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset".☆31Nov 5, 2021Updated 4 years ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆33Jul 29, 2025Updated 8 months ago
- JLU drcom client written in golang.☆12Sep 4, 2019Updated 6 years ago
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆58Dec 26, 2025Updated 3 months ago
- ☆132Mar 4, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Oct 20, 2020Updated 5 years ago
- ☆11Apr 4, 2018Updated 8 years ago
- [NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”, Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangya…☆29Dec 30, 2021Updated 4 years ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 3 months ago
- Generates random utf-8 strings for fuzz t�sting character encoding probl�ms☆11Aug 21, 2015Updated 10 years ago
- 吴恩达 LangChain 课程中英双语字幕☆16Jun 3, 2023Updated 2 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago