Reinforcement Learning in LLM and NLP.
☆63Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for rl-llm-nlp
Users that are interested in rl-llm-nlp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆14Oct 4, 2023Updated 2 years ago
- Finds acronyms and their expansion in text☆10Aug 3, 2018Updated 7 years ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆35Aug 23, 2025Updated 7 months ago
- Documentation at☆14Mar 27, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Classify image and text with ResNet and BERT models using Pytorch☆13Jul 7, 2020Updated 5 years ago
- ☆16Jun 10, 2025Updated 9 months ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 6 months ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- ☆16Jul 12, 2024Updated last year
- 2024广西数字开放创新应用大赛,多模态新闻谣言分类☆19Jan 18, 2025Updated last year
- ☆32Sep 14, 2025Updated 6 months ago
- FashionAI challenge based on OpenPose☆17Apr 1, 2019Updated 6 years ago
- ☆20Jun 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆12Jun 19, 2025Updated 9 months ago
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆13Mar 24, 2025Updated last year
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆34Jul 12, 2024Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆24Nov 29, 2024Updated last year
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Updated this week
- ☆18Nov 22, 2025Updated 4 months ago
- GPTCloneBench is a clone detection benchmark based on SemanticCloneBench and GPT.☆16Feb 5, 2025Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 7 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A lightweight, continuously-updated catalog of research papers on AI agents.☆29Oct 13, 2025Updated 5 months ago
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 2 months ago
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆47Feb 15, 2026Updated last month
- ☆17Nov 3, 2024Updated last year
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 3 months ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- ☆20May 14, 2025Updated 10 months ago
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)☆35Dec 9, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆21Aug 18, 2024Updated last year
- QiDiHui: RAG, appbuilder, ErnieBot, multi-model, 十万个为什么☆21Aug 7, 2024Updated last year
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆57Dec 26, 2025Updated 3 months ago
- ☆131Mar 4, 2025Updated last year
- ☆10Oct 20, 2020Updated 5 years ago
- ☆11Apr 4, 2018Updated 7 years ago