hscspring / rl-llm-nlpView external linksLinks
Reinforcement Learning in LLM and NLP.
☆62Dec 31, 2025Updated last month
Alternatives and similar repositories for rl-llm-nlp
Users that are interested in rl-llm-nlp are comparing it to the libraries listed below
Sorting:
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆33Aug 23, 2025Updated 5 months ago
- ☆10Oct 20, 2020Updated 5 years ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Jun 19, 2025Updated 7 months ago
- Documentation at☆14Mar 27, 2025Updated 10 months ago
- 桂林电子科技大学Evolution战队2021雷达站视觉代码开源☆12Sep 3, 2021Updated 4 years ago
- Flame graphs for JVMs running inside Docker containers☆11May 6, 2019Updated 6 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆23Oct 3, 2025Updated 4 months ago
- Low-latency live streaming PoC☆11Jul 30, 2019Updated 6 years ago
- 2024广西数字开放创新应用大赛,多模态新闻谣言分类☆19Jan 18, 2025Updated last year
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆15Mar 26, 2025Updated 10 months ago
- (Accepted By EMNLP2022 main long)Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding☆14Oct 29, 2022Updated 3 years ago
- Keyphrase Extraction from Scholarly Documents - Thesis☆14Nov 3, 2021Updated 4 years ago
- ☆16Mar 17, 2025Updated 10 months ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- Used for onset picking☆11Oct 14, 2019Updated 6 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- ☆16Jun 10, 2025Updated 8 months ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 2 years ago
- ☆13May 12, 2025Updated 9 months ago
- 2021全国大学生工程训练综合能力竞赛智能物流搬运赛道视觉开源代码.☆13Sep 27, 2022Updated 3 years ago
- The official repository of the Eesen project☆12Jun 20, 2018Updated 7 years ago
- JLU drcom client written in golang.☆12Sep 4, 2019Updated 6 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 3 years ago
- Command-line script to access global proxy via PKU VPN☆13Sep 10, 2022Updated 3 years ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆14Oct 4, 2023Updated 2 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 4 months ago
- Sample pytorch implementation of Covariant Compositional Networks☆13Feb 17, 2018Updated 7 years ago
- ☆17May 2, 2024Updated last year
- YouTube-Based Multimodal Recipe Recommender☆14Jul 11, 2024Updated last year
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)☆29Dec 9, 2025Updated 2 months ago
- ☆16Apr 24, 2024Updated last year
- ☆16Jan 8, 2020Updated 6 years ago
- ☆14Mar 2, 2021Updated 4 years ago
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 3 weeks ago
- ☆20Dec 14, 2024Updated last year