Reinforcement Learning in LLM and NLP.
☆61Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for rl-llm-nlp
Users who are interested in rl-llm-nlp are comparing it to the libraries listed below:
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- This repo contains demonstrations of an extensible Crystal Structure Type Recognition Network (CSTRNet), which consists of a variable num…☆12May 21, 2024Updated last year
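- Introduction to Algorithms☆10Dec 20, 2021Updated 4 years ago
- Open-source 2021 radar station vision code from the Evolution team at Guilin University of Electronic Technology☆11Sep 3, 2021Updated 4 years ago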
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- Neuralizer.ai - Visual Neural Network Designer☆14Nov 8, 2022Updated 3 years ago
- Prediction of glycopeptide fragment mass spectra by deep learning☆10Feb 20, 2024Updated 2 years ago
- 2024 Guangxi Digital Open Innovation Application Competition: multimodal news rumor classification☆19Jan 18, 2025Updated last year
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- ☆11Apr 4, 2018Updated 7 years ago
- Low-latency live streaming PoC☆11Jul 30, 2019Updated 6 years ago
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆13Mar 24, 2025Updated 11 months ago
- ☆10Jul 13, 2022Updated 3 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- (Accepted by EMNLP 2022, main conference, long paper) Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding☆15Oct 29, 2022Updated 3 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆14Jan 7, 2025Updated last year
- Learning how others make beautiful notebooks. "Java Learning + Interview Guide": a guide covering most of the core knowledge Java programmers need to master.☆10Sep 24, 2021Updated 4 years ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 2 years ago
- ☆16Jun 10, 2025Updated 8 months ago
- ConceptNet to neo4j 2.2☆10Nov 6, 2015Updated 10 years ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- Used for onset picking☆11Oct 14, 2019Updated 6 years ago
- ☆24Jan 12, 2016Updated 10 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- ☆13May 12, 2025Updated 9 months ago
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆24Oct 3, 2025Updated 5 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- JLU drcom client written in golang.☆12Sep 4, 2019Updated 6 years ago
- EAST-inspired Tensorflow-based Text Detector☆11Feb 18, 2021Updated 5 years ago
- Torch 7 + Android port of Neural style algorithm☆10May 10, 2016Updated 9 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Aug 4, 2018Updated 7 years ago
- Birdiebot Target Perception And Decision Making Framework☆13Aug 29, 2022Updated 3 years ago
- Sample pytorch implementation of Covariant Compositional Networks☆13Feb 17, 2018Updated 8 years ago