JiwenJ / Awesome-RL
A curated list of RL resources
☆50 · Aug 10, 2025 · Updated 6 months ago
Alternatives and similar repositories for Awesome-RL
Users who are interested in Awesome-RL are comparing it to the repositories listed below.
- The source code of [WWW 2025] MoDiCF ☆12 · Jul 12, 2025 · Updated 7 months ago
- Kernel Herding for probability density estimation ☆14 · Feb 23, 2016 · Updated 9 years ago
- Knowledge Base Graph Attention Networks ☆14 · Feb 22, 2020 · Updated 5 years ago
- Official implementation for "Knowledge Distillation with Refined Logits". ☆21 · Aug 26, 2024 · Updated last year
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models" ☆17 · Jul 20, 2025 · Updated 6 months ago
- ☆14 · Apr 21, 2023 · Updated 2 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 · Feb 9, 2025 · Updated last year
- An evaluation suite for Retrieval-Augmented Generation (RAG). ☆23 · Apr 26, 2025 · Updated 9 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models" ☆23 · May 27, 2025 · Updated 8 months ago
- ☆18 · May 5, 2021 · Updated 4 years ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆24 · Aug 25, 2025 · Updated 5 months ago
- A repository listing important datasets for multimodal recommender systems ☆29 · Mar 20, 2024 · Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set) ☆24 · Jul 26, 2024 · Updated last year
- ☆26 · May 29, 2022 · Updated 3 years ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024) ☆25 · Oct 18, 2025 · Updated 3 months ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023) ☆28 · Mar 26, 2024 · Updated last year
- ☆87 · Aug 16, 2025 · Updated 6 months ago
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation". ☆36 · Dec 22, 2024 · Updated last year
- The pre-release GitHub repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIOGRAMS VIDEO GENERATION ☆42 · Feb 4, 2025 · Updated last year
- Problem A of the 2nd "Teddy Cup" Data Analysis Vocational Skills Competition ☆10 · Sep 15, 2020 · Updated 5 years ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆34 · Mar 8, 2025 · Updated 11 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆32 · May 19, 2025 · Updated 8 months ago
- [AAAI’2025] SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention ☆57 · Aug 12, 2025 · Updated 6 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model ☆37 · Jan 8, 2025 · Updated last year
- Some resources (books, papers, videos, and online courses) about ML, DL, and DM ☆12 · Mar 14, 2021 · Updated 4 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 · Aug 19, 2023 · Updated 2 years ago
- ☆54 · Dec 17, 2025 · Updated 2 months ago
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop] ☆90 · Sep 13, 2024 · Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning ☆73 · Nov 4, 2025 · Updated 3 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models ☆26 · Feb 4, 2026 · Updated last week
- Azure Machine Learning - MLOps Python SDKv2 ☆10 · Jul 24, 2023 · Updated 2 years ago
- Evaluation Pipeline for medical tasks. ☆12 · Updated this week
- The datasets of our paper ☆11 · Feb 26, 2024 · Updated last year
- CDbw Index For Cluster Validation ☆10 · Mar 26, 2019 · Updated 6 years ago
- ☆13 · Nov 5, 2024 · Updated last year
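- Some takeaways from the 8th "Teddy Cup" Data Mining Challenge ☆10 · Nov 26, 2020 · Updated 5 years ago
- A development roadmap for machine learning, deep learning, generative adversarial networks (GAN), graph neural networks (GNN), NLP, and big data, with extensive source code (Python, PyTorch) to help readers absorb the fundamentals, get through interviews, and go from beginner to qualified… ☆10 · Feb 25, 2020 · Updated 5 years ago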
- Modern normalizing flows in Python. Simple to use and easily extensible. ☆11 · Updated this week
- ☆11 · Aug 20, 2025 · Updated 5 months ago