patrick-tssn / LM-Research-HubLinks
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)
☆19Updated 2 months ago
Alternatives and similar repositories for LM-Research-Hub
Users that are interested in LM-Research-Hub are comparing it to the libraries listed below
Sorting:
- ☆129Updated 10 months ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆127Updated last year
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆104Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆142Updated 7 months ago
- Paper collections of the continuous effort start from World Models.☆172Updated 11 months ago
- Official Repo of LangSuitE☆84Updated 9 months ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆154Updated last year
- ☆151Updated this week
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆32Updated 3 months ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆23Updated last year
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆71Updated last week
- ☆102Updated last month
- GenRM-CoT: Data release for verification rationales☆61Updated 7 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆43Updated last year
- ☆59Updated 3 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆34Updated last year
- ☆46Updated 7 months ago
- Self-Alignment with Principle-Following Reward Models☆161Updated 3 weeks ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆136Updated 6 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆186Updated 3 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆140Updated 3 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆162Updated 5 months ago
- The official code repository for PRMBench.☆73Updated 3 months ago
- ☆93Updated 11 months ago
- ☆231Updated last week
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆54Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆53Updated 6 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- Natural Language Reinforcement Learning☆89Updated 5 months ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆126Updated 2 years ago