patrick-tssn / LM-Research-HubLinks

Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)

☆19

Alternatives and similar repositories for LM-Research-Hub

Users that are interested in LM-Research-Hub are comparing it to the libraries listed below

Sorting:

szxiangjn / world-model-for-language-model
☆129Updated 10 months ago
Timothyxxx / EnvInteractiveLMPapers
Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…
☆127Updated last year
pkunlp-icler / PCA-EVAL
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆104Updated last year
Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆142Updated 7 months ago
Timothyxxx / WorldModelPapers
Paper collections of the continuous effort start from World Models.
☆172Updated 11 months ago
bigai-nlco / langsuite
Official Repo of LangSuitE
☆84Updated 9 months ago
lupantech / PromptPG
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
☆154Updated last year
RAGEN-AI / VAGEN
☆151Updated this week
xlang-ai / AgentTrek
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
☆32Updated 3 months ago
3B-Group / ConvRe
🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆23Updated last year
dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆71Updated last week
DigiRL-agent / digiq
☆102Updated last month
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆61Updated 7 months ago
facebookresearch / rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
☆43Updated last year
amazon-science / PAE
☆59Updated 3 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆34Updated last year
rookie-joe / AutoPSV
☆46Updated 7 months ago
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆161Updated 3 weeks ago
Berkeley-NLP / Agent-Eval-Refine
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆136Updated 6 months ago
princeton-nlp / ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆186Updated 3 months ago
Vance0124 / Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
☆140Updated 3 months ago
xlang-ai / text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
☆162Updated 5 months ago
ssmisya / PRMBench
The official code repository for PRMBench.
☆73Updated 3 months ago
abdulhaim / LMRL-Gym
☆93Updated 11 months ago
ruixin31 / Rethink_RLVR
☆231Updated last week
Linear95 / APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
☆54Updated last year
thu-ml / Noise-Contrastive-Alignment
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆53Updated 6 months ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆120Updated 8 months ago
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆89Updated 5 months ago
ShuangLI59 / Pre-Trained-Language-Models-for-Interactive-Decision-Making
Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]
☆126Updated 2 years ago