IouJenLiu / AFK
☆16 · Updated 2 years ago
Alternatives and similar repositories for AFK
Users who are interested in AFK are comparing it to the repositories listed below:
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆35 · Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 · Updated 5 months ago
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23) ☆26 · Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Updated last year
- Code for the Ask4Help project ☆22 · Updated 2 years ago
- [ICLR 2022] Linking Emergent and Natural Languages via Corpus Transfer ☆30 · Updated 8 months ago
- BASALT Benchmark datasets, evaluation code and agent training example. ☆20 · Updated last year
- Repository for Skill Set Optimization ☆12 · Updated 6 months ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS… ☆20 · Updated 3 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering ☆39 · Updated 3 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) ☆26 · Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆14 · Updated last week
- An RL-Friendly Vision-Language Model for Minecraft ☆29 · Updated 4 months ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆14 · Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆21 · Updated 11 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 · Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆28 · Updated 7 months ago
- Minimal code for A Generalist Agent ☆38 · Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆33 · Updated last year
- [ICML 2024] Language Models Represent Beliefs of Self and Others ☆31 · Updated 4 months ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics". ☆32 · Updated 2 years ago