evanthebouncy / larc_gpt4
larc solving with gpt4
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for larc_gpt4
- ☆18Updated last year
- ☆18Updated 5 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆35Updated last year
- ☆19Updated 2 years ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆80Updated last week
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 2 months ago
- ☆57Updated 4 months ago
- ☆73Updated 4 months ago
- Language-annotated Abstraction and Reasoning Corpus☆78Updated last year
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆64Updated 2 years ago
- GPT implementation in Flax☆18Updated 2 years ago
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]☆54Updated 3 years ago
- ☆20Updated 7 months ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- ☆38Updated 3 years ago
- ☆15Updated last month
- ☆32Updated 3 years ago
- This repository contains the source code of the EMNLP 2020 paper Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehensio…☆20Updated 4 years ago
- Scaling scaling laws with board games.☆43Updated last year
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆32Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆13Updated 2 years ago
- ☆54Updated 3 years ago
- ☆77Updated 3 months ago
- Super fast implementations of common benchmark text world games☆43Updated 2 weeks ago
- PyTorch Package For Quasimetric Learning☆42Updated 3 weeks ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆36Updated 2 months ago
- Repository for Skill Set Optimization☆12Updated 3 months ago
- flexible meta-learning in jax☆12Updated last year
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆14Updated last year