abenechehab / dicl
[ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcement Learning using Large Language Models".
☆15 · Updated 2 weeks ago
Alternatives and similar repositories for dicl:
Users who are interested in dicl are comparing it to the repositories listed below.
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆40 · Updated 8 months ago
- MPI Code Generation through Domain-Specific Language Models ☆13 · Updated 3 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 · Updated last month
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆24 · Updated 4 months ago
- ☆60 · Updated 3 weeks ago
- Alpha-Zero Connect Four NN trained via self play ☆13 · Updated 5 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆24 · Updated this week
- A Data Source for Reasoning Embodied Agents ☆19 · Updated last year
- Generative cellular automaton-like learning environments for RL. ☆19 · Updated last month
- Learn online intrinsic rewards from LLM feedback ☆34 · Updated 2 months ago
- The repository contains code for Adaptive Data Optimization ☆20 · Updated 2 months ago
- Implementation of Spectral State Space Models ☆16 · Updated last year
- ☆26 · Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆32 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 · Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 · Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction". ☆34 · Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 · Updated 6 months ago
- ☆21 · Updated 5 months ago
- ☆28 · Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆19 · Updated 3 weeks ago
- ☆14 · Updated 5 months ago
- Causal Agent based on Large Language Model ☆39 · Updated 6 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated last year
- Lottery Ticket Adaptation ☆37 · Updated 3 months ago