repo for TMLR'26 paper "Reconciling In-Context and In-Weight Learning via Dual Representation Space Encoding"
☆25Feb 22, 2026Updated last week
Alternatives and similar repositories for dual-representation-space-encoding
Users that are interested in dual-representation-space-encoding are comparing it to the libraries listed below
Sorting:
- Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization☆15Jul 3, 2024Updated last year
- Code accompanying the ICML'24 paper "Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize"☆23Feb 13, 2025Updated last year
- CRIL: Continual Robot Imitation Learning via Generative Dynamics Model☆22Mar 13, 2021Updated 4 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆17Jul 11, 2023Updated 2 years ago
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…☆44Nov 6, 2023Updated 2 years ago
- This repo hosts the code for the Fast Trainable Projection (FTP) project.☆12Nov 16, 2023Updated 2 years ago
- ☆15Feb 22, 2024Updated 2 years ago
- ☆43Dec 19, 2025Updated 2 months ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- 清华大学生物,医学,药学等相关专业的毕业论文latex模板。也适用于其他专业。适合本硕博毕业论文和博后报告。本模板在tuna协会的thuthesis项目基础上,增补了和生医药相关同学的内容,也增添了对latex新手更加友好的注释。☆24Sep 14, 2023Updated 2 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- ☆18Oct 6, 2022Updated 3 years ago
- [ICCV 2023]The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Jul 16, 2023Updated 2 years ago
- Invariant-feature Subspace Recovery (ISR)☆23Sep 23, 2022Updated 3 years ago
- ☆25Feb 20, 2026Updated last week
- This is the unofficial PyTorch implementation of Domain Generalization with Adversarial Feature Learning.☆19Jun 27, 2022Updated 3 years ago
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- ☆20Nov 14, 2022Updated 3 years ago
- General purpose environment wrappers for openai gym☆25Jun 5, 2019Updated 6 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Apr 20, 2023Updated 2 years ago
- ☆140Dec 4, 2025Updated 2 months ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated last year
- Verlog: A Multi-turn RL framework for LLM agents☆68Updated this week
- ☆34Nov 21, 2023Updated 2 years ago
- ☆66Jul 13, 2025Updated 7 months ago
- generate couplet(对联生成) Tensorflow☆35May 6, 2019Updated 6 years ago
- Build PyTorch CIFAR100 using coarse labels☆39Jun 20, 2020Updated 5 years ago
- ImageNetV2 Pytorch Dataset☆42Apr 17, 2023Updated 2 years ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆53Feb 14, 2025Updated last year
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆57Nov 20, 2025Updated 3 months ago
- Domain Generalization via Gradient Surgery☆51May 3, 2022Updated 3 years ago
- ☆45May 5, 2023Updated 2 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆57May 20, 2022Updated 3 years ago
- ☆52Dec 13, 2024Updated last year
- 用来存放重要的公开文件☆49Aug 16, 2024Updated last year
- PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention in OpenCLIP)☆77Jun 10, 2025Updated 8 months ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL).☆54Apr 27, 2020Updated 5 years ago
- ☆68Mar 23, 2022Updated 3 years ago