Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
☆28Jul 11, 2024Updated last year
Alternatives and similar repositories for contrastive_metrics
Users that are interested in contrastive_metrics are comparing it to the libraries listed below
Sorting:
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆33Nov 13, 2023Updated 2 years ago
- A library implementing the kernels for and experiments using extrinsic gauge equivariant vector field Gaussian Processes☆26Oct 28, 2021Updated 4 years ago
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated last year
- ☆15Sep 7, 2022Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- ☆18Jun 26, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"☆17Mar 24, 2023Updated 2 years ago
- ☆29Oct 30, 2023Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆249Nov 23, 2025Updated 3 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆93Dec 1, 2024Updated last year
- [JAG'26] SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence☆60Updated this week
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Nov 19, 2023Updated 2 years ago
- Reinforcing General Reasoning without Verifiers☆96Jun 24, 2025Updated 8 months ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆25Nov 8, 2024Updated last year
- ☆24Sep 27, 2022Updated 3 years ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆105Sep 29, 2025Updated 5 months ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆54May 19, 2025Updated 9 months ago
- Exploring the space of drug combinations to discover synergistic drugs using Active Learning☆24Aug 13, 2024Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation"☆26Dec 10, 2022Updated 3 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- ☆28Nov 22, 2019Updated 6 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- ☆78Nov 12, 2024Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Nov 28, 2022Updated 3 years ago
- Solving Simultaneous Target Assignment and Path Planning Efficiently with Time-Independent Execution (ICAPS-22; AIJ-23)☆31Aug 23, 2025Updated 6 months ago
- Code for building self-expanding knowledge graphs with Outlines, vLLM, neo4j, and Modal.☆38May 14, 2025Updated 9 months ago
- ☆27Jul 25, 2023Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆33Jun 3, 2023Updated 2 years ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Jul 16, 2023Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆87Oct 15, 2023Updated 2 years ago
- ☆36Sep 20, 2022Updated 3 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- EOSIO-Taurus - The Most Powerful Infrastructure for Decentralized Applications☆13Mar 29, 2024Updated last year
- ☆33Jul 30, 2024Updated last year