Model Predictive Task Sampling
☆87Feb 28, 2026Updated last month
Alternatives and similar repositories for MPTS
Users that are interested in MPTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?☆76Mar 4, 2026Updated last month
- [ICML 2025] Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments☆63Aug 19, 2025Updated 7 months ago
- ☆25Oct 27, 2023Updated 2 years ago
- Code for GO4Align: Group Optimization for Multi-Task Alignment☆20Sep 25, 2024Updated last year
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆35Feb 19, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning☆16Oct 29, 2025Updated 5 months ago
- [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression☆14Oct 29, 2025Updated 5 months ago
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- ☆20Oct 22, 2024Updated last year
- Code of ICLR 2025 paper "DynaPrompt: Dynamic Test-Time Prompt Tuning"☆22Jan 29, 2025Updated last year
- ☆17Oct 31, 2023Updated 2 years ago
- MegEngine implementation of Diffusion Models.☆19Aug 8, 2022Updated 3 years ago
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆12May 2, 2024Updated last year
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆23Feb 8, 2022Updated 4 years ago
- uncertainty-guided matting on ICML2023☆12Aug 3, 2023Updated 2 years ago
- ☆15Jul 8, 2024Updated last year
- ☆16Feb 17, 2025Updated last year
- Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo…☆17Dec 7, 2024Updated last year
- ☆13Mar 16, 2019Updated 7 years ago
- Official implementation for "Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noi…☆15Jun 11, 2022Updated 3 years ago
- Ray-Transfer Function scripts and paper figures☆11Dec 7, 2022Updated 3 years ago
- [ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning☆15Aug 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆33Dec 30, 2024Updated last year
- The latex template for seuthesis-2020☆12Jan 22, 2021Updated 5 years ago
- intrinsic motivation in grid worlds☆26May 3, 2020Updated 5 years ago
- Open source version of the original ISET, a complement to ISETBIO☆161Apr 1, 2026Updated 2 weeks ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated last year
- PyTorch implementation of the End-to-End Memory Network with attention layer vizualisation support.☆12Jun 30, 2018Updated 7 years ago
- Temperature Schedules for self-supervised contrastive methods on long-tail data (ICLR'23)☆18Apr 25, 2023Updated 2 years ago
- The numpy version(python3.7) of MPI Skinned Models.☆17Feb 14, 2020Updated 6 years ago
- siamise networks☆14Apr 25, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the ECCV22 paper Demystifying Unsupervised Semantic Correspondence Estimation☆14Oct 18, 2022Updated 3 years ago
- ☆16Jan 12, 2023Updated 3 years ago
- Underwater Image Systems Simulation☆19Apr 1, 2021Updated 5 years ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆27Oct 28, 2024Updated last year
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- CVPR2023: AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning☆18May 19, 2023Updated 2 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Apr 27, 2020Updated 5 years ago