Online Preference Alignment for Language Models via Count-based Exploration
☆18Jan 14, 2025Updated last year
Alternatives and similar repositories for COPO
Users that are interested in COPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆36Apr 6, 2026Updated last week
- [NeurIPS' 24] The PyTorch implementation of our paper: "Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learnin…☆21Oct 10, 2024Updated last year
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 5 months ago
- [NeurIPS 2025] Official Implementation of "HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning"☆87Nov 6, 2025Updated 5 months ago
- Official PyTorch Implementation of Paper -- "MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains"☆241Nov 11, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of paper: LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Serie…☆18Dec 19, 2025Updated 3 months ago
- Robust and safe deep reinforcement learning algorithms☆17Mar 27, 2024Updated 2 years ago
- ☆16Jun 12, 2024Updated last year
- [TGRS 2024] CutMix-CD: Advancing Semi-Supervised Change Detection via Mixed Sample Consistency☆21Nov 30, 2025Updated 4 months ago
- ☆39May 19, 2025Updated 10 months ago
- ☆13Jun 4, 2025Updated 10 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆96Mar 1, 2025Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆26Oct 23, 2024Updated last year
- ☆12Nov 10, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆58Dec 27, 2023Updated 2 years ago
- ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning☆10Jul 16, 2024Updated last year
- NeurIPS 2024: Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding☆16Jun 20, 2025Updated 9 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated last year
- ☆16Nov 1, 2023Updated 2 years ago
- ☆23May 28, 2025Updated 10 months ago
- Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.☆20May 22, 2022Updated 3 years ago
- [CVPR 2025] Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space☆38Jul 18, 2025Updated 8 months ago
- This is the official git for team Kanaloa☆11Dec 9, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of "Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance".☆65Oct 16, 2025Updated 6 months ago
- A PyTorch implementation of [VCT](https://github.com/google-research/google-research/tree/master/vct)☆10Nov 25, 2022Updated 3 years ago
- Source code of the paper titled "Digital Semantic Communications: An Alternating Multi-Phase Training Strategy with Mask Attack"☆14Oct 5, 2025Updated 6 months ago
- The codes are for the paper: ``Complete Dictionary Learning via \ell_p-norm Maximization'',Yifei Shen∗ , Ye Xue∗ , Jun Zhang , Khaled B. …☆11Nov 21, 2020Updated 5 years ago
- ☆33Jul 15, 2025Updated 9 months ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Code for paper "Efficient Sparse Coding using Hierarchical Riemannian Pursuit," in IEEE Transactions on Signal Processing, Y. Xue, V. K. …☆13Jul 20, 2021Updated 4 years ago
- J-BHI 2024: Exploiting Hierarchical Interactions for Protein Surface Learning☆17Jan 21, 2024Updated 2 years ago
- Benchmarking Deepseek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆106Jul 18, 2025Updated 8 months ago
- Decoupled Q-Chunking☆60Jan 10, 2026Updated 3 months ago
- CycleQD is a framework for parameter space model merging.☆49Feb 1, 2025Updated last year
- TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control☆348Feb 7, 2026Updated 2 months ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Official Code for "Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning"☆141May 16, 2025Updated 11 months ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago