[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆40May 2, 2024Updated 2 years ago
Alternatives and similar repositories for Rememberer
Users that are interested in Rememberer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Universal Platform for Training and Evaluation of Mobile Interaction☆62Sep 24, 2025Updated 8 months ago
- ☆15Mar 26, 2024Updated 2 years ago
- ☆12Jul 4, 2024Updated last year
- ☆12Jun 12, 2024Updated last year
- Text-to-Drive: Diverse Driving Behaviors Synthesis via Large Language Models☆11Mar 17, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Dec 6, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- ☆14Dec 25, 2024Updated last year
- Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)☆16Sep 5, 2023Updated 2 years ago
- ☆55Jul 21, 2022Updated 3 years ago
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated last year
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16May 14, 2026Updated last week
- ☆13Aug 26, 2024Updated last year
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆33Sep 20, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆21Oct 12, 2024Updated last year
- ☆69Dec 15, 2024Updated last year
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated last year
- ☆16Jun 25, 2025Updated 11 months ago
- ☆213Dec 20, 2024Updated last year
- [ICCV 2025] AdsQA: Towards Advertisement Video Understanding Arxiv: https://arxiv.org/abs/2509.08621☆34Oct 30, 2025Updated 6 months ago
- Official PyTorch implementation of "A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning"☆28Oct 12, 2022Updated 3 years ago
- Convert CVXPY expressions to PyTorch expressions☆18Jul 8, 2025Updated 10 months ago
- ☆16Oct 3, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Oct 25, 2023Updated 2 years ago
- It is about how to load and aggregate pretrained word embeddings in pytorch, e.g., ELMo\BERT\XLNET.☆12Mar 2, 2020Updated 6 years ago
- ☆20Aug 15, 2023Updated 2 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆73Oct 30, 2024Updated last year
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 3 years ago
- ☆22May 3, 2025Updated last year
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆18Jul 22, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of Mean Field Multi-Agent Reinforcement Learning in Pytorch☆22Apr 27, 2024Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated last year
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 8 years ago
- PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models.☆10Feb 16, 2024Updated 2 years ago
- Deep Reinforcement Learning in CARLA simulator☆16Mar 10, 2024Updated 2 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆25May 17, 2026Updated last week
- ☆11Mar 3, 2026Updated 2 months ago