使用transformer构建的机器翻译系统
☆10Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for Transformer_MT
Users that are interested in Transformer_MT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Transform的机器翻译系统☆21Jun 1, 2020Updated 5 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Apr 8, 2024Updated 2 years ago
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆14Feb 5, 2024Updated 2 years ago
- ☆11May 29, 2025Updated 11 months ago
- Isaac Gym environments and training for DexHand☆23Aug 21, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于Transformer的机器翻译系统☆12Jun 28, 2022Updated 3 years ago
- Solving Complex Dexterous Manipulation Tasks with Trajectory Optimisation and Reinforcement Learning☆23May 16, 2021Updated 5 years ago
- A robotic arm that learns to pick and place objects using reinforcement learning.☆22Jul 20, 2025Updated 10 months ago
- transformer,机器翻译,中文--英文☆86Feb 8, 2023Updated 3 years ago
- ☆27Apr 26, 2024Updated 2 years ago
- Code for running RL experiments on continuing (non-episodic) problems.☆21Feb 13, 2026Updated 3 months ago
- ☆25Jan 13, 2022Updated 4 years ago
- Privacy-first Chrome extension with React and FastAPI that auto-fills job applications and generates tailored responses using local LLMs …☆10Sep 9, 2025Updated 8 months ago
- CoRL 2025 TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models☆100Oct 25, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official implementation for Sequential Recommendation with Latent Relations based on Large Language Model☆45Nov 3, 2025Updated 6 months ago
- [NeurIPS 2023] H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation☆44Nov 6, 2023Updated 2 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Feb 10, 2024Updated 2 years ago
- Any-step Dynamics Model for Policy Optimization☆67Feb 13, 2025Updated last year
- Challenging dexterous manipulation environments for RL that extend the hand manipulation environments introduced in OpenAI's Gym☆53Jan 3, 2022Updated 4 years ago
- Convert MuJoCo mjcf to URDF format.☆86Mar 9, 2020Updated 6 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆95Jun 4, 2024Updated last year
- DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation, CoRL 2022☆104May 22, 2024Updated 2 years ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆111Aug 9, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Public repository of the RL environment for robotic hands and a simulated model of the Faive Hand (and also somewhat easily extendable to…☆109Feb 28, 2025Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆94Mar 4, 2023Updated 3 years ago
- A Minimal Example of Isaac Gym with DQN and PPO.☆111May 11, 2023Updated 3 years ago
- [NeurIPS 2023] Efficient Diffusion Policy☆114Oct 31, 2023Updated 2 years ago
- 中国科学院大学计算机科学与技术学院深度学习(徐俊刚老师)☆89Jul 28, 2022Updated 3 years ago
- API for controlling LEAP Hand v1☆162Oct 20, 2025Updated 7 months ago
- 基于C++线程池的轻量级Web并发服务器☆158Jun 11, 2021Updated 4 years ago
- Reinforcement Learning Agents Trained in the CARLA Simulator☆136Mar 23, 2020Updated 6 years ago
- ☆163Jul 12, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆175Jul 7, 2024Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆209Feb 18, 2025Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆239Nov 24, 2025Updated 6 months ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆227Jun 29, 2023Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆191Jul 25, 2024Updated last year
- This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".☆348Mar 30, 2026Updated last month
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆217Mar 15, 2023Updated 3 years ago