Deep Reasoning Translation (DRT) Project
☆240Sep 1, 2025Updated 6 months ago
Alternatives and similar repositories for DRT
Users that are interested in DRT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆14Sep 4, 2025Updated 6 months ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆36Jan 13, 2024Updated 2 years ago
- ☆21Sep 5, 2023Updated 2 years ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆193Mar 20, 2025Updated last year
- The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation"..☆14Aug 6, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆276Jan 20, 2026Updated 2 months ago
- A benchmark for the task of translation suggestion☆60Jun 23, 2022Updated 3 years ago
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆68Apr 15, 2025Updated 11 months ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Nov 22, 2023Updated 2 years ago
- An Open Large Reasoning Model for Real-World Solutions☆1,539Feb 13, 2026Updated last month
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Sep 12, 2024Updated last year
- code for Teaching LM to Translate with Comparison☆39Dec 15, 2023Updated 2 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- AN O1 REPLICATION FOR CODING☆333Dec 11, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Scalable RL solution for advanced reasoning of language models☆1,831Mar 18, 2025Updated last year
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆296Aug 4, 2025Updated 7 months ago
- ☆19Nov 7, 2024Updated last year
- This is the repository for the paper ‘A Survey of Inductive Reasoning for Large Language Models’☆46Oct 27, 2025Updated 5 months ago
- Implementation of DTMT with incremental decoding☆13Feb 20, 2021Updated 5 years ago
- ICANN‘2021: Multi-Modal Chorus Recognition for Improving Song Search☆28Aug 30, 2021Updated 4 years ago
- Large Reasoning Models☆806Dec 3, 2024Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆224Nov 27, 2025Updated 4 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆71Mar 10, 2026Updated 2 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- O1 Replication Journey☆2,001Jan 14, 2025Updated last year
- ☆191Mar 13, 2026Updated 2 weeks ago
- ☆74May 22, 2025Updated 10 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- A series of technical report on Slow Thinking with LLM☆763Aug 13, 2025Updated 7 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,185Nov 17, 2025Updated 4 months ago
- notes for Multi-hop Reading Comprehension and open-domain question answering☆90Apr 21, 2022Updated 3 years ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆50Oct 18, 2024Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆223Jul 25, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆186Jul 23, 2025Updated 8 months ago
- ☆254May 30, 2024Updated last year
- A Neural Framework for MT Evaluation☆730Mar 5, 2026Updated 3 weeks ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆160Jun 26, 2025Updated 9 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 4 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 3 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago