krystalan / DRT
Deep Reasoning Translation (DRT) Project
☆241Sep 1, 2025Updated 5 months ago
Alternatives and similar repositories for DRT
Users that are interested in DRT are comparing it to the libraries listed below
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆14Sep 4, 2025Updated 5 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆193Mar 20, 2025Updated 10 months ago
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆65Apr 15, 2025Updated 9 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Sep 12, 2024Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆271Jan 20, 2026Updated 3 weeks ago
- An Open Large Reasoning Model for Real-World Solutions☆1,530Updated this week
- To try TextIn document parsing, click https://cc.co/16YSIy ☆22Jul 9, 2024Updated last year
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆36Jan 13, 2024Updated 2 years ago
- AN O1 REPLICATION FOR CODING☆334Dec 11, 2024Updated last year
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆293Aug 4, 2025Updated 6 months ago
- Scalable RL solution for advanced reasoning of language models☆1,803Mar 18, 2025Updated 10 months ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Nov 22, 2023Updated 2 years ago
- ☆62Oct 29, 2024Updated last year
- O1 Replication Journey☆1,999Jan 14, 2025Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 3 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆223Jul 25, 2025Updated 6 months ago
- ☆74May 22, 2025Updated 8 months ago
- Large Reasoning Models☆806Dec 3, 2024Updated last year
- ☆21Sep 5, 2023Updated 2 years ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆155Dec 24, 2024Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago
- ☆189Feb 5, 2026Updated last week
- Work in progress.☆79Nov 25, 2025Updated 2 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,170Nov 17, 2025Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆218Nov 27, 2025Updated 2 months ago
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 8 months ago
- ☆26Jul 8, 2025Updated 7 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆699Oct 15, 2025Updated 3 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated 11 months ago
- Reverse proxy for the X AI API☆12Dec 10, 2024Updated last year
- ☆104Dec 6, 2024Updated last year
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆52Oct 18, 2024Updated last year
- ✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆281May 9, 2025Updated 9 months ago
- ☆59Mar 3, 2025Updated 11 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆71Jul 5, 2025Updated 7 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,237Aug 16, 2025Updated 5 months ago
- ScrollNet for Continual Learning☆11Sep 11, 2023Updated 2 years ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year