krystalan / DRT
Deep Reasoning Translation (DRT) Project
☆239Updated 3 months ago
Alternatives and similar repositories for DRT
Users who are interested in DRT are comparing it to the repositories listed below.
- ☆300Updated 6 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 6 months ago
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Updated 11 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆81Updated last year
- ☆151Updated 3 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆129Updated last year
- GLM Series Edge Models☆153Updated 5 months ago
- ☆187Updated 2 months ago
- ☆95Updated last year
- Reformatted Alignment☆113Updated last year
- ☆50Updated last year
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆86Updated 8 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆229Updated 2 weeks ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- ☆95Updated last year
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆138Updated 11 months ago
- We are the first fully commercially usable role-playing large model.☆40Updated last year
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- ☆91Updated 6 months ago
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025)☆86Updated 6 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆137Updated last year
- ☆185Updated 3 weeks ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆118Updated 6 months ago
- ☆173Updated 7 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆212Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 4 months ago
- ☆83Updated last year
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆108Updated 8 months ago
- ☆40Updated last year
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆291Updated 4 months ago