Awesome Long-CoT Data
☆18Mar 26, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-Long-CoT-Data
Users that are interested in Awesome-Long-CoT-Data are comparing it to the libraries listed below
Sorting:
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 7 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 8 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data☆132Jan 31, 2026Updated last month
- ☆17Jun 3, 2024Updated last year
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 10 months ago
- Unified 2D and 3D Pre-Training of Molecular Representations☆30Jun 30, 2022Updated 3 years ago
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Nov 21, 2023Updated 2 years ago
- [BIB 2023] Official implementation of "R2-DDI: Relation-aware Feature Refinement for Drug-drug Interaction Prediction".☆13Mar 18, 2024Updated last year
- HiCRISP Full Code, containing VirtualHome, pybullet simulator and Real AGV platform.☆15Apr 8, 2024Updated last year
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers☆17Mar 20, 2024Updated last year
- ☆20Feb 26, 2021Updated 5 years ago
- The official implementation of dual-view molecule pre-training.☆43Nov 22, 2021Updated 4 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design☆20Jul 26, 2023Updated 2 years ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆27Jul 9, 2024Updated last year
- ☆34Nov 18, 2025Updated 3 months ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆55Nov 14, 2023Updated 2 years ago
- SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction (Briefings in Bioinformatics 2023)☆55May 28, 2024Updated last year
- ☆176Apr 15, 2025Updated 10 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆123Sep 14, 2024Updated last year
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆72Feb 25, 2025Updated last year
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆63Aug 10, 2025Updated 6 months ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆28Nov 5, 2020Updated 5 years ago
- Call for participation in the impact of LLM for scientific discovery☆77Apr 11, 2024Updated last year
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- ☆74Oct 21, 2023Updated 2 years ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 6 months ago
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025☆55Mar 1, 2025Updated last year
- ☆30Updated this week
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks☆11Jun 19, 2025Updated 8 months ago
- ☆32Updated this week
- ☆10Dec 20, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆12Mar 24, 2024Updated last year
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 8 months ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆16Dec 12, 2024Updated last year
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆18Dec 24, 2024Updated last year
- Encoder-decoders for translating different chemical formats.☆18Sep 17, 2025Updated 5 months ago