[ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"
☆23Feb 16, 2025Updated last year
Alternatives and similar repositories for TURN
Users that are interested in TURN are comparing it to the libraries listed below.
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆31Oct 14, 2025Updated 7 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆104Apr 21, 2026Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- Neural theorem proving evaluation via the Lean REPL☆23Jul 12, 2025Updated 10 months ago
- AgentIR is a retriever specialized for Deep Research agents.☆55Apr 16, 2026Updated last month
- ☆17Aug 1, 2025Updated 9 months ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆20Oct 17, 2025Updated 7 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Apr 1, 2025Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 11 months ago
- Agentic RL on Any Harness at Scale☆136May 15, 2026Updated last week
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)☆11Oct 15, 2020Updated 5 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 10 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 5 months ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆43Nov 8, 2020Updated 5 years ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆24Feb 10, 2025Updated last year
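- Reference solutions for the online judge (OJ) of the Data and Algorithms course, Department of Electronic Engineering, Tsinghua University, Fall 2022 semester☆10Jun 18, 2023Updated 2 years ago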
- Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.☆48Updated this week
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated 2 years ago
- Train a tiny LLaMA model from scratch to repeat your words using Reinforcement Learning from Human Feedback (RLHF)☆18May 23, 2024Updated 2 years ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆17Apr 3, 2025Updated last year
- [IJCAI 2023] The official repo of paper 'Automatic Truss Design with Reinforcement Learning'☆19Jun 19, 2023Updated 2 years ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆115May 22, 2025Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 5 months ago
- Awesome Triton Resources☆41Apr 27, 2025Updated last year
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases.☆18Oct 7, 2022Updated 3 years ago
- Third-prize entry in the 2022 Loongson Cup individual competition☆14Oct 11, 2023Updated 2 years ago
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆21Sep 10, 2016Updated 9 years ago
- ☆15Sep 10, 2019Updated 6 years ago
- ☆19Apr 5, 2025Updated last year
- ☆18Mar 15, 2021Updated 5 years ago
- ☆33Jun 12, 2025Updated 11 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆29Dec 18, 2024Updated last year
- Here we provide and collect many functions to generate math problem and step by step solutions for LLM training☆18Jun 21, 2023Updated 2 years ago
- ☆232Apr 4, 2025Updated last year
- ☆14Oct 24, 2024Updated last year