[ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"
☆21Feb 16, 2025Updated last year
Alternatives and similar repositories for TURN
Users that are interested in TURN are comparing it to the libraries listed below
Sorting:
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆23Oct 14, 2025Updated 4 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- a website for accessing many models through api(deepseek、Qwen、Hunyuan etc.)☆17Jul 12, 2025Updated 7 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Apr 1, 2025Updated 11 months ago
- Awesome Triton Resources☆39Apr 27, 2025Updated 10 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆78Feb 10, 2026Updated 3 weeks ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆39Mar 2, 2023Updated 3 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 5 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Official implementation for the paper "Can Large Reasoning Models Self-Train?"☆72Oct 10, 2025Updated 4 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- ☆14Mar 21, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- [2025 - Journey of Learning LLMs - Basic Skills/Projects/Papers]☆20Jun 21, 2025Updated 8 months ago
- LLM Skirmish☆44Feb 3, 2026Updated last month
- ☆10Jul 13, 2024Updated last year
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- ☆12Aug 6, 2024Updated last year
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin☆19Feb 26, 2026Updated last week
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- code for polite☆11Feb 28, 2024Updated 2 years ago
- ☆16Jul 23, 2023Updated 2 years ago
- ☆13Mar 25, 2025Updated 11 months ago
- Proxify Molotov.tv DRM to share content publicly☆10Jun 24, 2020Updated 5 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Robot simulator using web technologies, just JavaScript☆10Feb 13, 2020Updated 6 years ago
- Brand new TTS solution☆11Dec 7, 2024Updated last year
- 2022 秋季学期清华大学电子系数据与算法课程 OJ 参考解答☆10Jun 18, 2023Updated 2 years ago
- ☆11May 11, 2023Updated 2 years ago
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information☆14Dec 2, 2021Updated 4 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year