Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)
☆55Aug 8, 2024Updated last year
Alternatives and similar repositories for quiet-star
Users that are interested in quiet-star are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Quiet-STaR☆741Aug 21, 2024Updated last year
- ☆20Nov 3, 2024Updated last year
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆110Oct 11, 2025Updated 5 months ago
- ☆63Dec 6, 2024Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆18Oct 26, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆77Feb 18, 2026Updated last month
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆221Feb 21, 2023Updated 3 years ago
- About The official GitHub page for ''Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with …☆28Dec 12, 2024Updated last year
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆44Updated this week
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Sep 2, 2024Updated last year
- ☆28Apr 16, 2024Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 6 months ago
- Python implementation of a Minimal Active Inference Agent☆17Feb 9, 2023Updated 3 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 3 weeks ago
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 4 months ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- ☆33Sep 19, 2025Updated 6 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆10Dec 21, 2024Updated last year
- Behavior Trees Library in YARP☆12Apr 16, 2018Updated 7 years ago
- (SIGIR 25) Repo for "Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation"☆10Jan 18, 2025Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- A Flexible Framework for Generative Recommendation☆33Mar 6, 2026Updated 3 weeks ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆73Feb 25, 2025Updated last year
- ☆16Jul 12, 2024Updated last year
- ☆78Dec 26, 2023Updated 2 years ago
- A repository containing learning materials I study/summarize☆19Mar 15, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆34Mar 21, 2026Updated last week
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Jan 23, 2025Updated last year
- ☆90Feb 12, 2026Updated last month
- ☆12Aug 19, 2025Updated 7 months ago
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Mar 12, 2026Updated 2 weeks ago