Inverse Scaling in Test-Time Compute
☆25Dec 3, 2025Updated 3 months ago
Alternatives and similar repositories for inverse-scaling-ttc
Users that are interested in inverse-scaling-ttc are comparing it to the libraries listed below
Sorting:
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆13Jan 7, 2025Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 2 months ago
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- LLM Can Get "Brain Rot"☆158Jan 9, 2026Updated 2 months ago
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆54Apr 6, 2025Updated 11 months ago
- CyberX-AI-Digital-Twin is an AI-powered cybersecurity platform that uses digital twin technology to simulate, detect, and analyze cyber t…☆14Feb 13, 2025Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆37Apr 17, 2023Updated 2 years ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆108May 18, 2025Updated 9 months ago
- ☆12Jan 20, 2024Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆11Jan 15, 2020Updated 6 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆23Oct 14, 2025Updated 4 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated last week
- Rendering code for ShapeNet models☆11Apr 20, 2017Updated 8 years ago
- ☆12Jun 18, 2024Updated last year
- ☆15Nov 22, 2023Updated 2 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- 010Editor-Crack version:13.0.1☆10Mar 18, 2024Updated last year
- 实验室找工作交流☆10Oct 16, 2015Updated 10 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Scripts for making Hadoop deployments in AWS easy☆10Feb 26, 2014Updated 12 years ago
- ⚡ Developer-friendly hybrid-RAG toolkit merging Graphiti, Qdrant, mem0, LlamaIndex, and LangChain into one powerful engine.☆15Jan 14, 2026Updated last month
- ☆11Jun 5, 2023Updated 2 years ago
- Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]☆10Mar 14, 2022Updated 3 years ago
- ☆14Jul 18, 2025Updated 7 months ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- The official baseline implementations for Chronocept☆10Dec 21, 2025Updated 2 months ago
- ☆11Dec 13, 2013Updated 12 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- This project leverages autogen multi agent framework along with Azure OpenAI Assistants API to automate data analysis and report generati…☆12Feb 5, 2025Updated last year
- A simple Docker sandbox example and a ready-to-use autograder API. Based on asynchronous FastAPI and disposable Docker containers. Three …☆14Jan 10, 2022Updated 4 years ago
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- https://interactivetraining.ai/☆17Oct 2, 2025Updated 5 months ago
- Reproduction of "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" for the Reproducibility challenge@NeurIPS…☆11Jan 14, 2020Updated 6 years ago
- Kernel-Enforced Install-Time Policies (KEIP): An eBPF/LSM based security tool that detects and blocks malicious network activity during p…☆32Feb 19, 2026Updated 2 weeks ago
- ☆10Apr 20, 2016Updated 9 years ago