☆42 · Feb 12, 2026 · Updated last month
Alternatives and similar repositories for TestTimeTrainingPapers
Users who are interested in TestTimeTrainingPapers are comparing it to the repositories listed below.
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data ☆14 · Jul 21, 2024 · Updated last year
- ☆47 · Apr 9, 2025 · Updated 11 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆15 · Feb 1, 2025 · Updated last year
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data ☆17 · May 17, 2023 · Updated 2 years ago
- Transfer Learning in Dialogue Benchmarking Toolkit ☆14 · Mar 31, 2023 · Updated 2 years ago
- A Gym for Agentic LLMs ☆467 · Jan 21, 2026 · Updated 2 months ago
- ☆31 · Mar 6, 2026 · Updated 2 weeks ago
- REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, … ☆62 · Feb 26, 2026 · Updated 3 weeks ago
- [MM'23] ProTegO: Protect Text Content against OCR Extraction Attack ☆14 · Mar 12, 2024 · Updated 2 years ago
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio". ☆66 · Dec 13, 2024 · Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization ☆21 · Feb 5, 2026 · Updated last month
- ☆51 · Jan 24, 2024 · Updated 2 years ago
- Code for Semantic Adversarial Attacks ☆11 · Oct 12, 2021 · Updated 4 years ago
- Learning from Indirect Observations ☆11 · Jul 16, 2021 · Updated 4 years ago
- In the context of Deep Learning: What is the right way to conduct example weighting? How do you understand loss functions and so-called … ☆10 · Mar 4, 2021 · Updated 5 years ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da… ☆21 · Mar 10, 2026 · Updated last week
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning ☆17 · May 7, 2025 · Updated 10 months ago
- A simple implementation of ReasonGenRM. ☆19 · Apr 21, 2025 · Updated 11 months ago
- Ideas for projects related to Tinker ☆174 · Nov 6, 2025 · Updated 4 months ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs ☆19 · Aug 3, 2024 · Updated last year
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron ☆30 · Apr 30, 2025 · Updated 10 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆19 · Nov 12, 2024 · Updated last year
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models". ☆24 · Mar 16, 2025 · Updated last year
- A simple PyTorch implementation of Learning Instance Activation Maps for Weakly Supervised Instance Segmentation, in CVPR 2019 ☆11 · Jun 18, 2020 · Updated 5 years ago
- Code for "Can We Characterize Tasks Without Labels or Features?" (CVPR 2021) ☆11 · Aug 31, 2021 · Updated 4 years ago
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 · Nov 28, 2024 · Updated last year
- Downloading and formatting YFCC100M dataset ☆13 · Sep 21, 2020 · Updated 5 years ago
- Exploiting Class Activation Value for Partial-Label Learning, ICLR 2022 (poster) ☆15 · Apr 18, 2022 · Updated 3 years ago
- A holistic framework for advancing LLMs as data science agents ☆39 · Feb 3, 2026 · Updated last month
- Implementation of Evo-Memory style learning for LLM agents. Agents learn from outcomes, refine strategies, and get smarter with every tas… ☆44 · Dec 3, 2025 · Updated 3 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ☆72 · Apr 2, 2025 · Updated 11 months ago
- Resources for the Enigmata Project. ☆80 · Aug 13, 2025 · Updated 7 months ago
- 【ICLR 2026 🔥】This work introduces MMEVOKE benchmark to reveal challenges in knowledge injection and explores potential solutions. ☆50 · Jun 11, 2025 · Updated 9 months ago
- Code of paper "AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception" ☆19 · Nov 26, 2023 · Updated 2 years ago
- ☆10 · Oct 20, 2023 · Updated 2 years ago
- ☆16 · Apr 26, 2021 · Updated 4 years ago
- Ludax is a domain-specific language for board games that automatically compiles into hardware-accelerated learning environments with the … ☆26 · Updated this week
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning ☆42 · Nov 11, 2025 · Updated 4 months ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749 ☆26 · Aug 16, 2020 · Updated 5 years ago