☆207 · Dec 23, 2025 · Updated 2 months ago
Alternatives and similar repositories for Learning_dynamics_LLM
Users that are interested in Learning_dynamics_LLM are comparing it to the libraries listed below
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024. ☆15 · Jun 4, 2024 · Updated last year
- Code for CVPR 2024 Oral "Neural Lineage" ☆17 · Jun 18, 2024 · Updated last year
- DreamGaussian with 2D-GS ☆12 · Oct 10, 2024 · Updated last year
- NeurIPS'24 - LLM Safety Landscape ☆39 · Oct 21, 2025 · Updated 4 months ago
- ☆44 · Oct 1, 2024 · Updated last year
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers ☆14 · Feb 7, 2025 · Updated last year
- AbstainQA, ACL 2024 ☆28 · Feb 4, 2026 · Updated 3 weeks ago
- ☆18 · Jun 3, 2024 · Updated last year
- ☆19 · Mar 25, 2025 · Updated 11 months ago
- Improving Alignment and Robustness with Circuit Breakers ☆258 · Sep 24, 2024 · Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆418 · Oct 4, 2025 · Updated 4 months ago
- ☆331 · May 31, 2025 · Updated 9 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep ☆174 · Apr 23, 2025 · Updated 10 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆39 · Jul 18, 2025 · Updated 7 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning ☆512 · Oct 20, 2024 · Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling". ☆11 · Jun 14, 2023 · Updated 2 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆144 · Mar 14, 2024 · Updated last year
- [ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, co… ☆35 · Sep 9, 2025 · Updated 5 months ago
- ☆18 · May 3, 2025 · Updated 9 months ago
- [NeurIPS 2024] Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method ☆13 · Oct 1, 2024 · Updated last year
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms ☆11 · Jan 14, 2025 · Updated last year
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym" ☆23 · Oct 14, 2025 · Updated 4 months ago
- ☆10 · Oct 20, 2023 · Updated 2 years ago
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.… ☆28 · Sep 25, 2024 · Updated last year
- ☆32 · May 30, 2025 · Updated 9 months ago
- On Memorization of Large Language Models in Logical Reasoning ☆74 · Mar 29, 2025 · Updated 11 months ago
- A repo for open research on building large reasoning models ☆140 · Feb 18, 2026 · Updated last week
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆112 · Jun 8, 2023 · Updated 2 years ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆998 · Feb 23, 2026 · Updated last week
- This is the official repository for paper: cross-modal information flow in multimodal large language models ☆41 · May 21, 2025 · Updated 9 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆120 · Dec 10, 2024 · Updated last year
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces… ☆84 · Jan 12, 2025 · Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. ☆13 · Jan 9, 2024 · Updated 2 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm" ☆11 · Feb 20, 2025 · Updated last year
- Code for Representation Bending Paper ☆16 · Jul 15, 2025 · Updated 7 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data ☆13 · Jul 21, 2024 · Updated last year
- ☆32 · Oct 4, 2025 · Updated 4 months ago