Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
☆33Nov 1, 2025Updated 4 months ago
Alternatives and similar repositories for SPA
Users that are interested in SPA are comparing it to the libraries listed below
Sorting:
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Aug 18, 2024Updated last year
- ☆13Oct 5, 2025Updated 4 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- AAAI2025☆11Apr 18, 2025Updated 10 months ago
- ☆11Oct 25, 2024Updated last year
- A multi-agent framework to help with your homework.☆10Mar 1, 2025Updated last year
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆14Apr 29, 2025Updated 10 months ago
- ☆31Feb 3, 2026Updated 3 weeks ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated last year
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆18Nov 4, 2025Updated 3 months ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- ☆11Jan 10, 2020Updated 6 years ago
- Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.☆11Mar 9, 2021Updated 4 years ago
- R1V, trained with AI feedback, answers open-ended visual questions.☆14Apr 12, 2025Updated 10 months ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆13Jun 25, 2025Updated 8 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- This repository contains the research project that enables the robot to automatically join a group based on the modeled personal, social …☆11Nov 4, 2018Updated 7 years ago
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models☆13Mar 6, 2025Updated 11 months ago
- Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks☆10Dec 16, 2019Updated 6 years ago
- A python package for causal inference in panels☆13Nov 11, 2025Updated 3 months ago
- ☆10Mar 19, 2024Updated last year
- Use PaliGemma to auto-label data for use in training fine-tuned vision models.☆12Jun 13, 2024Updated last year
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".☆10Sep 15, 2021Updated 4 years ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆22Oct 8, 2025Updated 4 months ago
- Code for generating a single image pretraining dataset☆13Aug 3, 2021Updated 4 years ago
- The Quick theme magically transforms your README.md into a GitHub Pages site, applying clean and visually appealing styles. The fastest a…☆22Nov 30, 2025Updated 3 months ago
- A simple Python package for deep learning using forward automatic differentiation based on JAX.☆14Aug 17, 2022Updated 3 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated last year
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated 11 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year