shiqichen17/SPA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shiqichen17/SPA)

shiqichen17 / SPA

Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"

☆33

Alternatives and similar repositories for SPA

Users that are interested in SPA are comparing it to the libraries listed below

Sorting:

yuzhaouoe / pretraining-data-packing
View on GitHub
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆23Aug 18, 2024Updated last year
ordavid-s / snmf-mlp-decomposition
View on GitHub
☆13Oct 5, 2025Updated 4 months ago
yoavgur / PISCES
View on GitHub
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆12May 30, 2025Updated 9 months ago
keikeiqi / MGTTA
View on GitHub
AAAI2025
☆11Apr 18, 2025Updated 10 months ago
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
zhiyuan-zhang0206 / HomeworkAgent
View on GitHub
A multi-agent framework to help with your homework.
☆10Mar 1, 2025Updated last year
langtech-bsc / mt-evaluation
View on GitHub
A framework for evaluating Machine Translation models.
☆12May 26, 2025Updated 9 months ago
DiLi-Lab / ScanDL
View on GitHub
☆14Apr 29, 2025Updated 10 months ago
Embodied-Reasoning-Agent / Embodied-Reasoning-Agent
View on GitHub
☆31Feb 3, 2026Updated 3 weeks ago
DFKI-NLP / LLMCheckup
View on GitHub
Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…
☆13Mar 24, 2024Updated last year
HongyangLL / M3-JEPA
View on GitHub
[ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture
☆18Nov 4, 2025Updated 3 months ago
mxzheng / TrojViT
View on GitHub
[CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang
☆14Jan 5, 2024Updated 2 years ago
zdou0830 / MetaNLP
View on GitHub
☆11Jan 10, 2020Updated 6 years ago
ZiangYan / pda.pytorch
View on GitHub
Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.
☆11Mar 9, 2021Updated 4 years ago
Exgc / R1V-Free
View on GitHub
R1V, trained with AI feedback, answers open-ended visual questions.
☆14Apr 12, 2025Updated 10 months ago
LanD-FBK / benchmark-gen-explanations
View on GitHub
Codes for "Benchmarking the Generation of Fact Checking Explanations"
☆10Aug 16, 2024Updated last year
SongW-SW / CEB
View on GitHub
☆13Jun 25, 2025Updated 8 months ago
tanganke / subspace_fusion
View on GitHub
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14Mar 28, 2024Updated last year
philschmid / multilingual-serverless-qa-aws-lambda
View on GitHub
☆10Dec 17, 2020Updated 5 years ago
yoichi1484 / subspace
View on GitHub
An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)
☆10May 31, 2024Updated last year
SakanaAI / TransEvalnia
View on GitHub
Reasoning-based Evaluation and Ranking of Translations.
☆19Jul 18, 2025Updated 7 months ago
usr-lab / pepper-social
View on GitHub
This repository contains the research project that enables the robot to automatically join a group based on the modeled personal, social …
☆11Nov 4, 2018Updated 7 years ago
hbseong97 / HarmAug
View on GitHub
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
☆13Mar 6, 2025Updated 11 months ago
ZiangYan / subspace-attack.pytorch
View on GitHub
Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks
☆10Dec 16, 2019Updated 6 years ago
TianyiPeng / causaltensor
View on GitHub
A python package for causal inference in panels
☆13Nov 11, 2025Updated 3 months ago
THUKElab / LatEval
View on GitHub
☆10Mar 19, 2024Updated last year
autodistill / autodistill-paligemma
View on GitHub
Use PaliGemma to auto-label data for use in training fine-tuned vision models.
☆12Jun 13, 2024Updated last year
ZrW00 / MuScleLoRA
View on GitHub
The code implementation of MuScleLoRA (Accepted in ACL 2024)
☆10Dec 1, 2024Updated last year
tanganke / pareto_set_learning
View on GitHub
Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"
☆11Sep 13, 2024Updated last year
facebookresearch / secure-paper-bidding
View on GitHub
Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".
☆10Sep 15, 2021Updated 4 years ago
MasterXiong / Hyper-VLA
View on GitHub
Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"
☆22Oct 8, 2025Updated 4 months ago
yukimasano / single_img_pretraining
View on GitHub
Code for generating a single image pretraining dataset
☆13Aug 3, 2021Updated 4 years ago
devich / quick
View on GitHub
The Quick theme magically transforms your README.md into a GitHub Pages site, applying clean and visually appealing styles. The fastest a…
☆22Nov 30, 2025Updated 3 months ago
DSSC-projects / veni
View on GitHub
A simple Python package for deep learning using forward automatic differentiation based on JAX.
☆14Aug 17, 2022Updated 3 years ago
xuanlinli17 / autoregressive_inference
View on GitHub
Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)
☆12Mar 7, 2024Updated last year
arnab-api / romba
View on GitHub
Applies ROME and MEMIT on Mamba-S4 models
☆14Apr 5, 2024Updated last year
samiraabnar / Reflect
View on GitHub
Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"
☆15Jun 3, 2020Updated 5 years ago
ritikamangla / QSalience
View on GitHub
https://arxiv.org/abs/2404.10917
☆14Mar 18, 2025Updated 11 months ago
DanielSc4 / Dynamic-Activation-Composition
View on GitHub
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year