a novel architecture that leverages Autoencoders to superimpose the hidden representations of a base model and a fine-tuned model within a shared parameter space. Using B-spline-based blending coefficients and autoencoders that adaptively reconstruct the original hidden states based on the input data distribution.
☆79Aug 1, 2025Updated 8 months ago
Alternatives and similar repositories for Superposition-Transformer
Users that are interested in Superposition-Transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A sophisticated trading system leveraging local LLM deployment through Ollama, distributed computing with Apache Spark, and vector-based …☆16Feb 3, 2025Updated last year
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 7 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 4 months ago
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆47Sep 26, 2025Updated 6 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models☆32Oct 6, 2025Updated 6 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 6 months ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆51Mar 7, 2026Updated last month
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 3 months ago
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 9 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 10 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 6 months ago
- Latex template for CUHK PhD Thesis☆12Jun 29, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jul 23, 2024Updated last year
- ☆11Feb 16, 2025Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Gaussian Splatting for Robotic Simulation☆23Nov 7, 2025Updated 5 months ago
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆39Sep 26, 2025Updated 6 months ago
- CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]☆23Jun 12, 2025Updated 10 months ago
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 4 months ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆12Apr 29, 2024Updated last year
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- "Integrating Topics and Syntax" T.L. Griffiths et al. Python implementation of HMM-LDA☆16Sep 24, 2015Updated 10 years ago
- Sotopia-RL: Reward Design for Social Intelligence☆49Apr 1, 2026Updated 2 weeks ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- ☆33Jan 30, 2026Updated 2 months ago
- Core ML Demos is an experimental Core ML app. It visualizes the inference results of ML models and can be used to benchmark ML models and…☆12Jan 8, 2026Updated 3 months ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- EMO-SUPERB submission☆51Oct 13, 2025Updated 6 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆39Jun 4, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆40Jan 14, 2025Updated last year
- ☆17Jul 20, 2025Updated 8 months ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 为开发者提供CMS、dotNET、FreeKit 相关文档☆13Oct 30, 2025Updated 5 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆47Jun 12, 2025Updated 10 months ago
- UW DigiPsych Prosody Feature Extraction Repository☆13May 16, 2019Updated 6 years ago