fabienfrfr / tpttLinks
😊 TPTT: Transforming Pretrained Transformers into Titans
☆27Updated last week
Alternatives and similar repositories for tptt
Users that are interested in tptt are comparing it to the libraries listed below
Sorting:
- Resa: Transparent Reasoning Models via SAEs☆41Updated last week
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 7 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆52Updated 6 months ago
- Official implementation of ECCV24 paper: POA☆24Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆37Updated last year
- A repository for research on medium sized language models.☆78Updated last year
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆20Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated last month
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- ☆50Updated 3 months ago
- Lottery Ticket Adaptation☆39Updated 10 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 5 months ago
- ☆26Updated 2 weeks ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 5 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆98Updated last month
- ☆54Updated 3 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆19Updated this week
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆34Updated last week
- Official Repository for Task-Circuit Quantization☆24Updated 4 months ago
- Official repo of paper LM2☆44Updated 7 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆78Updated 9 months ago
- The official repo of continuous speculative decoding☆30Updated 6 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 6 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆31Updated last month
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆45Updated 2 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 3 years ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆42Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆18Updated 9 months ago