thu-nics / C2CLinks
The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
☆265Updated 3 weeks ago
Alternatives and similar repositories for C2C
Users that are interested in C2C are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆166Updated 3 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆506Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆255Updated last month
- ☆300Updated 3 months ago
- Lego for GRPO☆30Updated 6 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 3 months ago
- ☆86Updated last year
- ☆107Updated 3 weeks ago
- ☆62Updated 4 months ago
- Marketplace ML experiment - training without backprop☆27Updated 2 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆64Updated last week
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆58Updated 11 months ago
- ☆49Updated 3 months ago
- ☆104Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 8 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆479Updated 3 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆268Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆302Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆106Updated this week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆236Updated 2 weeks ago
- ☆127Updated 2 months ago
- ☆68Updated 6 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆349Updated 5 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆449Updated 3 months ago
- look how they massacred my boy☆63Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 10 months ago
- GRadient-INformed MoE☆264Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 8 months ago
- ☆289Updated 3 weeks ago