thu-nics / C2C
The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
☆87 Updated last week
Alternatives and similar repositories for C2C
Users who are interested in C2C compare it to the repositories listed below.
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆163 Updated 2 months ago
- Enhancing LLMs with LoRA ☆173 Updated 2 weeks ago
- ☆300 Updated 3 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆264 Updated 3 weeks ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆102 Updated this week
- Marketplace ML experiment - training without backprop ☆27 Updated last month
- Verifiers for LLM Reinforcement Learning ☆77 Updated last month
- Lego for GRPO ☆30 Updated 5 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆57 Updated 10 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆228 Updated 3 weeks ago
- ☆158 Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆107 Updated 8 months ago
- ☆107 Updated last week
- ☆86 Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆274 Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆58 Updated 3 weeks ago
- look how they massacred my boy ☆63 Updated last year
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents. ☆195 Updated last week
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆80 Updated 7 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆268 Updated 3 weeks ago
- ☆68 Updated 5 months ago
- ☆62 Updated 3 months ago
- ☆15 Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated last year
- ☆18 Updated 11 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆97 Updated 5 months ago
- The DPAB-α Benchmark ☆30 Updated 9 months ago
- ☆79 Updated last month
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆196 Updated 2 months ago