[ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
☆406Mar 13, 2026Updated 3 months ago
Alternatives and similar repositories for C2C
Users that are interested in C2C are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>☆158Jan 14, 2026Updated 5 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆176Nov 3, 2025Updated 7 months ago
- THU实验课实验报告模板与数据处理工具整理☆19Dec 15, 2023Updated 2 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 5 months ago
- StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion☆30Aug 19, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆75Jan 13, 2026Updated 5 months ago
- Structural mirror consistency, observer perturbation, mirror break, and rupture auditing across supplied reflections, transformations, an…☆75Updated this week
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆90Sep 4, 2025Updated 9 months ago
- XRAY MCP provides progressive code intelligence and navigation capabilities for AI assistants through structural code analysis using as…☆50Dec 11, 2025Updated 6 months ago
- A red teaming agent☆20Oct 15, 2025Updated 8 months ago
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆48Apr 17, 2026Updated 2 months ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆91Apr 7, 2026Updated 2 months ago
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆25Jun 17, 2025Updated last year
- ☆26Jul 11, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Implementation of APB (ACL 2025 main Oral) and Spava (ACL 2026 main).☆37Apr 6, 2026Updated 2 months ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆21Jul 5, 2024Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆50Nov 27, 2024Updated last year
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 11 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆16Mar 15, 2025Updated last year
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆54Jan 9, 2026Updated 5 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 6 months ago
- AI debugger and AI coder integrated. Use AI to code and drives runtime debugger☆85Mar 17, 2026Updated 3 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems☆991Jun 6, 2026Updated last week
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆44Mar 31, 2025Updated last year
- ☆18Oct 17, 2025Updated 8 months ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆18Jul 21, 2025Updated 10 months ago
- 清华大学电子工程系数字逻辑与处理器基础实验大作业——流水线 CPU☆12Aug 8, 2021Updated 4 years ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 8 months ago
- This is some extra code for the "Advanced Build With AI" SDK tutorial I gave remotely on May 17.☆13May 17, 2025Updated last year
- PathwaysJob API is an OSS Kubernetes-native API, to deploy ML training and batch inference workloads, using Pathways on GKE.☆21Oct 22, 2025Updated 7 months ago
- LLM-based conversational Telegram bot☆13Mar 29, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆37Mar 9, 2026Updated 3 months ago
- AI-powered Knowledge Base UI Kit built on Memvid - the video-based memory format for AI.☆25Jan 5, 2026Updated 5 months ago
- 清华大学电子系部分课程整理☆16Sep 10, 2022Updated 3 years ago
- Automatically Block Malicious IPs with Cloudflare Protect your WordPress site from hackers and brute-force attacks. This free plugin auto…☆23Nov 15, 2025Updated 7 months ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆77Apr 27, 2026Updated last month
- PyTorch model of OpenFace☆12May 8, 2017Updated 9 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 7 months ago