Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
☆115Jun 6, 2026Updated this week
Alternatives and similar repositories for ContextPilot
Users that are interested in ContextPilot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- ☆14Oct 7, 2023Updated 2 years ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆60Mar 13, 2026Updated 2 months ago
- ☆14Jul 17, 2022Updated 3 years ago
- Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting".☆13Feb 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GeoViz: A Multi-View Visual Platform for Spatio-temporal Knowledge Graph☆13May 13, 2024Updated 2 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated 2 years ago
- A system that turns jailbreak papers into runnable attacks and benchmarks — live, as research evolves.☆147Jun 1, 2026Updated last week
- Accelerated in CUDA☆11Oct 28, 2022Updated 3 years ago
- Python SDK for Higgsfield API☆53Nov 17, 2025Updated 6 months ago
- High-performance React virtualized diff viewer for large code / text comparison☆133Apr 28, 2026Updated last month
- Pie: Programmable LLM Serving☆172Updated this week
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated last year
- This repository provides the implementation of the ICLR 2025 Multi-View Permutation of Variational Auto-Encoders (MVP) method for handlin…☆31Feb 21, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- 一套可移植的 AI 辅助开发框架。从项目代码中提取架构知识,生成结构化的 Skill 体系,并通过执行记录自动积累经验,让 AI 助手从第一天就深度理解你的项目,且越用越好。☆109May 27, 2026Updated 2 weeks ago
- ☆31Mar 23, 2024Updated 2 years ago
- 一些有趣的页面,使用 Github Pages 和 Vercel 部署☆13Feb 8, 2024Updated 2 years ago
- ☆53Dec 17, 2025Updated 5 months ago
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆128Feb 6, 2026Updated 4 months ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- 基于知识图谱的问答系统☆13Jun 30, 2024Updated last year
- Resumable, traceable AI coding — decisions and history stay with the project☆186Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Updated this week
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated last month
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆18May 14, 2022Updated 4 years ago
- ☆19Dec 31, 2025Updated 5 months ago
- Handwritten digit recognition implemented in c++ without libraries☆11Jan 30, 2024Updated 2 years ago
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆44Jan 7, 2025Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- A practical guide to building AI products end-to-end. You'll learn how to choose models based on product constraints, decide when fine-t…☆100Apr 27, 2026Updated last month
- Search job and match☆152Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A comprehensive paper list of Table-based Question Answering.☆40Sep 1, 2023Updated 2 years ago
- Converge AI is an autonomous CLI tool designed to solve "rebase hell" for enterprise teams maintaining long-lived, customized forks of op…☆464Apr 6, 2026Updated 2 months ago
- Run AI coding agents safely in Docker containers. Your code stays isolated - no surprises on your host machine.☆48Feb 11, 2026Updated 3 months ago
- Polaris 是一个事务驱动的 AI 软件工厂内核。它不是聊天式编程助手——而是将 LLM 降级为受限决策组件,由系统内核统一接管执行、审计、预算与回滚,实现企业级的无人值守、可追责、可回滚软件交付流水线。内置 PM / Architect / Director / QA…☆103Updated this week
- A distributed GPU-centric experience replay system for large AI models.☆19Aug 1, 2023Updated 2 years ago
- ☆89Oct 6, 2023Updated 2 years ago
- 1st Multilingual Benchmark for Repository-Level E2E Microservice Generation☆97Apr 20, 2026Updated last month