Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
☆118 · Jun 24, 2026 · Updated this week
Alternatives and similar repositories for ContextPilot
Users who are interested in ContextPilot are comparing it to the libraries listed below.
- LLM-powered Python ☆15 · Apr 8, 2026 · Updated 2 months ago
- An intelligent tuner for vLLM that automatically monitors GPU metrics and uses Bayesian optimization to tune parameters ☆65 · Mar 12, 2026 · Updated 3 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 · Feb 6, 2025 · Updated last year
- ☆14 · Oct 7, 2023 · Updated 2 years ago
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models ☆69 · Apr 11, 2026 · Updated 2 months ago
- Here is the repo for public scripts. ☆12 · Jul 16, 2022 · Updated 3 years ago
- GeoViz: A Multi-View Visual Platform for Spatio-temporal Knowledge Graph ☆13 · May 13, 2024 · Updated 2 years ago
- A reproduction of the libsmctrl paper, with an added Python interface that lets compute resources be allocated flexibly from Python ☆12 · May 21, 2024 · Updated 2 years ago
- Pie: Programmable LLM Serving ☆178 · Updated this week
- ☆35 · Nov 3, 2025 · Updated 7 months ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale ☆13 · May 17, 2025 · Updated last year
- ☆29 · May 13, 2025 · Updated last year
- ☆31 · Mar 23, 2024 · Updated 2 years ago
- A few fun pages, deployed with GitHub Pages and Vercel ☆14 · Feb 8, 2024 · Updated 2 years ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning" ☆17 · Aug 4, 2020 · Updated 5 years ago
- A question answering system based on a knowledge graph ☆13 · Jun 30, 2024 · Updated 2 years ago
- A Model Context Protocol (MCP) server that provides enhanced file operation capabilities with streaming, patching, and change tracking su… ☆21 · Jul 11, 2025 · Updated 11 months ago
- ☆14 · Jun 8, 2026 · Updated 3 weeks ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization ☆24 · Apr 13, 2026 · Updated 2 months ago
- Fooocus with support for the Taiyi-Diffusion-XL model ☆20 · Apr 27, 2024 · Updated 2 years ago
- Extracting LaTeX equations from PDF ☆21 · Sep 14, 2023 · Updated 2 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21 ☆18 · May 14, 2022 · Updated 4 years ago
- GAOKAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models. ☆46 · Jan 7, 2025 · Updated last year
- Handwritten digit recognition implemented in C++ without libraries ☆11 · Jan 30, 2024 · Updated 2 years ago
- Agentkube - Run Kubernetes Like Never Before ☆38 · Mar 1, 2026 · Updated 4 months ago
- Convert any Repo into an RL Environment ☆419 · Jun 18, 2026 · Updated last week
- [NeurIPS 2024] Official PyTorch implementation of All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation ☆23 · Jul 8, 2025 · Updated 11 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆24 · Aug 25, 2023 · Updated 2 years ago
- A distributed GPU-centric experience replay system for large AI models. ☆19 · Aug 1, 2023 · Updated 2 years ago
- Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch ☆26 · Aug 18, 2023 · Updated 2 years ago
- Sparse kernels for GNNs based on TVM ☆17 · Nov 18, 2020 · Updated 5 years ago
- ☆21 · Jul 22, 2022 · Updated 3 years ago
- ☆37 · Feb 12, 2025 · Updated last year
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training ☆32 · May 2, 2025 · Updated last year
- Serverless LLM Serving for Everyone. ☆686 · May 4, 2026 · Updated last month
- Reproduction of the paper "PDFTriage: Question Answering over Long, Structured Documents" ☆42 · Jan 16, 2024 · Updated 2 years ago
- LangGraph Agent Development System. Build production-ready AI agents through natural conversation with Claude Code. ☆11 · Jun 29, 2025 · Updated last year
- Lightly-reviewed collection of community environments ☆237 · Jun 19, 2026 · Updated last week
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors" ☆43 · May 20, 2025 · Updated last year