Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
☆100May 14, 2026Updated last week
Alternatives and similar repositories for ContextPilot
Users who are interested in ContextPilot are comparing it to the libraries listed below.
- LLM-powered Python☆15Apr 8, 2026Updated last month
- An intelligent tuner for vLLM that automatically monitors GPU metrics, uses Bayesian optimization to tune parameters☆64Mar 12, 2026Updated 2 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- ☆14Oct 7, 2023Updated 2 years ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆58Mar 13, 2026Updated 2 months ago
- 《Machine Learning Systems: Design and Implementation》- English Version☆39Jan 27, 2025Updated last year
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models☆68Apr 11, 2026Updated last month
- Here is the repo for public scripts.☆12Jul 16, 2022Updated 3 years ago
- GeoViz: A Multi-View Visual Platform for Spatio-temporal Knowledge Graph☆13May 13, 2024Updated 2 years ago
- A reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources☆12May 21, 2024Updated 2 years ago
- Accelerated in CUDA☆11Oct 28, 2022Updated 3 years ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated last year
- ☆28May 13, 2025Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- ☆31Mar 23, 2024Updated 2 years ago
- Some fun pages, deployed with GitHub Pages and Vercel☆13Feb 8, 2024Updated 2 years ago
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆122Feb 6, 2026Updated 3 months ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- A question-answering system based on a knowledge graph☆13Jun 30, 2024Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated last month
- A compiler for the Tiny language☆11Sep 2, 2023Updated 2 years ago
- This was a hack that is currently in a broken state☆16Dec 23, 2020Updated 5 years ago
- A lightweight, continuously-updated catalog of research papers on AI agents.☆29Oct 13, 2025Updated 7 months ago
- Extracting LaTeX equations from PDF☆21Sep 14, 2023Updated 2 years ago
- the official code of DriveMonkey☆45Mar 20, 2026Updated 2 months ago
- ☆19Dec 31, 2025Updated 4 months ago
- Handwritten digit recognition implemented in c++ without libraries☆11Jan 30, 2024Updated 2 years ago
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆42Jan 7, 2025Updated last year
- Agentkube - Run Kubernetes Like Never Before☆38Mar 1, 2026Updated 2 months ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- ☆12Jul 25, 2024Updated last year
- A comprehensive paper list of Table-based Question Answering.☆39Sep 1, 2023Updated 2 years ago
- Reinforcement Learning Framework for Visual Generation☆112Feb 13, 2026Updated 3 months ago
- [NeurIPS 2024] Official PyTorch implementation of All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation☆23Jul 8, 2025Updated 10 months ago
- This notebook illustrates the use of the Google Maps API to determine the optimum route given a list of addresses☆11Nov 26, 2018Updated 7 years ago
- Run AI coding agents safely in Docker containers. Your code stays isolated - no surprises on your host machine.☆47Feb 11, 2026Updated 3 months ago
- ☆15Jul 17, 2025Updated 10 months ago
- A lightweight, user-friendly data-plane for LLM training.☆39Sep 10, 2025Updated 8 months ago
- AI chaos reasoning persona☆33May 14, 2026Updated last week