Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
☆80 Apr 21, 2026 Updated last week
Alternatives and similar repositories for ContextPilot
Users that are interested in ContextPilot are comparing it to the libraries listed below.
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 Feb 6, 2025 Updated last year
- ☆14 Oct 7, 2023 Updated 2 years ago
- A proxy server that intercepts Anthropic API requests and converts them to OpenAI-compatible format, enabling integration with dozens of … ☆42 Apr 16, 2026 Updated 2 weeks ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation" ☆54 Mar 13, 2026 Updated last month
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models ☆63 Apr 11, 2026 Updated 2 weeks ago
- 《Machine Learning Systems: Design and Implementation》- English Version ☆39 Jan 27, 2025 Updated last year
- Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting". ☆13 Feb 22, 2024 Updated 2 years ago
- ☆20 Nov 3, 2025 Updated 5 months ago
- Here is the repo for public scripts. ☆12 Jul 16, 2022 Updated 3 years ago
- Accelerated in CUDA ☆11 Oct 28, 2022 Updated 3 years ago
- Pie: Programmable LLM Serving ☆149 Updated this week
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale ☆13 May 17, 2025 Updated 11 months ago
- ☆31 Mar 23, 2024 Updated 2 years ago
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions". ☆122 Feb 6, 2026 Updated 2 months ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning" ☆17 Aug 4, 2020 Updated 5 years ago
- ☆14 Apr 16, 2026 Updated 2 weeks ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization ☆23 Apr 13, 2026 Updated 2 weeks ago
- the official code of DriveMonkey ☆45 Mar 20, 2026 Updated last month
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21 ☆17 May 14, 2022 Updated 3 years ago
- Handwritten digit recognition implemented in C++ without libraries ☆11 Jan 30, 2024 Updated 2 years ago
- GAOKAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models. ☆41 Jan 7, 2025 Updated last year
- Agentkube - Run Kubernetes Like Never Before ☆38 Mar 1, 2026 Updated 2 months ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading" [MobiCom'2022] ☆19 Aug 4, 2022 Updated 3 years ago
- A comprehensive paper list of Table-based Question Answering. ☆38 Sep 1, 2023 Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆24 Aug 25, 2023 Updated 2 years ago
- Run AI coding agents safely in Docker containers. Your code stays isolated - no surprises on your host machine. ☆45 Feb 11, 2026 Updated 2 months ago
- A distributed GPU-centric experience replay system for large AI models. ☆19 Aug 1, 2023 Updated 2 years ago
- AI chaos reasoning persona ☆32 Updated this week
- Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch ☆26 Aug 18, 2023 Updated 2 years ago
- Sparse kernels for GNNs based on TVM ☆17 Nov 18, 2020 Updated 5 years ago
- ☆88 Mar 4, 2026 Updated last month
- Discourse API Documentation ☆44 Updated this week
- MorphCharts is a JavaScript visualization library for creating rich, immersive, and engaging 2D and 3D data visualizations ☆83 Apr 21, 2026 Updated last week
- ☆36 Feb 12, 2025 Updated last year
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training ☆31 May 2, 2025 Updated 11 months ago
- Serverless LLM Serving for Everyone. ☆675 Mar 6, 2026 Updated last month
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition" ☆26 Apr 27, 2022 Updated 4 years ago
- LangGraph Agent Development System. Build production-ready AI agents through natural conversation with Claude Code. ☆11 Jun 29, 2025 Updated 10 months ago
- Marketplace ML experiment - training without backprop ☆27 Sep 9, 2025 Updated 7 months ago