LLM Context Manager for inference optimization
☆25Jul 28, 2025Updated 9 months ago
Alternatives and similar repositories for LLM-Context-Manager
Users that are interested in LLM-Context-Manager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Production-ready RAG framework for Python — multi-tenant chatbots with streaming, tool calling, agent mode (LangGraph), vector search (FA…☆17Apr 29, 2026Updated last week
- A modular backend framework for building AI chat applications powered by large language models (LLMs)☆37Nov 22, 2025Updated 5 months ago
- StrongSort-Pip: Packaged version of StrongSort☆10Sep 3, 2022Updated 3 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- Atom SQ controller extension for Bitwig☆15Jun 21, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Jul 5, 2024Updated last year
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface☆22Apr 15, 2026Updated 2 weeks ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 6 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- AI model Prompt Tester (AIPT for short) is a simple app that will check how suitable each model is for a given prompt.☆15Jul 7, 2024Updated last year
- The rag pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 6 months ago
- ☆19Oct 18, 2025Updated 6 months ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Jun 4, 2025Updated 11 months ago
- Bitwig Extension to use the midi controller "Midi Fighter Twister" with the DAW Bitwig☆16Apr 6, 2025Updated last year
- Evaluating LLMs performance in PR reviews as an indicator for their capability in creating PRs.☆13Apr 10, 2024Updated 2 years ago
- A text analysis library for relevance and subtheme detection☆16Mar 20, 2026Updated last month
- ☆16Dec 16, 2024Updated last year
- IngestRSS is an AWS-based RSS feed processing system that automatically fetches, processes, and stores articles from specified RSS feeds.…☆17Dec 22, 2024Updated last year
- A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad☆16Sep 2, 2024Updated last year
- ☆17Jun 22, 2024Updated last year
- An automated data pipeline scaling RL to pretraining levels☆76Oct 11, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A framework for creating message-driven training systems with PyTorch☆21Oct 7, 2025Updated 6 months ago
- ☆13Jan 28, 2026Updated 3 months ago
- Conductor is a Gemini CLI extension that allows you to specify, plan, and implement software features.☆50Mar 19, 2026Updated last month
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆16Jun 16, 2024Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆29Feb 20, 2026Updated 2 months ago
- A Max4Live controller for the essential functions of the ASM Hydrasynth☆14Mar 30, 2022Updated 4 years ago
- Powered by Pydantic v2, the core library for FHIR☆18Updated this week
- Tool to automatically generate text descriptions for images using Ollama vision models (LLaVA, Qwen3-VL, Llama Vision)☆32Dec 14, 2025Updated 4 months ago
- Using Ollama to invoke functions through use of runtime plugins☆15Jun 8, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An interactive web app to visualize and explore data structures and algorithms. Users can perform operations like insertion, deletion, an…☆22Apr 14, 2025Updated last year
- Persistent caching for Python functions☆18Dec 10, 2025Updated 4 months ago
- rudradb-opin-examples is for example implementations of the pip install rudradb-opin☆29Mar 3, 2026Updated 2 months ago
- Docs‑focused crawler that converts documentation sites to clean Markdown.☆44Mar 21, 2026Updated last month
- Bring your code and propmpts easily to your LLM☆21Jun 10, 2025Updated 10 months ago
- A java client for Ollama☆25Mar 24, 2025Updated last year
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla…☆36Jul 21, 2025Updated 9 months ago