vllm-project / ci-infraLinks
This repo hosts code for vLLM CI & Performance Benchmark infrastructure.
☆15Updated this week
Alternatives and similar repositories for ci-infra
Users that are interested in ci-infra are comparing it to the libraries listed below
Sorting:
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆24Updated 5 months ago
- GPU Environment Management for Visual Studio Code☆39Updated 2 years ago
- GPT4 based personalized ArXiv paper assistant bot☆10Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Updated last year
- CrewAI AgentOps: Monitor your AI Agents☆18Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆18Updated last week
- Public reports detailing responses to sets of prompts by Large Language Models.☆31Updated 7 months ago
- ☆11Updated 2 years ago
- Detecting Drift in a Diabetes Dataset using Taipy☆12Updated 3 months ago
- Open Source AI with Granite and Granite Code☆22Updated 3 weeks ago
- Feel the Vibes☆13Updated 6 months ago
- Cookiecutter for community-maintained Jupyter Docker images☆15Updated last week
- An autonomous agent to automate your code review workflow made using crewAI☆16Updated last year
- ☆20Updated last year
- TLS & API keys for your LLM APIs☆18Updated 8 months ago
- Deploy and Scale LLM-based applications☆26Updated 2 years ago
- examples and guides to using Nomic Atlas☆39Updated 4 months ago
- Example using OpenTelemetry to instrument a FastAPI / LangGraph / Langchain application☆11Updated 9 months ago
- Rats is a collection of tools to help researchers define and run experiments. It is designed to be a modular and extensible framework cur…☆25Updated this week
- Large Language Model Hosting Container☆91Updated 2 weeks ago
- Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Example for Logging LLM Evaluator Prompt Responses☆18Updated 2 years ago
- Super performant RAG pipeline for AI apps.☆17Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆23Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆18Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- This repository contains a toy implementation of a basic RAQA system.☆20Updated last year
- LLM application tracing based on OpenTelemetry☆13Updated 2 weeks ago
- Skill for free. Just fork it.☆15Updated 3 years ago