unifyai / aibench-llm-endpoints
Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub
☆17Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for aibench-llm-endpoints
- Benchmarking tool for assessing LLM models' performance across different hardwares☆13Updated 11 months ago
- Deploy and Scale LLM-based applications☆26Updated last year
- A desktop for AI agents☆28Updated this week
- RepoGPT: AI-powered GitHub assistant to chat, manage, and explore your repos effortlessly.☆46Updated 3 weeks ago
- LLM plugin for models hosted by OpenRouter☆68Updated 6 months ago
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g…☆20Updated this week
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆23Updated 5 months ago
- Repository hosting Langchain helm charts.☆40Updated this week
- Self-host LLMs with vLLM and BentoML☆72Updated this week
- ☆14Updated 3 weeks ago
- Run Structured LLM Inference with Easy Parallelism☆15Updated 3 months ago
- Embed anything.☆29Updated 5 months ago
- LLM model runway server☆12Updated last year
- Creating Generative AI Apps which work☆16Updated 4 months ago
- 🐝 Create powerful, collaborative AI applications.☆37Updated this week
- ☆45Updated 3 weeks ago
- Helm charts to deploy Weaviate to k8s☆50Updated this week
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆44Updated last month
- A visual tool to interpret and understand PyTorch machine learning models☆15Updated 8 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆25Updated last year
- ☆110Updated this week
- ☆37Updated 2 months ago
- Run code-llama with 50k tokens using flash attention and better transformer☆12Updated 11 months ago
- Quickly and securely turn any Linux box into a build and deployment assistant☆25Updated last month
- An open source collection of agentic Github workflows☆12Updated 6 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆57Updated last month
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆24Updated 2 weeks ago
- Routing on Random Forest (RoRF)☆83Updated last month
- OpenAI compatible API for open source LLMs☆15Updated last year
- A new benchmark for measuring LLM's capability to detect bugs in large codebase.☆27Updated 5 months ago