bekkermans / llm-api-serverLinks
A Ray-based LLM server compatible with OpenAI API
☆12Updated last year
Alternatives and similar repositories for llm-api-server
Users that are interested in llm-api-server are comparing it to the libraries listed below
Sorting:
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated last year
- Framework for processing and filtering datasets☆31Updated last year
- Evalica, your favourite evaluation toolkit☆62Updated this week
- Tools and agents for automated research.☆48Updated 2 months ago
- A database-like benchmark of feature generation from time-series data☆13Updated last year
- ☆31Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Updated last year
- Effective LLM Alignment Toolkit☆152Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- Utilities for monitoring and interacting with Jupyter Notebooks☆38Updated 3 months ago
- Top ML papers of the week.☆45Updated this week
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 2 months ago
- Augmentex — a library for augmenting texts with errors☆69Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- ☆80Updated last year
- Slides and info for girafe-ai Journal Club☆22Updated 2 years ago
- Automatic Prompt Optimization Framework☆171Updated this week
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆91Updated 7 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆58Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆11Updated last year
- ☆22Updated 2 years ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Universal text classifier for generative models☆24Updated last year
- LLM application tracing based on OpenTelemetry☆16Updated 2 months ago
- ☆53Updated 4 months ago