friendliai / friendli-client
[⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI
☆48 Updated 2 weeks ago
Alternatives and similar repositories for friendli-client
Users who are interested in friendli-client are comparing it to the libraries listed below.
- FMO (Friendli Model Optimizer) ☆12 Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆24 Updated 7 months ago
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- vLLM adapter for a TGIS-compatible gRPC server. ☆33 Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆61 Updated 2 months ago
- Self-host LLMs with LMDeploy and BentoML ☆20 Updated last week
- A collection of all available inference solutions for the LLMs ☆91 Updated 4 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆87 Updated this week
- ☆62 Updated 3 months ago
- Self-host LLMs with vLLM and BentoML ☆133 Updated last week
- Google TPU optimizations for transformers models ☆114 Updated 5 months ago
- ☆52 Updated last year
- Website with current metrics on the fastest AI models. ☆41 Updated 8 months ago
- Sentence Embedding as a Service ☆15 Updated 2 weeks ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆139 Updated this week
- manage histories of LLM applied applications ☆91 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- ☆22 Updated 3 months ago
- AI Agent who manages your Jira project ☆18 Updated last year
- ☆23 Updated 5 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆13 Updated 2 weeks ago
- Simple LLM inference server ☆20 Updated last year
- AI-based search done right ☆18 Updated this week
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆29 Updated last week
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. ☆68 Updated last year
- ☆46 Updated last month
- List of popular open-source models deployed on AWS using tensorfuse ☆28 Updated 3 months ago
- ☆25 Updated this week
- Web Interface for Vision Language Models Including InternVLM2 ☆22 Updated 11 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ☆94 Updated 3 weeks ago