friendliai / friendli-client
Friendli: the fastest serving engine for generative AI
☆42Updated 3 weeks ago
Alternatives and similar repositories for friendli-client:
Users that are interested in friendli-client are comparing it to the libraries listed below
- FMO (Friendli Model Optimizer)☆12Updated last month
- ☆45Updated 5 months ago
- FriendliAI Model Hub☆89Updated 2 years ago
- Welcome to PeriFlow CLI ☁︎☆12Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆96Updated this week
- ☆11Updated last month
- 1-Click is all you need.☆59Updated 9 months ago
- A collection of all available inference solutions for the LLMs☆78Updated 5 months ago
- AI Agent who manages your Jira project☆15Updated 7 months ago
- Tiny configuration for Triton Inference Server☆44Updated last month
- ☆31Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆28Updated last year
- ☆27Updated 2 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆21Updated this week
- ☆22Updated this week
- Accelerated inference of 🤗 models using FuriosaAI NPU chips.☆26Updated 8 months ago
- OpenAI compatible API for open source LLMs☆15Updated last year
- Dotfile management with bare git☆19Updated this week
- manage histories of LLM applied applications☆88Updated last year
- MIST: High-performance IoT Stream Processing☆17Updated 5 years ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆59Updated 2 months ago
- Benchmark suite for LLMs from Fireworks.ai☆66Updated last week
- ☆43Updated 7 months ago
- ☆11Updated 10 months ago
- Evaluate your LLM apps, RAG pipeline, any generated text, and more!Updated 9 months ago
- Official repository for EXAONE 3.5 built by LG AI Research☆139Updated 2 months ago
- AI Assistant running within your browser.☆59Updated 2 months ago