friendliai / friendli-client
Friendli: the fastest serving engine for generative AI
☆44 Updated 3 months ago
Alternatives and similar repositories for friendli-client:
Users who are interested in friendli-client are comparing it to the libraries listed below.
- FMO (Friendli Model Optimizer) ☆12 Updated 3 months ago
- ☆45 Updated 7 months ago
- FriendliAI Model Hub ☆92 Updated 2 years ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 4 months ago
- manage histories of LLM applied applications ☆88 Updated last year
- Welcome to PeriFlow CLI ☁︎ ☆12 Updated last year
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines. ☆38 Updated this week
- ☆11 Updated 2 months ago
- 1-Click is all you need. ☆61 Updated 11 months ago
- building a CLIP application using BentoML ☆10 Updated last month
- Evaluate your LLM apps, RAG pipeline, any generated text, and more! Updated 11 months ago
- Dotfile management with bare git ☆19 Updated 2 weeks ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆26 Updated this week
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆127 Updated this week
- Tiny configuration for Triton Inference Server ☆45 Updated 3 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement ☆29 Updated last year
- Sentence Embedding as a Service ☆15 Updated last year
- ☆31 Updated 4 months ago
- SGLang is a fast serving framework for large language models and vision language models. ☆22 Updated 2 months ago
- Build complex LLM Applications with Python Dictionary ☆40 Updated 6 months ago
- AI Agent who manages your Jira project ☆16 Updated 10 months ago
- OSLO: Open Source for Large-scale Optimization ☆175 Updated last year
- ☆20 Updated last year
- Efficient fine-tuning for ko-llm models ☆183 Updated last year
- Self-host LLMs with vLLM and BentoML ☆106 Updated last week
- Estimating hardware and cloud costs of LLMs and transformer projects ☆14 Updated last year
- Newsletter bot for 🤗 Daily Papers ☆118 Updated this week
- MIST: High-performance IoT Stream Processing ☆17 Updated 6 years ago
- ☆23 Updated this week
- ☆101 Updated last year