friendliai / friendli-client
Friendli: the fastest serving engine for generative AI
☆43 · Updated 2 months ago
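For context, friendli-client is the Python SDK for calling Friendli serving endpoints. Below is a minimal sketch of a chat-completion request, assuming the SDK's OpenAI-compatible chat interface; the token and model id are placeholders, not values from this page:

```python
# Minimal sketch of a chat-completion call with friendli-client.
# Assumes the SDK's OpenAI-style chat.completions interface;
# token and model id below are placeholders.
from friendli import Friendli

client = Friendli(token="YOUR_FRIENDLI_TOKEN")  # placeholder credential

completion = client.chat.completions.create(
    model="meta-llama-3.1-8b-instruct",  # example model id; substitute your own
    messages=[{"role": "user", "content": "Summarize what a serving engine does."}],
)
print(completion.choices[0].message.content)
```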
Alternatives and similar repositories for friendli-client:
Users interested in friendli-client are comparing it to the libraries listed below.
- FMO (Friendli Model Optimizer) ☆12 · Updated 2 months ago
- ☆45 · Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 · Updated 3 months ago
- FriendliAI Model Hub ☆92 · Updated 2 years ago
- ☆22 · Updated this week
- Welcome to PeriFlow CLI ☁︎ ☆12 · Updated last year
- vLLM adapter for a TGIS-compatible gRPC server. ☆25 · Updated this week
- Sentence Embedding as a Service ☆15 · Updated last year
- How much energy do GenAI models consume? ☆42 · Updated 5 months ago
- SGLang is a fast serving framework for large language models and vision language models. ☆20 · Updated last month
- Dotfile management with bare git ☆19 · Updated last week
- AI agent that manages your Jira project ☆16 · Updated 9 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆128 · Updated this week
- A collection of available inference solutions for LLMs ☆82 · Updated last month
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines. ☆31 · Updated this week
- ☆45 · Updated 9 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 · Updated 3 months ago
- ☆11 · Updated last month
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration ☆26 · Updated 6 years ago
- Tiny configuration for Triton Inference Server ☆45 · Updated 2 months ago
- ☆56 · Updated this week
- Evaluate your LLM apps, RAG pipelines, any generated text, and more! · Updated 10 months ago
- 1-Click is all you need. ☆59 · Updated 11 months ago
- Modular and structured prompt caching for low-latency LLM inference ☆89 · Updated 4 months ago
- ☆102 · Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆87 · Updated this week
- [ICLR 2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆111 · Updated 3 months ago
- ☆31 · Updated 4 months ago
- ☆17 · Updated last week
- LLM Serving Performance Evaluation Harness ☆73 · Updated last month