viktor-ferenczi / vllm-client
vLLM client with minimal dependencies
☆15Updated last year
Alternatives and similar repositories for vllm-client
Users who are interested in vllm-client are comparing it to the libraries listed below
- Call RWKV v4/v5/v6/v7 Raven/World/Finch 1B5-14B models through rwkv.cpp from C# on CPU/GPU (supports INT4, INT8, Float16, Float32)☆35Updated 10 months ago
- Minimal C# bindings for llama.cpp + .NET core library with API host/client.☆73Updated last year
- This project implements token calculation for OpenAI's gpt-4 and gpt-3.5-turbo models, specifically using `cl100k_base` encoding.☆82Updated last week
- Run GreenBitAI's Quantized LLMs on Apple Devices with MLX☆31Updated 4 months ago
- A local LLM server for Semantic Kernel☆35Updated 2 years ago
- AI Studio helps you with the power of ChatGPT in many subjects such as adding unit tests, refactoring code, adding summary, etc. while wr…☆38Updated last month
- Semantic-Fleet serves as a specialized extension hub for the Semantic-Kernel ecosystem. It houses a diverse array of connectors designed …☆31Updated 2 months ago
- Implementation of Adept's all-new Fuyu multi-modality model in PyTorch☆24Updated last year
- ☆34Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Updated 2 years ago
- A planner that integrates into Semantic Kernel to enable function calling on all chat-based LLMs (Mistral, Bard, Claude, LLama, etc.)☆57Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated 2 years ago
- ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp☆58Updated 2 years ago
- A simple Semantic Kernel semantic function debugging tool.☆30Updated last year
- ☆50Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆146Updated 6 months ago
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆101Updated 11 months ago
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- Advancing LLM with Diverse Coding Capabilities☆80Updated last year
- FuseAI Project☆88Updated 11 months ago
- Stable Diffusion model v1.5 for TorchSharp☆19Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Updated last year
- Unofficial implementation of AlpaGasus☆94Updated 2 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Updated last year
- ☆162Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆192Updated last year
- Semantic Kernel connector for ONNX models.☆12Updated last year
- Copies the MLP of llama3 8 times as 8 experts, creates a router with random initialization, and adds a load balancing loss to construct an 8x8b Mo…☆27Updated last year
- ☆51Updated last year