deep-diver / LLM-Serve
This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots.
★17 Updated last year
Alternatives and similar repositories for LLM-Serve:
Users that are interested in LLM-Serve are comparing it to the libraries listed below
- ★37 Updated last year
- manage histories of LLM applied applications ★88 Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ★50 Updated last year
- ★26 Updated 2 years ago
- ★31 Updated last year
- 1-Click is all you need. ★59 Updated 10 months ago
- generate synthetic data for LLM fine-tuning in arbitrary situations in a systematic way ★21 Updated 11 months ago
- ★23 Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models. ★10 Updated 10 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement ★28 Updated last year
- ★35 Updated 11 months ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool" ★15 Updated last year
- The aim of this project is to publish and archive newsletters to a target email address. ★19 Updated last year
- ★26 Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ★20 Updated 9 months ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ? ★18 Updated last month
- Build complex LLM Applications with Python Dictionary ★38 Updated 5 months ago
- ★32 Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ★15 Updated 2 years ago
- HuggingChat like UI in Gradio ★70 Updated last year
- AskUp Search ChatGPT Plugin ★20 Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ★14 Updated last year
- Anh - LAION's multilingual assistant datasets and models ★27 Updated last year
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes ★20 Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ★116 Updated last year
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m… ★13 Updated 10 months ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ★33 Updated last year
- Korean embedding model specialized for the financial domain ★20 Updated 7 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ★58 Updated last year