deep-diver / LLM-ServeLinks
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
β18Updated 2 years ago
Alternatives and similar repositories for LLM-Serve
Users that are interested in LLM-Serve are comparing it to the libraries listed below
Sorting:
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50Updated 2 years ago
- manage histories of LLM applied applicationsβ91Updated last year
- β35Updated last year
- β37Updated 2 years ago
- β26Updated 2 years ago
- Weak Labeling (NER) using ChatGPTβ38Updated 2 years ago
- AskUp Search ChatGPT Pluginβ20Updated 2 years ago
- hllama is a library which aims to provide a set of utility tools for large language models.β10Updated last year
- HuggingChat like UI in Gradioβ71Updated 2 years ago
- 1-Click is all you need.β62Updated last year
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic wayβ22Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPOβ116Updated last year
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?β18Updated 6 months ago
- showing various ways to serve Keras based stable diffusionβ111Updated 2 years ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool"β16Updated 2 years ago
- β32Updated 8 months ago
- β33Updated 2 years ago
- The aim of this project is to publish and archive newsletters to a target email address.β20Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placementβ29Updated last month
- β20Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_serverβ45Updated 2 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetesβ20Updated 2 years ago
- OpenOrca-KO datasetμ νμ©νμ¬ llama2λ₯Ό fine-tuningν Korean-OpenOrcaβ19Updated last year
- Newsletter bot for π€ Daily Papersβ126Updated this week
- nllb-200 distilled 350M for English to Korean translationβ26Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptationsβ33Updated 2 years ago
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extendedβ32Updated last year
- a Jax/Flax inference code of StarCoderβ12Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Modelsβ25Updated 11 months ago
- β12Updated last year