deep-diver / LLM-Serve
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
☆17Updated last year
Alternatives and similar repositories for LLM-Serve:
Users that are interested in LLM-Serve are comparing it to the libraries listed below
- ☆37Updated last year
- ☆35Updated 10 months ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆15Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated 9 months ago
- ☆26Updated last year
- ☆31Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- manage histories of LLM applied applications☆88Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Toy O☆15Updated 4 months ago
- 1-Click is all you need.☆59Updated 9 months ago
- ☆26Updated 10 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆28Updated last year
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆21Updated 10 months ago
- ☆32Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 6 months ago
- ☆12Updated 10 months ago
- ☆23Updated last year
- HuggingChat like UI in Gradio☆69Updated last year
- The aim of this project is to publish and archive newsletters to a target email address.☆19Updated last year
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Updated last year
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20Updated last year
- MEXMA: Token-level objectives improve sentence representations☆37Updated 3 weeks ago
- ☆20Updated last year
- ☆24Updated last year
- Polyglot을 활용한 image-text multimodal☆11Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆32Updated last year