deep-diver / LLM-Serve
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
β17Updated last year
Alternatives and similar repositories for LLM-Serve:
Users that are interested in LLM-Serve are comparing it to the libraries listed below
- hllama is a library which aims to provide a set of utility tools for large language models.β10Updated 10 months ago
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50Updated last year
- β26Updated last year
- "Learning-based One-line intelligence Owner Network Connectivity Tool"β15Updated last year
- β37Updated last year
- β35Updated 10 months ago
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic wayβ21Updated 10 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placementβ28Updated last year
- AskUp Search ChatGPT Pluginβ20Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extendedβ32Updated last year
- manage histories of LLM applied applicationsβ88Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPOβ116Updated last year
- β31Updated last year
- Implementation of stop sequencer for Huggingface Transformersβ16Updated last year
- The aim of this project is to publish and archive newsletters to a target email address.β19Updated last year
- 1-Click is all you need.β59Updated 9 months ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?β18Updated 2 weeks ago
- β32Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β14Updated last year
- Toy Oβ15Updated 4 months ago
- λ¬Έμ₯λ¨μλ‘ λΆμ λ λ무μν€ λ°μ΄ν°μ . Releasesμμ λ€μ΄λ‘λ λ°κ±°λ, tfds-koreanμ ν΅ν΄ λ€μ΄λ‘λ λ°μΌμΈμ.β19Updated 3 years ago
- β14Updated last year
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]β11Updated last year
- Modified Beam Search with periodical restartβ12Updated 5 months ago
- β26Updated 11 months ago
- κΈμ΅ λλ©μΈμ νΉνλ νκ΅μ΄ μλ² λ© λͺ¨λΈβ20Updated 6 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Updated last year
- HuggingChat like UI in Gradioβ69Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated 11 months ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetesβ20Updated last year