vtuber-plan / langport
Langport is a language model inference service
☆94Updated 6 months ago
Alternatives and similar repositories for langport:
Users that are interested in langport are comparing it to the libraries listed below
- Train llama with lora on one 4090 and merge the lora weights to work like stanford alpaca.☆50Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated last year
- ☆82Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆130Updated 8 months ago
- This project is established for real-time training of the RWKV model.☆49Updated 10 months ago
- Evaluation for AI apps and agents☆36Updated last year
- LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt☆63Updated last year
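- A multimodal image-text dialogue model implementing Blip2RWKV+QFormer. Using the Two-Step Cognitive Psychology Prompt method, a model with only 3B parameters can exhibit human-like causal chains of thought. Positioned against image-text dialogue large language models such as MiniGPT-4 and ImageBind, it aims, with less compute and fewer resources, to…☆38Updated last year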
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆92Updated last year
- Demonstrates the remarkable effect of vllm on Chinese large language models☆31Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆122Updated last year
- CodeAssist is an advanced code completion tool that provides high-quality code completions for Python, Java, C++ and so on. CodeAssist is a…☆58Updated last year
- Evaluating LLMs with Dynamic Data☆78Updated last month
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!☆148Updated 7 months ago
- Large language model fine-tuning for bloom, opt, gpt, gpt2, llama, llama-2, cpmant and so on☆97Updated 10 months ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆95Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 11 months ago
- ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned; identical performance with a smaller GPU memory footprint.☆126Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆149Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆75Updated last year
- Fine-tuning Chinese large language models with qlora, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆85Updated last year
- Unofficial implementation of AlpaGasus☆90Updated last year
- ☆74Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34Updated last year
- fastertransformer for codegeex model☆63Updated last year
- Imitate OpenAI with Local Models☆88Updated 6 months ago