vtuber-plan / langport
Langport is a language model inference service
☆94Updated 5 months ago
Alternatives and similar repositories for langport:
Users that are interested in langport are comparing it to the libraries listed below
- The paddle implementation of meta's LLaMA.☆45Updated last year
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆130Updated 7 months ago
- A converter and basic tester for rwkv onnx☆42Updated last year
- fastertransformer for codegeex model☆64Updated last year
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆96Updated 11 months ago
- CodeAssist is an advanced code completion tool that provides high-quality code completions for Python, Java, C++ and so on. CodeAssist 是一…☆58Updated 11 months ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- ☆81Updated 9 months ago
- Evaluation for AI apps and agent☆36Updated last year
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34Updated last year
- LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt☆63Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Updated last year
- rwkv finetuning☆36Updated 9 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆40Updated 7 months ago
- This project is established for real-time training of the RWKV model.☆49Updated 8 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆184Updated last month
- ☆51Updated 6 months ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Imitate OpenAI with Local Models☆85Updated 5 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆111Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆145Updated 7 months ago
- ☆52Updated 8 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- ☆33Updated last year
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆38Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆117Updated last year
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆51Updated last year