singularity-s0 / MOSS_frontend
Frontend for the MOSS chatbot.
☆48Updated 8 months ago
Alternatives and similar repositories for MOSS_frontend:
Users that are interested in MOSS_frontend are comparing it to the libraries listed below
- backend for fastnlp MOSS project☆60Updated 7 months ago
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated last year
- Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…☆37Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated last year
- zero零训练llm调参☆31Updated last year
- rwkv finetuning☆36Updated 9 months ago
- 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署,并最…☆117Updated last year
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆53Updated last year
- A more efficient GLM implementation!☆55Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆18Updated last year
- 基于中文法律知识的ChatGLM指令微调☆43Updated last year
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆113Updated last year
- 大语言模型训练和服务调研☆35Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆37Updated last year
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆55Updated 9 months ago
- ✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Updated last year
- Music large model based on InternLM2-chat.☆22Updated last month
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- deep learning☆150Updated 7 months ago
- ☆14Updated 10 months ago
- The paddle implementation of meta's LLaMA.☆45Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆134Updated 10 months ago
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆56Updated 3 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 9 months ago
- ☆102Updated this week
- ☆19Updated 2 years ago
- share data, prompt data , pretraining data☆35Updated last year