singularity-s0 / MOSS_frontendLinks
Frontend for the MOSS chatbot.
☆48Updated last year
Alternatives and similar repositories for MOSS_frontend
Users that are interested in MOSS_frontend are comparing it to the libraries listed below
Sorting:
- backend for fastnlp MOSS project☆59Updated 11 months ago
- Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…☆37Updated 2 years ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆38Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated 2 years ago
- rwkv finetuning☆36Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆113Updated 2 years ago
- deep learning☆148Updated last month
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- share data, prompt data , pretraining data☆36Updated last year
- 基于中文法律知识的ChatGLM指令微调☆44Updated 2 years ago
- The paddle implementation of meta's LLaMA.☆45Updated 2 years ago
- Imitate OpenAI with Local Models☆86Updated 10 months ago
- Just for debug☆56Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 4 months ago
- 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署,并最…☆115Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆61Updated 8 months ago
- ☆82Updated last year
- Light local website for displaying performances from different chat models.☆87Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆57Updated last year
- A more efficient GLM implementation!☆55Updated 2 years ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆86Updated 2 years ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- zero零训练llm调参☆31Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- qwen models finetuning☆99Updated 3 months ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year