baudzhou / MindsporeTrainer
Make Mindspore Training Easier
☆8Updated last year
Related projects: ⓘ
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆17Updated last year
- ☆23Updated this week
- 在kaggle部署ChatGLM API,和ChatGPT api使用相同的调用方式☆14Updated last year
- A light proxy solution for HuggingFace hub.☆43Updated 10 months ago
- accelerate generating vector by using onnx model☆10Updated 7 months ago
- TensorRT☆11Updated 4 years ago
- share data, prompt data , pretraining data☆35Updated 9 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆35Updated last week
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆52Updated 3 weeks ago
- aigc evals☆10Updated 9 months ago
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆24Updated last year
- This project is mainly to explore what effect can be achieved by fine-tuning LLM model (ChatGLM-6B)of about 6B in vertical field (Romance…☆25Updated last year
- Music large model based on InternLM2-chat.☆21Updated 2 months ago
- ☆12Updated this week
- ☆16Updated 4 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆41Updated last week
- make LLM easier to use☆59Updated last year
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated 10 months ago
- Sparse Multilabel Categorical Crossentropy☆9Updated last year
- rwkv finetuning☆35Updated 5 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆44Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆32Updated 8 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆10Updated 2 months ago
- 大语言模型训练和服务调研☆32Updated last year
- Large-scale exact string matching tool☆15Updated 11 months ago
- ☆32Updated 3 months ago
- SUS-Chat: Instruction tuning done right☆47Updated 8 months ago
- LLM+RAG for QA☆19Updated 8 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆51Updated 5 months ago