howard-hou / transformers-benchmarks-exercise
exercise for transformers-benchmarks, add 3090 benchmark
☆12Updated 2 years ago
Alternatives and similar repositories for transformers-benchmarks-exercise:
Users that are interested in transformers-benchmarks-exercise are comparing it to the libraries listed below
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated this week
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆182Updated 8 months ago
- backend for fastnlp MOSS project☆59Updated 9 months ago
- The Roadmap for LLMs☆84Updated last year
- GPT数据收录 It is a repo recording all the public gpts. You can use it to build your own GPTs Store.Updated 10 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆119Updated last year
- GLM Series Edge Models☆136Updated 2 months ago
- huggingface-go : 加速下载 huggingface 的模型和数据集☆45Updated last year
- 使用单个24G显卡,从0开始训练LLM☆53Updated 6 months ago
- ☆53Updated last month
- 中国如何下载huggingface 模型并共享链接☆54Updated last year
- ChatGLM-6B-Slim:裁减掉20K图片Token的ChatGLM-6B,完全一样的性能,占用更小的显存。☆126Updated 2 years ago
- 使用甄嬛语料微调的chatglm☆85Updated 2 years ago
- A free tool that helps you transcribe, translate, and summarize videos in any language.☆18Updated last year
- 基于《西游记》原文、白话文、ChatGPT生成数据制作的,以InternLM2微调的角色扮演多LLM聊天室。 本项目将介绍关于角色扮演类 LLM 的一切,从数据获取、数据处理,到使用 XTuner 微调并部署至 OpenXLab,再到使用 LMDeploy 部署,以 op…☆98Updated last year
- 百度QA100万数据集☆47Updated last year
- something for paper agent☆11Updated 4 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- Frontend for the MOSS chatbot.☆48Updated 10 months ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆261Updated 11 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- Gaokao Benchmark for AI☆108Updated 2 years ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 2 months ago
- ☆28Updated 11 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆36Updated this week
- A small open source 3D agent simulator based on LLM.☆61Updated 4 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆159Updated last year
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 5 months ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆38Updated last year