☆29Sep 29, 2024Updated last year
Alternatives and similar repositories for Chinese_LLM_From_Scratch
Users that are interested in Chinese_LLM_From_Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 从零搭建大语言模型/神经网络框架,以达到深入理解大模型底层运行机制的目的☆19Sep 16, 2025Updated 6 months ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆10Jun 10, 2024Updated last year
- DLBlas: clean and efficient kernels☆35Mar 16, 2026Updated last week
- ☆10Apr 21, 2025Updated 11 months ago
- Homework of CMU 10-414/714: Deep Learning Systems (https://dlsyscourse.org/)☆15Mar 21, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- ☆11Apr 13, 2020Updated 5 years ago
- ☆15Jun 22, 2025Updated 9 months ago
- ☆15Apr 4, 2025Updated 11 months ago
- Filter RSS Feed with GPT-4☆16May 22, 2023Updated 2 years ago
- 《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程☆11Jun 8, 2024Updated last year
- 食品安全舆情分析系统(前端展示模块)☆15May 21, 2015Updated 10 years ago
- Enhancing Retrieval and Managing Retrieval: 4-Module Synergy☆23Dec 7, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence Information☆20Mar 6, 2023Updated 3 years ago
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- Multi-agent AI Teaching Assistant Learns from Limited Data☆26Mar 17, 2026Updated last week
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆20Sep 1, 2025Updated 6 months ago
- Materials and exercises for SICP☆15Feb 13, 2017Updated 9 years ago
- this is a multi-modal chinese poetry generation paper list☆12Oct 22, 2018Updated 7 years ago
- Survey on Knowledge Graph☆15Dec 5, 2018Updated 7 years ago
- Full Marks | Auditing CS61B Data Structures, Spring 2021☆14Jul 31, 2023Updated 2 years ago
- 完整的一个图片OCR微信小程序项目,采用了百度OCR的API和百度翻译API,实现了拍照,选图,批量图片识别提取文字,图片剪裁,支持分享,翻译,校对,记录识别历史等功能,还使用了微信小程序云函数进行识别后文字的安全鉴定。☆14Feb 23, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- cvpr2019/cvpr2018/cvpr2019 papers,极市团队整理☆12Aug 27, 2019Updated 6 years ago
- Source code for paper "Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks"☆33Jul 23, 2023Updated 2 years ago
- 基于知识图谱的农业智能问答系统,正在持续完善☆22Oct 8, 2019Updated 6 years ago
- seq2seq_translation☆28Nov 28, 2021Updated 4 years ago
- ☆22Apr 22, 2025Updated 11 months ago
- 使用 Bert 进行文本分类☆20Dec 7, 2021Updated 4 years ago
- ☆17May 19, 2023Updated 2 years ago
- Heterogeneous Information Network Datasets☆20May 26, 2019Updated 6 years ago
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- FedVCK: Non-IID Robust and Communication-Efficient Federated Learning via Valuable Condensed Knowledge for Medical Image Analysis, Accept…☆21Feb 19, 2025Updated last year
- 包括轮廓自动识别筛选,区域分割,以及传统的缺陷检测算法☆14Jul 16, 2022Updated 3 years ago
- Insider threat detection via bert☆23Jan 13, 2022Updated 4 years ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆586Jul 11, 2024Updated last year
- Concepts Explored in/with Pytorch☆20Aug 6, 2024Updated last year
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆33Feb 10, 2026Updated last month
- Implementation of FedDM: Iterative Distribution Matching for Communication-Efficient Federated Learning☆20Jan 4, 2024Updated 2 years ago