An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes
☆161Oct 16, 2024Updated last year
Alternatives and similar repositories for CleanTransformer
Users that are interested in CleanTransformer are comparing it to the libraries listed below
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)☆14Mar 31, 2025Updated 11 months ago
- ☆13Nov 12, 2021Updated 4 years ago
- ☆60Updated this week
- PyTorch distributed training from scratch (for educational purposes only)☆21Apr 12, 2025Updated 10 months ago
- ☆16Mar 30, 2024Updated last year
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Jul 19, 2024Updated last year
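- A repository for pretraining from scratch plus SFT of a small-parameter Chinese LLaMa2; it runs on a single 24 GB GPU and yields a chat-llama2 capable of simple Chinese question answering.☆2,897May 21, 2024Updated last year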
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆16Nov 24, 2025Updated 3 months ago
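- This project aims to share the technical principles of large models and hands-on experience (large-model engineering, deploying large-model applications)☆23,265Updated this week
- [LLMs Nine-Story Demon Tower] Sharing LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, multimodal and other areas (Stable Diffusion, MiniGP…☆2,156Mar 30, 2024Updated last year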
- IAN: An Intelligent System for Omics Data Analysis and Discovery☆10Feb 23, 2026Updated last week
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Learn large models through pictures☆317Jul 30, 2024Updated last year
- ☆82Jan 17, 2019Updated 7 years ago
- Pytorch❤️ Keras 😋😋☆2,009Feb 26, 2026Updated last week
- Train a 1B LLM on 1T tokens from scratch as an individual☆789Apr 27, 2025Updated 10 months ago
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,409Sep 29, 2023Updated 2 years ago
- Package: Interactive Presentation Ninja☆10Jun 7, 2024Updated last year
- This Repository Contains my Microwave Imaging Studies☆11Mar 1, 2016Updated 10 years ago
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Sep 14, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in an IDE☆10Apr 17, 2023Updated 2 years ago
- Simple repo to finetune an LLM hosted on Hugging Face by creating a LORA☆11Dec 20, 2023Updated 2 years ago
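- 🚀 Train your own VLA "large model" through the full pipeline: a 26M-parameter visual multimodal VLM trained from scratch in just 1 hour! 🌏☆27Oct 16, 2025Updated 4 months ago
- Scrape Baidu Index data☆12Dec 8, 2022Updated 3 years ago
- Computation scripts for performance-evaluation algorithms for deep hashing image retrieval and deep hashing cross-modal retrieval☆13Oct 30, 2024Updated last year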
- Generate Quiz Question from PDF/Text files☆11Feb 2, 2024Updated 2 years ago
- Affine Term-Structure Models: Theory and Implementation☆14Apr 6, 2020Updated 5 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- 🌿 Quickly generate a folder directory structure; supports setting the directory depth and writing the output to a markdown file.☆13Oct 19, 2022Updated 3 years ago
- Accepted LLM Papers in NeurIPS 2024☆37Oct 13, 2024Updated last year
- CapsNet implementation in keras for R☆12May 8, 2018Updated 7 years ago
- Awesome free AI courses available on YouTube☆12Apr 16, 2025Updated 10 months ago
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- New version of mpMap☆12Jul 19, 2020Updated 5 years ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆190Dec 11, 2025Updated 2 months ago