☆509 · Apr 29, 2024 · Updated last year
Alternatives and similar repositories for Transformer-from-scratch
Users who are interested in Transformer-from-scratch are comparing it to the libraries listed below
- Reading, study, and video-sharing notes on "Reinforcement Learning" ☆78 · Apr 1, 2025 · Updated 11 months ago
- Train your own Chinese embedding model ☆28 · Jan 6, 2025 · Updated last year
- ☆30 · Dec 6, 2024 · Updated last year
- NLP/LLM MLOps pipeline for development, training, and evaluation, with scalable deployment and monitoring systems. ☆22 · Mar 15, 2024 · Updated last year
- Data structures for data science ☆23 · Apr 12, 2024 · Updated last year
- ☆13 · Sep 12, 2024 · Updated last year
- Chinese edition of hf-alignment-handbook: a complete set of SFT, DPO, ORPO, and CPT training tutorials for large models. ☆14 · Aug 25, 2024 · Updated last year
- Multi-source domain graph convolution networks (Fault and Abnormal Vibration Diagnosis) ☆12 · Mar 29, 2022 · Updated 3 years ago
- A simple implementation of Llama 1 and 2. The Llama architecture is built from scratch using PyTorch; all the models are built from scratch that inc… ☆13 · May 6, 2024 · Updated last year
- AlphaGo Zero implemented from scratch ☆17 · Nov 10, 2024 · Updated last year
- An introduction to AI infrastructure: a knowledge base on AI system infrastructure ☆16 · Jun 4, 2023 · Updated 2 years ago
- Reproductions of large-model-related algorithms, plus some study notes ☆3,023 · Feb 10, 2026 · Updated last month
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆21 · Dec 2, 2024 · Updated last year
- AI tool that generates an Audio short story based on the context of an uploaded image by prompting a GenAI LLM model, Hugging Face AI mod… ☆51 · Jan 11, 2024 · Updated 2 years ago
- ☆19 · Jul 10, 2023 · Updated 2 years ago
- ☆18 · Nov 19, 2023 · Updated 2 years ago
- From nobody to large language model (LLM) hero. Stay tuned for more! ☆2,048 · Nov 22, 2025 · Updated 3 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains ☆48 · Nov 10, 2025 · Updated 4 months ago
- Code repo for MathAgent ☆19 · Dec 15, 2023 · Updated 2 years ago
- A set of simple (and not so simple) web-servers written in Go ☆16 · Mar 4, 2025 · Updated last year
- Chinese translation of Hands-On-Large-Language-Models (hands-on-llms): learn large models hands-on ☆2,241 · Oct 19, 2025 · Updated 4 months ago
- NeurIPS 2022 paper, SubHypergraph Inductive Neural nEtwork ☆19 · Aug 4, 2023 · Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere… ☆22 · May 9, 2025 · Updated 10 months ago
- CrewAI AgentOps: Monitor your AI Agents ☆19 · Jun 29, 2024 · Updated last year
- Tracks trending GitHub repos automatically, updated daily ☆50 · Updated this week
- Llama3-Tutorial (XTuner, LMDeploy, OpenCompass) ☆512 · May 10, 2024 · Updated last year
- CentOS-based Docker container for Time Series Analysis and Modeling. ☆22 · Sep 19, 2019 · Updated 6 years ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆18 · Dec 31, 2023 · Updated 2 years ago
- A desktop application that converts videos to PPT. ☆23 · Nov 14, 2021 · Updated 4 years ago
- Graph Convolutional Network for RUL prediction with multi-sensor signals ☆20 · Feb 12, 2023 · Updated 3 years ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models ☆21 · Oct 3, 2023 · Updated 2 years ago
- "A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe ☆4,563 · Feb 12, 2026 · Updated 3 weeks ago
- Graph Few-Shot Class-Incremental Learning via Prototype Representation ☆21 · Aug 14, 2022 · Updated 3 years ago
- Retriever-0.1B ☆96 · Jun 6, 2024 · Updated last year
- "A User's Guide to Open-Source Large Models": tutorials, tailored for Chinese beginners, on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment ☆28,676 · Feb 24, 2026 · Updated last week
- Keras implementation of "DFNet: Discriminative feature extraction and integration network for salient object detection" ☆23 · Jan 5, 2021 · Updated 5 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the GIDS AIML Conference -2020 Edition ☆23 · May 3, 2021 · Updated 4 years ago
- A neural network-based AI chatbot has been designed that uses LSTM as its training model for both encoding and decoding. The chatbot work… ☆22 · May 27, 2021 · Updated 4 years ago
- UnifyPy is a powerful automation solution that packages any Python project into cross-platform standalone executables and installers. It supports the three major operating systems (Windows, macOS, and Linux) and provides a unified interface and rich configuration options. ☆42 · Oct 12, 2025 · Updated 4 months ago