A pytorch Implementation of the Transformer: Attention Is All You Need
☆14Jun 7, 2024Updated last year
Alternatives and similar repositories for transformer-pytorch
Users that are interested in transformer-pytorch are comparing it to the libraries listed below
Sorting:
- xgboost复现☆15Oct 6, 2024Updated last year
- A great tutorial that gives you a sufficient understanding of the recommendation system☆10Mar 21, 2020Updated 5 years ago
- Reinforcement Learning for Cut Selection☆12Dec 8, 2022Updated 3 years ago
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Dec 2, 2025Updated 3 months ago
- ☆10Apr 15, 2023Updated 2 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- ☆14May 6, 2025Updated 10 months ago
- Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness"☆13Nov 20, 2022Updated 3 years ago
- A medical knowledge graph related website, utilizing technologies and frameworks such as Django, Bootstrap, Echarts, with MySQL and neo4j…☆12Jun 9, 2024Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- ☆17Apr 23, 2025Updated 10 months ago
- I'm an AI assistant with extensive knowledge in psychology, and my name is Care.☆25Aug 25, 2025Updated 6 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆46Dec 17, 2025Updated 2 months ago
- An automatic prompt iteration and optimization generator suitable for any scenario☆16Jan 31, 2025Updated last year
- Matlab code for Low rank Matrix Factorization with AMP☆17Aug 18, 2015Updated 10 years ago
- PyTorch distributed training from scratch (for educational purposes only)☆21Apr 12, 2025Updated 10 months ago
- Source code for "Improving Chemical Reaction Yield Prediction Using Pre-Trained Graph Neural Networks"☆21Oct 31, 2024Updated last year
- ☆18Feb 20, 2025Updated last year
- The repository for 'Unsupervised Learning for Combinatorial Optimization with Principled Proxy Design'☆16Oct 9, 2022Updated 3 years ago
- ☆21Oct 31, 2024Updated last year
- Data Augmentation on Graphs: A Technical Survey☆15Feb 12, 2023Updated 3 years ago
- ☆19Aug 9, 2024Updated last year
- memAry @ University of Texas at Austin☆21Apr 16, 2024Updated last year
- Achieve your exclusive DeepResearch.☆24Apr 25, 2025Updated 10 months ago
- ☆112Jun 12, 2025Updated 8 months ago
- 一些 LLM 方面的从零复现笔记☆243Apr 29, 2025Updated 10 months ago
- 目前各大高校领域将各种信息分布在不同的部门信息门户下,存在典型的信息孤岛问题,各个部门信息没有形成互通。当前,老师和学生存在很多有关本校相关文件、政策和活动等众多方面智能问答的统一入口的需求,例如财务处、人事处、学工处、教务处、图书馆等存在各种政策和文件规定,目前在校师生都…☆35Aug 5, 2024Updated last year
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated 10 months ago
- ☆32Feb 13, 2024Updated 2 years ago
- SoulStar 是一个心理咨询大模型,内核为温柔知心的大姐姐,能详细分析倾诉的问题,给出切实的建议和安慰,并有可爱表情和颜文字回复~~(*╹▽╹*)☆32Mar 3, 2024Updated 2 years ago
- Official Code Repository for Knowledge-Augmented Language Model Verification (EMNLP 2023)☆28Dec 22, 2023Updated 2 years ago
- ☆34Jul 12, 2024Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- PyTorch implementation of Logic Tensor Networks, a Neural-Symbolic framework.☆36Oct 2, 2024Updated last year
- Recruitment instructions of Professor Li Zhenghua.☆30Sep 6, 2025Updated 6 months ago
- Open source code for paper "EDITS: Modeling and Mitigating Data Bias for Graph Neural Networks".☆28Jul 8, 2022Updated 3 years ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆143May 8, 2025Updated 9 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38May 24, 2024Updated last year