EasternJournalist / learn-deep-learning
Labs for deep learning course.
☆16Updated 4 years ago
Alternatives and similar repositories for learn-deep-learning
Users that are interested in learn-deep-learning are comparing it to the libraries listed below
- "Introduction to Natural Language Processing" (《自然语言处理概论》) by Zhang Qi, Gui Tao, and Huang Xuanjing☆118Updated 2 years ago
- Convert Keras models to PyTorch☆12Updated 6 years ago
- RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction.☆73Updated 7 months ago
- Summary of my data competition solutions☆70Updated last year
- Transformer model based on the Gated Attention Unit (early preview version)☆98Updated 2 years ago
- Jam of papers that interest or bore me and my friends :P☆23Updated this week
- Tips for paper writing and research: notes and a summary of experience in writing scientific papers☆136Updated 3 years ago
- Training LLaMA or MOSS with RLHF, with optional LoRA☆21Updated 2 years ago
- Notes from my introduction to NLP at Fudan University☆37Updated 4 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
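- An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes☆158Updated last year
- Evaluation benchmark for Chinese financial large language models: six categories, twenty-five tasks, graded evaluation; Chinese domestic models achieve grade A☆10Updated last year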
- Fast instruction tuning with Llama2☆11Updated last year
- Analytical solutions for logistic regression and single-layer softmax☆12Updated 4 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated 2 years ago
- learn jiu wan shier l☆53Updated 4 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Updated 2 years ago
- An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT☆49Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆81Updated 2 years ago
- ☆100Updated 3 years ago
- A collection of frequently used Python code☆50Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆43Updated last year
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)☆125Updated last year
- [NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li☆21Updated 3 years ago
- A PyTorch & Keras implementation and demo of Fastformer.☆190Updated 3 years ago
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙‍♂️☆39Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- Our code will be public soon.☆27Updated 2 years ago
- Shuffling files of several hundred GB in Python☆33Updated 4 years ago
- A collection of ChatGPT-related resources☆56Updated 2 years ago