EasternJournalist / learn-deep-learningLinks
Labs for deep learning course.
☆16Updated 4 years ago
Alternatives and similar repositories for learn-deep-learning
Users that are interested in learn-deep-learning are comparing it to the libraries listed below
Sorting:
- 我的数据竞赛方案总结☆68Updated last year
- 基于python的BM25文本匹配算法实现☆32Updated 3 years ago
- RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.☆66Updated 5 months ago
- 收集经常用到的一些python代码☆48Updated last month
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- learn jiu wan shier l☆52Updated 3 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated 2 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21Updated 2 years ago
- 怎么训练一个LLM分词器☆151Updated 2 years ago
- ☆13Updated 3 years ago
- pytorch分布式训练☆67Updated last year
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆123Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year
- 仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】☆13Updated 3 years ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆155Updated 9 months ago
- 校招复习之旅:机器学习(MachineLearning)、深度学习(DeepLearning)、Leetcode、NLP等 (思维导图型笔记)算法岗面试☆17Updated 4 years ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆97Updated last year
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆82Updated 10 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆64Updated 5 months ago
- Fantastic Data Engineering for Large Language Models☆89Updated 6 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆40Updated last year
- Our code will be public soon .☆26Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- ☆18Updated 3 years ago
- Compute training dynamics, plot data cartography, analysing data quality...☆41Updated 2 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- make LLM easier to use☆59Updated 2 years ago
- A pre-trained model with multi-exit transformer architecture.☆54Updated 2 years ago
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆16Updated 2 months ago
- ☆46Updated last month