EasternJournalist / learn-deep-learningLinks
Labs for deep learning course.
☆16Updated 4 years ago
Alternatives and similar repositories for learn-deep-learning
Users that are interested in learn-deep-learning are comparing it to the libraries listed below
Sorting:
- Tips for paper writing and researches 科技论文写作经验记录和总结☆136Updated 3 years ago
- ☆45Updated 3 months ago
- RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.☆69Updated 6 months ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Updated 2 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21Updated 2 years ago
- 我的数据竞赛方案总结☆70Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year
- ☆104Updated last month
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆124Updated last year
- Fast instruction tuning with Llama2☆11Updated last year
- Lion and Adam optimization comparison☆63Updated 2 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Updated last year
- Jam of papers that interest or bore me and my friends :P☆23Updated this week
- make LLM easier to use☆59Updated 2 years ago
- 《自然语言处理概论》 张奇、桂韬、黄萱菁著☆117Updated last year
- Notes of my introduction about NLP in Fudan University☆37Updated 4 years ago
- ☆16Updated 3 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- 怎么训练一个LLM分词器☆152Updated 2 years ago
- AI Alignment: A Comprehensive Survey☆135Updated last year
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆17Updated 3 months ago
- On Memorization of Large Language Models in Logical Reasoning☆71Updated 5 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- Fantastic Data Engineering for Large Language Models☆90Updated 8 months ago
- Policies of scientific publisher and conferences towards large language model (LLM), such as ChatGPT☆75Updated 2 years ago
- ☆30Updated 5 months ago
- A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).☆81Updated last year
- ☆125Updated last year
- Deep Research☆85Updated last week
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆87Updated 3 weeks ago