youkaichao / my-portable-aiml-environmentLinks
A dockerfile to build my portable environment for AI/ML development, with some daily used packags!
☆20Updated 4 months ago
Alternatives and similar repositories for my-portable-aiml-environment
Users that are interested in my-portable-aiml-environment are comparing it to the libraries listed below
Sorting:
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆30Updated last year
- AI Alignment: A Comprehensive Survey☆135Updated last year
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆34Updated 7 months ago
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆34Updated last year
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆25Updated 3 weeks ago
- Course Materials for ML Course at Tsinghua☆26Updated 5 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated 2 years ago
- Deep Research☆72Updated last week
- ☆30Updated 5 months ago
- 逻辑回归和单层softmax的 解析解☆12Updated 4 years ago
- Python client designed specifically for large-scale requests to the openai interface☆22Updated last year
- ☆74Updated 2 weeks ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆43Updated 6 months ago
- Repo of ACL 2025 Paper "Quantification of Large Language Model Distillation"☆91Updated last month
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆17Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆136Updated last year
- 中国如何下载huggingface 模型并共享链接☆55Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated 2 years ago
- 百度QA100万数据集☆48Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆56Updated last year
- ☆21Updated 2 weeks ago
- 仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】☆13Updated 3 years ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆15Updated last year
- ☆49Updated 3 years ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆87Updated 5 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- Automatically select the best random seed based on ancient Chinese I Ching. Good luck and best wishes !☆48Updated 3 years ago
- Feeling confused about super alignment? Here is a reading list☆43Updated last year
- RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.☆69Updated 6 months ago