Full stack LLM (Pre-training/finetuning, PPO(RLHF), Inference, Quant, etc.)
☆31Feb 21, 2025Updated last year
Alternatives and similar repositories for FullLLM
Users that are interested in FullLLM are comparing it to the libraries listed below
- Simplest AlphaZero Implementation☆26Nov 6, 2024Updated last year
- ☆18Mar 28, 2022Updated 3 years ago
- PPO in one file☆27Oct 26, 2024Updated last year
- ☆12Jul 4, 2024Updated last year
- ☆13May 13, 2025Updated 9 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- A collection of materials on quantum machine learning fundamentals, algorithms, study resources, and projects. Here you can get all the Quantum Machine learning Basics, Algorithms, Study Materials, Projects and the descri…☆11Jan 4, 2018Updated 8 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- ☆10Jan 28, 2024Updated 2 years ago
- Tool to bridge Blender animation and physics-based robotic simulation☆16Updated this week
- Text classification using a few-shot method, based on the THUCNews dataset☆10Jun 4, 2020Updated 5 years ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- ☆11Updated this week
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 5 months ago
- ☆12Feb 11, 2026Updated 2 weeks ago
- Code search model based on self-attention☆12Oct 16, 2020Updated 5 years ago
- ☆16Oct 11, 2025Updated 4 months ago
- ☆11May 9, 2023Updated 2 years ago
- The repository of "Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark" (KDD'24)☆13Jan 27, 2026Updated last month
- ☆19Jul 31, 2025Updated 7 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- Graph Neural Convection-Diffusion with Heterophily☆11May 29, 2023Updated 2 years ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- Add an argument, label_smoothing, for torch.nn.CrossEntropyLoss()☆14Jan 13, 2021Updated 5 years ago
- ☆53Feb 11, 2025Updated last year
- codes for Neural Architecture Ranker and detailed cell information datasets based on NAS-Bench series☆12Jul 11, 2022Updated 3 years ago
- ☆31Sep 11, 2025Updated 5 months ago
- Papers on fairness☆12Oct 20, 2020Updated 5 years ago
- [ACM MM'25] NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images☆26Oct 25, 2025Updated 4 months ago
- This repository contains three standalone demos showcasing the OpenAI Agents Python SDK integrated with Temporal's durable execution.☆40Nov 28, 2025Updated 3 months ago
- Code for paper "Out-of-Domain Robustness via Targeted Augmentations"☆14Feb 25, 2023Updated 3 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- ☆18Apr 20, 2025Updated 10 months ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 10 months ago
- ☆12Jul 17, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago