☆96Jul 20, 2025Updated 9 months ago
Alternatives and similar repositories for cs336
Users that are interested in cs336 are comparing it to the libraries listed below.
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated last year
- My implementation of Stanford CS336 assignments.☆239Mar 15, 2026Updated last month
- My Solution and Notes for the Stanford CS336: LLM from scratch☆218Mar 23, 2026Updated last month
- 🏆🏆 "Large models": All in one & All from scratch. 🌍🌍 Collect and clean data, train a tokenizer, pre-train, SFT, GRPO!☆57Aug 12, 2025Updated 8 months ago
- Multi-Critic Policy Gradient Optimization for Quadcopter Coordination☆14Aug 10, 2021Updated 4 years ago
- ☆10May 23, 2022Updated 3 years ago
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 8 months ago
- 6th China Software Cup software design competition: enterprise value-added tax (VAT) invoice data analysis system☆15Aug 14, 2017Updated 8 years ago
- Official code for MotionBench (CVPR 2025)☆71Mar 3, 2025Updated last year
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆100Apr 23, 2026Updated last week
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- OpenFTA☆14Jun 14, 2013Updated 12 years ago
- Organized notes on pre-training an LLM from scratch, SFT, RLHF, and DPO, plus interview questions☆21Sep 2, 2024Updated last year
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆59Nov 4, 2025Updated 6 months ago
- ☆11Oct 8, 2022Updated 3 years ago
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 4 months ago
- Taurix OS kernel. The Taurix system kernel, a hands-on (haphazardly written) exercise in operating system principles☆12Dec 20, 2020Updated 5 years ago
- [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm…☆11Dec 5, 2025Updated 5 months ago
- Quantitative trading website; Software Engineering III course project, iteration 3; team project☆11Mar 8, 2018Updated 8 years ago
- An Efficient BPE Algorithm Faster than Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- ☆18Sep 19, 2025Updated 7 months ago
- ☆21Jun 16, 2025Updated 10 months ago
- Tokenizing a Chinese corpus with Sentencepiece☆13Nov 30, 2023Updated 2 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆113Apr 28, 2026Updated last week
- ☆2,922Apr 29, 2026Updated last week
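- Chinese word segmentation based on BPE. Optimizations: preprocessing, parallel computation, multi-character words, multiple vocabularies☆14May 14, 2022Updated 3 years ago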
- ☆11Dec 11, 2024Updated last year
- Tongji University computer science machine learning course project☆10Mar 22, 2025Updated last year
- ☆19Sep 27, 2023Updated 2 years ago
- Science and engineering: introductory hands-on training course on large models☆120Aug 24, 2025Updated 8 months ago
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆1,558Apr 7, 2026Updated 3 weeks ago
- a brief repo about paper research☆15Sep 4, 2024Updated last year
- Approximate dynamic programming (ADP) and Policy gradient (PG) based sequential optimal experimental design (sOED)☆21Jun 26, 2022Updated 3 years ago
- ViralDynamic is a Python & Matlab framework specifically developed for the simulation and analysis of epidemic spreading on complex netwo…☆50Jan 21, 2025Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year