☆96Jul 20, 2025Updated 10 months ago
Alternatives and similar repositories for cs336
Users that are interested in cs336 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- My implementation of Stanford CS336 assignments.☆240Mar 15, 2026Updated 2 months ago
- My Solution and Notes for the Stanford CS336: LLM from scratch☆236Mar 23, 2026Updated 2 months ago
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆57Aug 12, 2025Updated 9 months ago
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆27Mar 12, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Solutions for Stanford CS224n, Winter 2020.☆12Jun 5, 2021Updated 4 years ago
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 9 months ago
- 第六届 中国软件杯 软件设计大赛 企业增值税发票数据分析系统☆15Aug 14, 2017Updated 8 years ago
- ☆12May 23, 2024Updated 2 years ago
- ☆13May 10, 2021Updated 5 years ago
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆16Feb 27, 2025Updated last year
- Simple template DAG scheduler in c++☆15Aug 13, 2020Updated 5 years ago
- (包含完整代码和坑点记录)Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆33Jan 22, 2026Updated 4 months ago
- ☆17Feb 6, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆106Apr 23, 2026Updated last month
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 5 months ago
- Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)☆14Apr 2, 2025Updated last year
- An asynchronous streaming data management module for efficient post-training.☆72May 18, 2026Updated last week
- [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm…☆11Dec 5, 2025Updated 5 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆61Nov 4, 2025Updated 6 months ago
- Official repository of the "Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting" (CVPR 2024 Highlight)☆14Dec 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NewsApp包含客户端源码、服务端源码、数据库文件。 基于Miscrosoft人工智能项目ProjectOxford中的Recognition Emotion做的, 主要是基于用户的面部表情来推送不同类别的新闻。 Emotion API可以参考:https://www.p…☆10Mar 2, 2016Updated 10 years ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- ☆18Sep 19, 2025Updated 8 months ago
- 下载复旦论文数据库的便利工具☆11Feb 13, 2025Updated last year
- ☆22Jun 16, 2025Updated 11 months ago
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago
- personal notes (30k loc)☆13Updated this week
- ☆3,145Updated this week
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆117Apr 28, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ffdnet-pytorch 简单修改就可以跑起来☆15Nov 11, 2021Updated 4 years ago
- 基于 BPE 实现的中文分词。优化:预处理,并行计算,多字词,多词表☆14May 14, 2022Updated 4 years ago
- Official repository of “MatchSeg"☆12Mar 22, 2024Updated 2 years ago
- 基于MFCC特征构建单核GMM的0-9独立词语音识别,MFCC,GMM,sklearn,Isolated word recognition。☆10Nov 18, 2020Updated 5 years ago
- Repository for "Generative Adversarial Super-Resolution at the Edge with Knowledge Distillation" (Angarano et al., 2022).☆14May 15, 2023Updated 3 years ago
- AI-based analytical tools for the analysis of STEM images.☆12Oct 22, 2024Updated last year
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆20Oct 13, 2025Updated 7 months ago