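Implementation of CS336 Assignment 5. The DPO/RLHF parts of the supplementary assignment are also completed, and the ablation analysis is available in a Feishu document. For reference only.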
☆27Sep 27, 2025Updated 6 months ago
Alternatives and similar repositories for cs336_assignment-5
Users that are interested in cs336_assignment-5 are comparing it to the libraries listed below.
- replication of micro-price on cryptocurrency data☆10Feb 27, 2022Updated 4 years ago
- (Includes complete code and notes on pitfalls) Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆28Jan 22, 2026Updated 2 months ago
- I love reinforcement learning.☆12Jan 15, 2025Updated last year
- ☆10Dec 8, 2022Updated 3 years ago
- [AAAI 2024] Official code for "Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Op…☆16Feb 14, 2024Updated 2 years ago
- This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re…☆13Oct 8, 2025Updated 5 months ago
- Implementation of my CS336 assignment1☆41Dec 23, 2025Updated 3 months ago
- ☆28Jun 27, 2025Updated 9 months ago
- A deep learning framework implemented in pure Python, to help you understand the low-level details and land a job offer☆21Aug 26, 2022Updated 3 years ago
- This code is the official implementation of paper "Certifiably Robust Image Watermark".☆15Aug 7, 2024Updated last year
- ☆18Nov 28, 2022Updated 3 years ago
- Learnable Descriptive Convolutional Network for Face Anti-Spoofing (BMVC'22)☆17Nov 12, 2024Updated last year
- The source code for the paper "Robust Data Hiding Using Inverse Gradient Attention".☆14Oct 17, 2022Updated 3 years ago
- A general approach for using deep neural network for digital watermarking☆15Mar 30, 2020Updated 6 years ago
- "A Hierarchical Graph V-Net with Semi-supervised Pre-training for Breast Cancer Histology Image Classification" (IEEE TMI)☆24Oct 23, 2023Updated 2 years ago
- A knowledge graph editing system that supports create, read, update, and delete operations on knowledge graphs; it also includes a knowledge graph built for a specialized domain (computer science)☆25Jun 29, 2020Updated 5 years ago
- Modern Applied Statistics with R, INLA and Stan☆25Jan 27, 2026Updated 2 months ago
- A private repo for learning CS336☆36Sep 2, 2025Updated 6 months ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆37Nov 12, 2025Updated 4 months ago
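- Compiled manuscript notes for the Transformer video series☆82Jan 4, 2026Updated 2 months ago
- Contains hand-written, from-scratch implementations of LLM components, such as reinforcement learning. They help in understanding the principles at the code level and in preparing for hand-coding questions that may come up in LLM interviews. More from-scratch implementations, such as the Transformer, will be added later☆79Mar 15, 2026Updated 2 weeks ago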
- ☆37Sep 23, 2022Updated 3 years ago
- Some useful tools☆20Nov 28, 2019Updated 6 years ago
- Extension for 3D Slicer containing various tools for module development and debugging☆34Sep 17, 2025Updated 6 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆53Updated this week
- ☆36Dec 19, 2022Updated 3 years ago
- ☆25Jul 15, 2022Updated 3 years ago
- A radiomics tool with a variety of brain MRI processing functions, including affine registration, hippocampus segmentation and feature ca…☆29Jul 20, 2021Updated 4 years ago
- Assignments for course IERG 6130: Reinforcement Learning and Beyond☆12Mar 9, 2021Updated 5 years ago
- PR2024 GDB: Gated convolutions-based Document Binarization. This repository comprehensively collects the datasets that may be used in do…☆16Nov 27, 2023Updated 2 years ago
- Official repository for the ICCV 25 paper: QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generati…☆21Jul 29, 2025Updated 8 months ago
- An intelligent medical question-answering system built on React + FastAPI + LangChain + Tongyi Qianwen, supporting medical knowledge Q&A based on retrieval-augmented generation (RAG).☆72Mar 27, 2025Updated last year
- LLM question answering based on an external knowledge base☆23Mar 6, 2024Updated 2 years ago
- Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"☆46Jun 21, 2021Updated 4 years ago
- A simple code editor that helps you work on OpenJudge☆45Jun 30, 2025Updated 8 months ago
- Hands-on retinal (fundus) image vessel extraction with U-Net, for beginners with no prior experience☆14Oct 5, 2021Updated 4 years ago
- PKU course, Reinforcement Learning, final project☆27Mar 23, 2021Updated 5 years ago
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀☆25Sep 13, 2023Updated 2 years ago
- A collection of selected graduate coursework assignments from Shanghai Jiao Tong University, Spring 2020☆16Jun 14, 2020Updated 5 years ago