Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch
☆227May 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for assignment2-systems
Users that are interested in assignment2-systems are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆63May 8, 2026Updated 2 weeks ago
- ☆149Updated this week
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 6 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆73Apr 28, 2026Updated 3 weeks ago
- ☆13Mar 30, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆25Jun 28, 2025Updated 10 months ago
- ☆36Jul 24, 2025Updated 10 months ago
- ☆33Jan 7, 2025Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 3 months ago
- Pytorch implementation of Adaptative Dropout a.ka Standout.☆12Feb 22, 2018Updated 8 years ago
- ☆3,145Updated this week
- ☆35Jul 5, 2023Updated 2 years ago
- PAL: Predictive Analysis & Laws of Large Language Models☆39May 19, 2026Updated last week
- Metadata for my UK Domestic Appliance-Level Electricity (UK-DALE) dataset☆16Jul 16, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12May 30, 2025Updated 11 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated 3 weeks ago
- Deep universal probabilistic programming with Python and PyTorch☆12Apr 1, 2020Updated 6 years ago
- Student lab assignments for MIT 6.1600☆11May 1, 2025Updated last year
- [ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"☆34Sep 14, 2025Updated 8 months ago
- A new repository for model serving examples using Docker, Git HOOKS, Celery and Flask☆11Apr 29, 2026Updated 3 weeks ago
- A PyTorch implementation of BatchBALD on the MNIST dataset☆13Sep 16, 2020Updated 5 years ago
- [NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping☆19Feb 28, 2026Updated 2 months ago
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- ☆39Aug 4, 2025Updated 9 months ago
- ☆15Jul 28, 2025Updated 9 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆28Oct 14, 2025Updated 7 months ago
- ☆13May 12, 2025Updated last year
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/☆62Jan 18, 2026Updated 4 months ago
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆67Jan 26, 2026Updated 4 months ago
- Text-to-video generation.☆10Jul 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆36Nov 15, 2025Updated 6 months ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- Library that provides metrics to assess representation quality☆27Feb 5, 2025Updated last year
- ☆26Feb 20, 2026Updated 3 months ago
- Scripts for converting TCIA LIDC-IDRI collection derived data into standard DICOM representation from project-specific XML format.☆27Jun 17, 2021Updated 4 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 7 years ago
- ⚡ Fast full-text search for Elixir☆17Jul 18, 2024Updated last year