Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆77Jul 7, 2025Updated 9 months ago
Alternatives and similar repositories for stanford-cs336-a1
Users that are interested in stanford-cs336-a1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My implementation of Stanford CS336 assignments.☆237Mar 15, 2026Updated last month
- Code for Research Project TLDR☆25Jul 28, 2025Updated 8 months ago
- 南京大学2022春季PA实验☆13Aug 27, 2023Updated 2 years ago
- [ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models☆18Sep 25, 2025Updated 6 months ago
- A curated collection of resources, frameworks, papers, and best practices for designing, evaluating, and deploying agentic AI systems—fro…☆34Apr 5, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆32Oct 22, 2025Updated 5 months ago
- LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction (ACL 2…☆14Aug 12, 2024Updated last year
- ☆20Jun 26, 2025Updated 9 months ago
- A Modern Configuration/Registry System designed for deeplearning, with some utils.☆18Dec 23, 2025Updated 3 months ago
- Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022☆19May 1, 2022Updated 3 years ago
- 🕵️♂️ ML project to identify malicious web payloads, aimed at boosting the effectiveness of WAFs and IDSs.☆15Apr 7, 2024Updated 2 years ago
- Automate dating apps with AI☆20Jan 18, 2024Updated 2 years ago
- [ACL 2024 Findings] The code for Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction☆20Nov 4, 2024Updated last year
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Apr 8, 2026Updated last week
- Implementation codes for NeurIPS23 paper "Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts"☆14Mar 19, 2024Updated 2 years ago
- ☆39Jun 14, 2025Updated 10 months ago
- Enformer Celltyping is a tensorflow, multi-headed attention based model that incorporates distal effects of Deoxyribonucleic Acid (DNA) i…☆16Jun 25, 2025Updated 9 months ago
- translate skyzh/mini-lsm to go version☆10Jun 7, 2023Updated 2 years ago
- [Paper][AAAI2023] Analogical Inference Enhanced Knowledge Graph Embedding☆13Jan 19, 2023Updated 3 years ago
- ☆18Nov 12, 2025Updated 5 months ago
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit…☆32Feb 5, 2026Updated 2 months ago
- code for [ACL23] An AMR-based Link Prediction Approach for Document-level Event Argument Extraction☆24Oct 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Dec 11, 2024Updated last year
- ☆11Mar 8, 2024Updated 2 years ago
- code for ACL 2023 paper 'Event Extraction as Question Generation and Answering'☆24Aug 13, 2023Updated 2 years ago
- Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”☆17Mar 3, 2022Updated 4 years ago
- 华中科技大学Linux协会(HUSTLUG)开源镜像站☆12Oct 26, 2023Updated 2 years ago
- ☆13Nov 12, 2021Updated 4 years ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 4 months ago
- ☆53Nov 22, 2025Updated 4 months ago
- WIP: Unnoficial implementation of diffusion autoencoders, using pytorch☆11Feb 15, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- USC CSCI 571 2020 fall, Web Technologies☆20Dec 17, 2020Updated 5 years ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 7 months ago
- awesome SAE papers☆75May 24, 2025Updated 10 months ago
- ☆24May 25, 2022Updated 3 years ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆19Apr 9, 2026Updated last week
- CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation☆50Apr 9, 2026Updated last week
- [NeurIPS 2023] "Understanding the Limitations of Deep Models for Molecular Property Prediction: Insights and Solutions"☆12Jan 26, 2024Updated 2 years ago