Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆33Apr 27, 2025Updated 11 months ago
Alternatives and similar repositories for cs336-a1
Users that are interested in cs336-a1 are comparing it to the libraries listed below.
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- ☆12Jan 10, 2025Updated last year
- This repository contains the pinyin input method assignment I completed while taking an artificial intelligence course.☆11Feb 16, 2022Updated 4 years ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- Higher Order SVD implementation in PyTorch☆13Nov 14, 2022Updated 3 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 10 months ago
- ☆11Aug 1, 2019Updated 6 years ago
- ☆14Dec 17, 2018Updated 7 years ago
- [EMNLP 2024 Main] Official repository of paper "SLANG: New Concept Comprehension of Large Language Models"☆14Oct 27, 2024Updated last year
- ☆12Jul 1, 2017Updated 8 years ago
- ☆11Mar 10, 2023Updated 3 years ago
- ☆13Jul 12, 2024Updated last year
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Jan 26, 2023Updated 3 years ago
- Official Pytorch Code Implementation for "UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control", accepted by ICML 2 …☆32Jan 30, 2026Updated 2 months ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆19May 20, 2024Updated last year
- ☆32Feb 23, 2025Updated last year
- smsckf annotations☆15Nov 5, 2019Updated 6 years ago
- We propose a novel real time monocular Hybrid visual odometry formulation which combines the high precision of indirect approaches with t…☆14Mar 10, 2021Updated 5 years ago
- arxiv.org api for scientific papers☆11Oct 12, 2015Updated 10 years ago
- Story understanding and plot analysis pilot.☆11Dec 27, 2022Updated 3 years ago
- Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.☆27Sep 25, 2025Updated 6 months ago
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- ☆16Oct 29, 2023Updated 2 years ago
- ☆11Aug 21, 2023Updated 2 years ago
- Real or Fake Text? Evaluation criteria for human-written and computer-generated text through the gamification of annotation. Published in…☆11Dec 22, 2022Updated 3 years ago
- a transformer implemented primarily using einops and trained on the tinystories dataset☆13Jun 21, 2024Updated last year
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- ☆15Nov 5, 2024Updated last year
- The corresponding code from our paper "COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆18Jun 27, 2022Updated 3 years ago
- [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆30Aug 2, 2024Updated last year
- ☆15May 20, 2023Updated 2 years ago
- LDSO annotations☆24Nov 6, 2019Updated 6 years ago
- ☆112Jan 18, 2026Updated 2 months ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆18Apr 1, 2025Updated last year
- Torchserve + TensorRT + Detection☆19Feb 16, 2022Updated 4 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 6 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Feb 15, 2024Updated 2 years ago