stanford-cs336 / assignment4-data
☆23Updated 2 months ago
Alternatives and similar repositories for assignment4-data
Users who are interested in assignment4-data are comparing it to the repositories listed below
- Simple & Scalable Pretraining for Neural Architecture Research☆294Updated last month
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆78Updated 2 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆124Updated last month
- ☆62Updated 2 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆93Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- ☆133Updated 4 months ago
- GPTQ and efficient search for GGUF☆48Updated last week
- qwen3 experiments☆31Updated 2 months ago
- ☆155Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 8 months ago
- ☆68Updated 4 months ago
- [NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆108Updated last week
- ☆57Updated 3 months ago
- Easily view and modify JSON datasets for large language models☆83Updated 4 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 8 months ago
- ☆115Updated 3 months ago
- ☆57Updated 7 months ago
- An MCP server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 2 months ago
- Steering LLM Thinking with Budget Guidance☆24Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆99Updated 3 weeks ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated 3 weeks ago
- ☆104Updated 3 months ago
- LLM Inference on consumer devices☆124Updated 6 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated 7 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆36Updated 3 weeks ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆33Updated this week
- ☆51Updated last year
- Simple examples using Argilla tools to build AI☆55Updated 10 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆42Updated 2 weeks ago