EurekaLabsAI / ngram
The n-gram Language Model
☆1,363Updated 5 months ago
Alternatives and similar repositories for ngram:
Users that are interested in ngram are comparing it to the libraries listed below
- The Multilayer Perceptron Language Model☆532Updated 5 months ago
- The Autograd Engine☆550Updated 4 months ago
- nanoGPT style version of Llama 3.1☆1,290Updated 5 months ago
- NanoGPT (124M) in 3.4 minutes☆2,068Updated last week
- The Tensor (or Array)☆418Updated 5 months ago
- Video+code lecture on building nanoGPT from scratch☆3,782Updated 5 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆752Updated this week
- UNet diffusion model in pure CUDA☆596Updated 6 months ago
- A PyTorch native library for large model training☆3,091Updated this week
- System 2 Reasoning Link Collection☆722Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆1,713Updated this week
- Puzzles for learning Triton☆1,300Updated last month
- An ML Systems Onboarding list☆647Updated 2 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,217Updated last month
- DataComp for Language Models☆1,206Updated last month
- Everything about the SmolLM & SmolLM2 family of models☆1,554Updated last week
- Code for BLT research paper☆1,314Updated this week
- ☆4,050Updated 7 months ago
- Training LLMs with QLoRA + FSDP☆1,436Updated 2 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆831Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆167Updated 5 months ago
- ☆1,403Updated last week
- Recipes to scale inference-time compute of open models☆932Updated this week
- An autoregressive character-level language model for making more things☆2,702Updated 7 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,197Updated 2 months ago
- What would you do with 1000 H100s...☆948Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,749Updated last month
- gpt-2 from scratch in mlx☆367Updated 7 months ago
- Implementation for MatMul-free LM.☆2,941Updated 2 months ago