Alx-AI / AI_Diplomacy
☆218Updated this week
Alternatives and similar repositories for AI_Diplomacy:
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
- The Tensor (or Array)☆429Updated 8 months ago
- The Multilayer Perceptron Language Model☆543Updated 8 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆174Updated 8 months ago
- procedural reasoning datasets☆571Updated this week
- The Autograd Engine☆597Updated 7 months ago
- Fast bare-bones BPE for modern tokenizer training☆154Updated 3 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆178Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆786Updated last month
- Build your own visual reasoning model☆341Updated this week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆252Updated 5 months ago
- System 2 Reasoning Link Collection☆826Updated last month
- ☆108Updated 4 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆215Updated 3 months ago
- large population models☆329Updated 2 weeks ago
- UNet diffusion model in pure CUDA☆602Updated 9 months ago
- Solve puzzles to improve your tinygrad skills!☆122Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆438Updated 3 weeks ago
- Our solution for the arc challenge 2024☆134Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆432Updated 6 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆543Updated last week
- Simple Transformer in Jax☆136Updated 10 months ago
- ☆45Updated 3 weeks ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆419Updated 2 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAX☆272Updated 10 months ago
- Learnings and programs related to CUDA☆379Updated 2 months ago
- Testing baseline LLMs performance across various models☆257Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆139Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,184Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated last month