alphaXiv / TinyRecursiveModels
☆28 Updated 2 weeks ago
Alternatives and similar repositories for TinyRecursiveModels
Users who are interested in TinyRecursiveModels are comparing it to the repositories listed below.
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch ☆217 Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- How to build the best search, one step at a time! ☆229 Updated last month
- a tiny vectorstore implementation built with numpy. ☆63 Updated last year
- Here's all my Python/Numba (CUDA) code for the encoder block I made :) ☆68 Updated 8 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆305 Updated 3 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 4 months ago
- Agent Engineering course files ☆71 Updated 5 months ago
- ☆22 Updated 7 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆27 Updated 7 months ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models ☆76 Updated 9 months ago
- qwen3 experiments ☆33 Updated 6 months ago
- ☆86 Updated last year
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/ ☆45 Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆84 Updated 9 months ago
- AI agent with RAG+ReAct on Indian Constitution & BNS ☆76 Updated 6 months ago
- look how they massacred my boy ☆63 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 8 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated last year
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile … ☆85 Updated 2 months ago
- everything i know about cuda and triton ☆13 Updated 11 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆397 Updated last month
- Coding an LLM and its building blocks from scratch. ☆104 Updated 9 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆126 Updated this week
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :) ☆26 Updated last year
- Learn the building blocks of how to build gpt-oss from scratch ☆107 Updated 3 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆74 Updated this week
- ☆68 Updated 7 months ago
- Rust Implementation of micrograd ☆53 Updated last year