hesamsheikh / AnimAI-Trainer
Train an LLM to generate cracked Manim animations for mathematical concepts.
☆16 Updated 2 months ago
Alternatives and similar repositories for AnimAI-Trainer
Users who are interested in AnimAI-Trainer are comparing it to the repositories listed below.
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆32 Updated 2 weeks ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours. ☆22 Updated 2 months ago
- Coding an LLM and its building blocks from scratch. ☆38 Updated 2 months ago
- rl from zero pretrain, can it be done? we'll see. ☆24 Updated this week
- ☆162 Updated 2 weeks ago
- Transformers from scratch using PyTorch & NumPy. ☆24 Updated 4 months ago
- ☆46 Updated 2 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆58 Updated 2 weeks ago
- ☆36 Updated 2 weeks ago
- ☆38 Updated 3 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆76 Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆95 Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64 Updated 7 months ago
- Fine tune Gemma 3 on an object detection task ☆46 Updated this week
- ☆39 Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆67 Updated 2 months ago
- ☆84 Updated 3 weeks ago
- Personal project, Generative AI, Streamlit, Python ☆52 Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆80 Updated last year
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️ ☆48 Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆196 Updated last month
- ☆92 Updated 2 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆15 Updated 2 months ago
- a simple CLI command that will create a template of a generic ML Project ☆80 Updated 8 months ago
- Here's all my Python/Numba (CUDA) code for the encoder block I made :) ☆63 Updated last month
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆38 Updated last month
- ☆75 Updated 5 months ago
- This repository contains a simple llama3 implementation in pure jax. ☆64 Updated 3 months ago
- ☆59 Updated 2 weeks ago
- Compiling useful links, papers, benchmarks, ideas, etc. ☆46 Updated 2 months ago