hesamsheikh / AnimAI-Trainer
Train an LLM to generate cracked Manim animations for mathematical concepts.
☆14Updated last month
Alternatives and similar repositories for AnimAI-Trainer:
Users that are interested in AnimAI-Trainer are comparing it to the libraries listed below
- ☆45Updated 3 weeks ago
- ☆78Updated last week
- Coding an LLM and its building blocks from scratch.☆34Updated 3 weeks ago
- Transformers from scratch using PyTorch & NumPy.☆22Updated 2 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆31Updated 2 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆27Updated this week
- Compiling useful links, papers, benchmarks, ideas, etc.☆42Updated last month
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated last month
- Question paper of courses taught at IISC as part of MTech AI curriculum☆62Updated 4 months ago
- ☆43Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆63Updated last month
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 5 months ago
- a simple CLI command that will create a template of a generic ML Project☆79Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- Hub for researchers exploring VLMs and Multimodal Learning:)☆25Updated this week
- chrome & firefox extension to chat with webpages: local llms☆113Updated 4 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆160Updated this week
- Improving AI Systems with Self-Defense Mechanisms☆13Updated last month
- NanoGPT-speedrunning for the poor T4 enjoyers☆62Updated this week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆52Updated 3 weeks ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆21Updated last month
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆215Updated 3 months ago
- ☆38Updated last month
- Fine tuning ModernBERT-embed-base on synthetic domain specific data for improvement to unseen queries☆27Updated 3 months ago
- 100 days of learning & making kernels in cuda / triton☆22Updated last month
- ☆97Updated 6 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 3 months ago
- Train your own SOTA deductive reasoning model☆88Updated last month
- repo of paper implementations☆18Updated 2 months ago