FareedKhan-dev / text2video-from-scratchLinks
A Straightforward, Step-by-Step Implementation of a Video Diffusion Model
☆65Updated 3 months ago
Alternatives and similar repositories for text2video-from-scratch
Users that are interested in text2video-from-scratch are comparing it to the libraries listed below
Sorting:
- Building LLaMA 4 MoE from Scratch☆68Updated 7 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆179Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆95Updated 6 months ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 10 months ago
- ☆55Updated 11 months ago
- ☆299Updated 5 months ago
- CursorCore: Assist Programming through Aligning Anything☆132Updated 9 months ago
- ☆156Updated last week
- xllamacpp - a Python wrapper of llama.cpp☆65Updated this week
- From scratch implementation of a vision language model in pure PyTorch☆248Updated last year
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- A new novel multi-modality (Vision) RAG architecture☆32Updated last year
- Maximizing the Performance of a Simple RAG using RL☆83Updated 8 months ago
- Distill thinking dataset more compactly and accurately!☆36Updated 5 months ago
- ☆45Updated 6 months ago
- ☆57Updated 9 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆191Updated last year
- ☆86Updated last year
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆16Updated 2 months ago
- ☆180Updated 3 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 10 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 2 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated last year
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆207Updated 2 weeks ago
- ☆74Updated last year
- ☆18Updated 7 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 8 months ago
- A pipeline parallel training script for LLMs.☆162Updated 6 months ago
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆30Updated last year