FareedKhan-dev / AI-text-to-video-model-from-scratch
In this blog, we will build a small-scale text-to-video model from scratch. Given a text prompt, our trained model will generate a video based on that prompt.
☆221Updated last year
Alternatives and similar repositories for AI-text-to-video-model-from-scratch
Users who are interested in AI-text-to-video-model-from-scratch are comparing it to the repositories listed below
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆349Updated last year
- ☆732Updated this week
- Have a natural voice conversation with an LLM☆261Updated last week
- The simplest open-source implementation of perplexity.ai☆325Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- An agentic workflow for story book generation☆31Updated 10 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆143Updated 4 months ago
- ☆170Updated last year
- A Deep Research agent from scratch☆214Updated 8 months ago
- ☆152Updated last year
- Implementation of Stable Diffusion with PyTorch☆360Updated 11 months ago
- A collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110Updated last year
- Open Source AI Math Notes☆497Updated last year
- podcastfy.ai gradio demo app☆333Updated last year
- ☆57Updated last year
- ☆209Updated 11 months ago
- ☆171Updated last year
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆239Updated last month
- ☆298Updated last year
- ☆176Updated last year
- ☆80Updated 9 months ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆203Updated last year
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆91Updated last year
- Video Search and Streaming Agent 🕵️♂️☆500Updated last year
- Ace interviews with AI practice. Our agent role-plays a personalized interview tailored to your background, listening and replying like a r…☆123Updated last year
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆361Updated last year
- Model Activity Visualiser☆520Updated 9 months ago
- ☆222Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆197Updated last year
- An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆297Updated last year