Somewhere between Karpathy's minGPT model and video lectures, BabyGPT is an easy-to-use model on a much smaller scale (16 and 256 output channels, 5 heads, fine-tuned).
☆24Jan 13, 2026Updated 2 months ago
Alternatives and similar repositories for BabyGPT
Users who are interested in BabyGPT are comparing it to the libraries listed below.
- Run baby Llama 2 model in windows☆14Jul 26, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- ☆12May 5, 2024Updated last year
- The diffusion model is simple to implement☆17Oct 10, 2022Updated 3 years ago
- NeurIPS 2022☆39Nov 23, 2022Updated 3 years ago
- A first taste of LLMs☆33Jun 14, 2023Updated 2 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Jun 21, 2023Updated 2 years ago
- Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax☆14Jun 22, 2022Updated 3 years ago
- Co-Developer GPT Engine: server that provides read/write file access to a local directory from ChatGPT as OpenAI GPT actions, incl. execu…☆17Oct 1, 2025Updated 5 months ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 8 months ago
- Generate slides using GPT models☆10Feb 24, 2023Updated 3 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.☆13Mar 8, 2016Updated 10 years ago
- Face recognition using Siamese Networks☆12Nov 29, 2017Updated 8 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- This tool helps you remove ghost followers to boost your instagram engagement!☆15Sep 13, 2023Updated 2 years ago
- DINet-based inference service for video streams and videos☆17Nov 8, 2023Updated 2 years ago
- AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents☆11Dec 4, 2023Updated 2 years ago
- A neural text-processing Python library for context-based feature extraction on sequence-tagging data.☆10Jul 27, 2018Updated 7 years ago
- An automatic video creator using GPT Text Completion API☆11Oct 2, 2023Updated 2 years ago
- ☆12Sep 29, 2017Updated 8 years ago
- Manage your users, devices, door locks, payments, RFID tags and readers with this open-source web-app☆22Aug 25, 2016Updated 9 years ago
- AI Agent Demo Using GPT Function Calling☆13Jul 16, 2023Updated 2 years ago
- ☆15May 12, 2022Updated 3 years ago
- Unofficial implementation of Sketch-Guided Text-to-Image Diffusion Models☆13Jun 19, 2023Updated 2 years ago
- ☆24Dec 24, 2025Updated 3 months ago
- ☆13Oct 24, 2018Updated 7 years ago
- An image processing project that produces face morphing videos☆11Jul 9, 2015Updated 10 years ago
- VectorTalker: SVG Talking Face Generation with Progressive Vectorisation☆15Dec 25, 2023Updated 2 years ago
- Image search with Deep Learning☆14Jan 22, 2018Updated 8 years ago
- ☆29Dec 19, 2023Updated 2 years ago
- Neurocomputing "Deep Multi-Center Learning for Face Alignment"☆12Mar 28, 2020Updated 6 years ago
- ☆49Oct 24, 2023Updated 2 years ago
- ☆17Jun 14, 2023Updated 2 years ago
- Flask skeleton using Bootstrap, SCSS, Docker, console and rotating file logging, HTTP basic auth and web and api views with Blueprint☆10Nov 18, 2018Updated 7 years ago
- ☆16Mar 29, 2022Updated 4 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"☆13Aug 2, 2024Updated last year
- Implementation of simple content-based image retrieval☆12Apr 9, 2018Updated 7 years ago
- 3D Earth Globe to display your travels☆11May 29, 2025Updated 10 months ago
- Intranet penetration (NAT traversal) and port forwarding tool☆10Apr 7, 2022Updated 3 years ago