borisdayma / sora-mini
☆17Updated 11 months ago
Alternatives and similar repositories for sora-mini:
Users that are interested in sora-mini are comparing it to the libraries listed below
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 5 months ago
- ☆58Updated 10 months ago
- Low-Rank Adaptation of Large Language Models clean implementation☆9Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- Load any clip model with a standardized interface☆21Updated 8 months ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Updated 4 years ago
- 🚀 🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆13Updated this week
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- ☆18Updated last month
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Utilities for PyTorch distributed☆23Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 2 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆23Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- ☆24Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 7 months ago
- Hugging Face Deep RL Class notes☆10Updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- QLoRA for Masked Language Modeling☆21Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆44Updated 3 months ago
- Visual RAG using less than 300 lines of code.☆24Updated 10 months ago
- Explorations into improving ViTArc with Slot Attention☆37Updated 2 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated 2 years ago