borisdayma / sora-mini
β17Updated 9 months ago
Related projects β
Alternatives and complementary repositories for sora-mini
- Implementation of a holodeck, written in Pytorchβ17Updated last year
- ππ€ A collection of templates for Hugging Face Spacesβ35Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β14Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)β43Updated last month
- The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Modelsβ65Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.β20Updated 5 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β23Updated 10 months ago
- Low-Rank Adaptation of Large Language Models clean implementationβ9Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"β23Updated last week
- β22Updated last year
- QLoRA for Masked Language Modelingβ20Updated last year
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Herβ¦β29Updated last year
- Utilities for PyTorch distributedβ23Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.β14Updated 2 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmindβ53Updated 2 months ago
- Scripts to prep PC for development use after OS installsβ37Updated last week
- Tool to take your ML model from local to production with one-line of code.β23Updated 10 months ago
- The open source community's implementation of the all-new Multi-Modal Causal Attention from "DeepSpeed-VisualChat: Multi-Round Multi-Imagβ¦β12Updated 8 months ago
- My explorations into editing the knowledge and memories of an attention networkβ34Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataβ18Updated 3 months ago
- Training hybrid models for dummies.β15Updated 3 weeks ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proceβ¦β12Updated last week
- Using short models to classify long textsβ20Updated last year
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrintsβ16Updated last month
- β13Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated 10 months ago
- β15Updated 2 weeks ago