hu-po / streamdocsLinks
Documentation, notes, links, etc for streams.
☆84Updated last year
Alternatives and similar repositories for streamdocs
Users that are interested in streamdocs are comparing it to the libraries listed below
Sorting:
- documentation for content creation☆234Updated 3 months ago
- Implementation of a framework for Genie2 in Pytorch☆156Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆253Updated last year
- Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗☆297Updated 11 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆281Updated last year
- ☆203Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆280Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆98Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated 2 years ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆183Updated last year
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight]☆182Updated 8 months ago
- This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.☆237Updated last year
- Implementation of the premier Text to Video model from OpenAI☆56Updated last year
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆293Updated 7 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆70Updated last year
- LLaVA-Interactive-Demo☆380Updated last year
- Data release for the ImageInWords (IIW) paper.☆224Updated last year
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆249Updated last year
- ☆71Updated last year
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆220Updated 3 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 7 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆44Updated last year
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆214Updated 11 months ago
- ☆306Updated 9 months ago
- LoRA and DoRA from Scratch Implementations☆215Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Updated last year
- Official PyTorch implementation of TokenSet.☆127Updated 10 months ago
- A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much muc…☆198Updated last week
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago