hu-po / TubeGPTLinks
YouTube Assistant
β12Updated 2 years ago
Alternatives and similar repositories for TubeGPT
Users that are interested in TubeGPT are comparing it to the libraries listed below
Sorting:
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ88Updated last year
- Finetune any model on HF in less than 30 secondsβ57Updated last week
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)β13Updated 2 years ago
- β54Updated last year
- This project is under development.β23Updated 2 years ago
- β29Updated last year
- LoRA fine-tuned Stable Diffusion Deploymentβ31Updated 2 years ago
- Not financial advice.β28Updated 2 years ago
- Integrate an LLM copilot within your Keras model development workflowβ28Updated last year
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!β40Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zetaβ16Updated 10 months ago
- Simple CogVLM client scriptβ14Updated last year
- The Next Generation Multi-Modality Superintelligenceβ70Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.β66Updated last year
- BH hackathonβ14Updated last year
- β11Updated 2 years ago
- Create topological graph for image segments.β22Updated 11 months ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.β16Updated 4 years ago
- Summarize any Arixv Paper with easeβ66Updated 2 years ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't relβ¦β12Updated last year
- β17Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.β14Updated 3 years ago
- A forest of autonomous agents.β19Updated 7 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ππ¦β63Updated last year
- β13Updated last year
- Interactive Textbook Demoβ45Updated last year
- Simple Implementation of TinyGPTV in super simple Zeta lego blocksβ16Updated 10 months ago
- Notebooks using the Neural Magic libraries πβ39Updated last year
- β14Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataβ21Updated last year