kyegomez / TinyGPTV
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
☆16Updated 5 months ago
Alternatives and similar repositories for TinyGPTV:
Users who are interested in TinyGPTV are comparing it to the libraries listed below.
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated last year
- A simple reproducible template to implement AI research papers☆23Updated 7 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- Implementation of the premier Text to Video model from OpenAI☆57Updated 5 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 5 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆10Updated this week
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago
- BH hackathon☆14Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆20Updated 2 weeks ago
- Finetune any model on HF in less than 30 seconds☆58Updated 2 weeks ago
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"☆28Updated 2 weeks ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Train, tune, and infer Bamba model☆88Updated 3 months ago
- The open source implementation of the base model behind GPT-4 from OpenAI [Language + Multi-Modal]☆11Updated last year
- Official PyTorch Implementation of Self-emerging Token Labeling☆33Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated last week
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated this week
- Lottery Ticket Adaptation☆39Updated 5 months ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆13Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- Exploration into the Firefly algorithm in PyTorch☆38Updated 2 months ago
- PyTorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆24Updated this week
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; it supports both image and video…☆33Updated 10 months ago