ShareChatAI / 3MASSIV
☆11Updated 3 years ago
Alternatives and similar repositories for 3MASSIV
Users that are interested in 3MASSIV are comparing it to the libraries listed below
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 6 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Load any clip model with a standardized interface☆21Updated last year
- The collection of building blocks for building fine-tunable metric learning models☆32Updated last month
- ☆24Updated 3 years ago
- Describe the format of image/text datasets☆11Updated 3 years ago
- Visionner turns raw image data into NumPy arrays, which are more suitable for deep learning tasks☆10Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 2 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Index of URLs to pdf files all over the internet and scripts☆23Updated 2 years ago
- Aggregating embeddings over time☆31Updated 2 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆16Updated 4 years ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Updated last year
- ☆28Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 6 months ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆32Updated last year
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 7 months ago
- Using Gradio interface to build UI for converting text to speech☆13Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- A tool for generating datasets from docs☆12Updated 2 months ago
- ☆13Updated last year
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- ☆15Updated 3 months ago
- ☆13Updated 9 months ago