ShareChatAI / 3MASSIV
☆11Updated 2 years ago
Alternatives and similar repositories for 3MASSIV:
Users who are interested in 3MASSIV are comparing it to the libraries listed below:
- A dashboard for exploring timm learning rate schedulers☆19Updated 2 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 6 months ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 2 years ago
- Visionner turns raw image data into NumPy arrays, which are more suitable for deep learning tasks☆10Updated last year
- Describe the format of image/text datasets☆11Updated 2 years ago
- A collection of building blocks for building fine-tunable metric learning models☆32Updated last month
- ☆24Updated 3 years ago
- ☆19Updated 2 years ago
- Load any CLIP model with a standardized interface☆21Updated 9 months ago
- Using a Gradio interface to build a UI for converting text to speech☆12Updated 4 years ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆17Updated 6 months ago
- ☆12Updated 9 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 8 months ago
- ☆15Updated 3 years ago
- ☆28Updated last year
- Tools for merging pretrained large language models.☆19Updated 8 months ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video…☆30Updated 7 months ago
- Index of URLs to pdf files all over the internet and scripts☆21Updated last year
- Aggregating embeddings over time☆31Updated 2 years ago
- Python Tools for Visual Dataset Transformation☆26Updated 2 months ago
- My implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆27Updated last year
- An open source implementation of CLIP.☆32Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15Updated 3 years ago
- Create a source of truth for ML model results and browse it on Papers with Code☆26Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆37Updated 2 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆14Updated 4 months ago