google-deepmind / neptune
☆18Updated last month
Related projects ⓘ
Alternatives and complementary repositories for neptune
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆11Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- ☆21Updated 7 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆25Updated 4 months ago
- ☆38Updated last year
- SSL Video Representation Learning project☆10Updated 11 months ago
- Code for T-MARS data filtering☆35Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆16Updated this week
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆37Updated last year
- ☆24Updated 3 years ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 10 months ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆48Updated 4 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆32Updated 4 months ago
- A huge dataset for Document Visual Question Answering☆13Updated 3 months ago
- Official implementation for Sparse MetA-Tuning (SMAT)☆14Updated 4 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆29Updated 4 months ago
- ☆12Updated 2 months ago
- Code for "Merging Text Transformers from Different Initializations"☆19Updated 3 months ago
- DPO, but faster 🚀☆21Updated 2 weeks ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Directed masked autoencoders☆14Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆71Updated 6 months ago
- Load any clip model with a standardized interface☆21Updated 6 months ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆20Updated 8 months ago
- Recursive Visual Programming☆15Updated 4 months ago
- ☆19Updated last year
- ☆24Updated 9 months ago