LargeWorldModel / LWMLinks
Large World Model -- Modeling Text and Video with Millions Context
☆7,393Updated last year
Alternatives and similar repositories for LWM
Users that are interested in LWM are comparing it to the libraries listed below
Sorting:
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,333Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,601Updated 8 months ago
- ☆4,113Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,299Updated 2 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,846Updated last year
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,499Updated 11 months ago
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,122Updated 3 months ago
- ☆2,547Updated last year
- PyTorch native post-training library☆5,660Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,083Updated last year
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,448Updated last year
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,723Updated last year
- CoreNet: A library for training deep neural networks☆7,018Updated 3 months ago
- official repository of aiXcoder-7B Code Large Language Model☆2,273Updated 6 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,431Updated 11 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,180Updated 5 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,721Updated last week
- Mora: More like Sora for Generalist Video Generation☆1,581Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,153Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,293Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,635Updated 2 weeks ago
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models☆2,299Updated 6 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,734Updated 2 months ago
- Official Code for Stable Cascade☆6,581Updated last year
- Training LLMs with QLoRA + FSDP☆1,537Updated last year
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,855Updated last year
- A PyTorch native platform for training generative AI models☆5,023Updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars☆987Updated last year
- A Pythonic framework to simplify AI service building☆2,804Updated last week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,182Updated last year