LargeWorldModel / LWMLinks
Large World Model -- Modeling Text and Video with Millions Context
☆7,389Updated last year
Alternatives and similar repositories for LWM
Users that are interested in LWM are comparing it to the libraries listed below
Sorting:
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,330Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,245Updated last month
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,712Updated last year
- ☆4,109Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,169Updated 4 months ago
- High-speed Large Language Model Serving for Local Deployment☆8,503Updated 4 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,225Updated last year
- official repository of aiXcoder-7B Code Large Language Model☆2,272Updated 5 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,353Updated 10 months ago
- 【TMM 2025 🔥】 Mixture-of-Experts for Large Vision-Language Models☆2,285Updated 5 months ago
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,031Updated this week
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,424Updated last year
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,070Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,165Updated last year
- ☆2,552Updated last year
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,426Updated 9 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,437Updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,585Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,327Updated last year
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,097Updated 2 months ago
- Mora: More like Sora for Generalist Video Generation☆1,584Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,594Updated last year
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,128Updated last month
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,305Updated last year
- Gemma open-weight LLM library, from Google DeepMind☆3,908Updated last month
- A Pythonic framework to simplify AI service building☆2,801Updated last week
- Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model☆3,601Updated 7 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,706Updated last month
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,202Updated last year