Code release for "LLMs can see and hear without any training"
☆457May 8, 2025Updated 9 months ago
Alternatives and similar repositories for MILS
Users that are interested in MILS are comparing it to the libraries listed below
Sorting:
- ☆200May 5, 2025Updated 10 months ago
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆343Oct 24, 2025Updated 4 months ago
- Plugin Marketplace for Claude Code☆20Feb 8, 2026Updated 3 weeks ago
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated last year
- [ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆840Jan 28, 2026Updated last month
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- A Simple Scenes Based Movie Generation App☆52Nov 8, 2024Updated last year
- ☆10Feb 14, 2025Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆362May 21, 2025Updated 9 months ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago
- Animating R1's thoughts.☆383Feb 17, 2025Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆240Apr 3, 2023Updated 2 years ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,125May 19, 2025Updated 9 months ago
- Live-bending a foundation model’s output at neural network level.☆272Apr 7, 2025Updated 10 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆42Updated this week
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆640Nov 10, 2025Updated 3 months ago
- Fully neural approach for text chunking☆406Oct 23, 2025Updated 4 months ago
- [ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs☆310May 21, 2025Updated 9 months ago
- ☆13Jul 10, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Make your LLM agent and chat with it simple and fast!☆68Nov 22, 2025Updated 3 months ago
- Transductive regular expressions☆254Sep 25, 2025Updated 5 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated 9 months ago
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Mar 7, 2025Updated 11 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆349Oct 22, 2024Updated last year
- Everything about the SmolLM and SmolVLM family of models☆3,636Jan 13, 2026Updated last month
- The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025☆276May 26, 2025Updated 9 months ago
- Neurox control helm chart details☆30Apr 29, 2025Updated 10 months ago
- 🍃 MINT-1T: A one trillion token multimodal interleaved dataset.☆829Jul 31, 2024Updated last year
- ☆42Jul 9, 2025Updated 7 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 10 months ago
- Rewriting Principia Mathematica in Lean☆138Feb 5, 2026Updated last month
- ☆266Mar 6, 2025Updated 11 months ago
- ☆136Aug 11, 2025Updated 6 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,307Feb 5, 2026Updated last month
- Music production for silent film clips.☆32Apr 30, 2025Updated 10 months ago
- A universal RPC layer for AI agents. Connect to any function, any language, any framework, in minutes.☆127Nov 24, 2025Updated 3 months ago
- Things you can do with the token embeddings of an LLM☆1,453Dec 1, 2025Updated 3 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated 10 months ago