SmallDoges / small-dogeLinks
Doge Family of Small Language Models
☆174Updated last month
Alternatives and similar repositories for small-doge
Users that are interested in small-doge are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆239Updated 11 months ago
- Codura is an intelligent code assistant designed to supercharge your IDE with context-aware code completion, inline explanations, test ca…☆39Updated 2 months ago
- Flash Dynamic Mask Attention☆287Updated this week
- ☆173Updated 5 months ago
- 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.☆76Updated 3 months ago
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆274Updated 10 months ago
- Official code for paper "Learning to Use Tools via Cooperative and Interactive Agents"☆218Updated last year
- Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆195Updated 2 months ago
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆141Updated 6 months ago
- LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in …☆526Updated 2 months ago
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆342Updated 5 months ago
- ☆195Updated 9 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆270Updated 2 months ago
- ☆109Updated 3 months ago
- Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations…☆343Updated 3 months ago
- ☆112Updated last year
- A python library for social event detection☆583Updated last month
- Official repo for 'Large Multimodal Models Evaluation: A Survey'☆73Updated last week
- Toolkit for Prompt Compression☆272Updated 7 months ago
- GPT-4 level function calling models for real-world tool using use cases☆225Updated 11 months ago
- 🏆 ICML 2025 Spotlight☆306Updated 2 months ago
- jyf-drawing-board是一个背景透明的Web画板项目,使用HTML5 的<canvas>元素来实现绘图功能。☆20Updated 7 months ago
- This project is a text editor developed using Qt, integrating common text editing features and providing management capabilities for vari…☆12Updated last month
- Multi-agent to generate LangGPT prompts.☆171Updated 8 months ago
- This is the code related to "🔥Effective Training Data Synthesis for Improving MLLM Chart Understanding" (ICCV 2025).☆54Updated last month
- ☆121Updated 2 weeks ago
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆71Updated 5 months ago
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆176Updated 6 months ago
- Llama-github is an open-source Python library that empowers LLM Chatbots, AI Agents, and Auto-dev Solutions to conduct Agentic RAG from a…☆302Updated 3 weeks ago
- Run o3-pro on your computer. 🌌☆22Updated 6 months ago