Tencent / Tencent-Hunyuan-Large
☆1,164 Updated last week
Alternatives and similar repositories for Tencent-Hunyuan-Large:
Users who are interested in Tencent-Hunyuan-Large are comparing it to the repositories listed below.
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. ☆1,963 Updated this week
- ☆883 Updated 5 months ago
- Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆3,371 Updated 2 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B ☆2,151 Updated 3 months ago
- Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability. ☆525 Updated 3 weeks ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM ☆972 Updated last month
- Next-Token Prediction is All You Need ☆1,869 Updated last month
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆1,181 Updated 3 weeks ago
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆814 Updated 4 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆3,700 Updated 2 months ago
- ☆930 Updated last week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) ☆4,012 Updated 3 weeks ago
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. ☆555 Updated last week
- The First Multimodal Search Engine Pipeline and Benchmark for LMMs ☆399 Updated last month
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆2,111 Updated 7 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,018 Updated 10 months ago
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆780 Updated 4 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,025 Updated this week
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. ☆1,271 Updated 3 months ago
- An LLM-based Web Navigating Agent (KDD'24) ☆760 Updated 2 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, … ☆4,216 Updated this week
- SEED-Story: Multimodal Long Story Generation with Large Language Model ☆755 Updated last month
- LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs ☆1,519 Updated last month
- DeepSeek LLM: Let there be answers ☆1,500 Updated 9 months ago
- HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance ☆1,532 Updated last month
- VideoSys: An easy and efficient system for video generation ☆1,789 Updated last week
- Efficient AI Inference & Serving ☆459 Updated 10 months ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning ☆1,489 Updated this week
- [NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces in… ☆815 Updated this week
- InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output ☆2,540 Updated last month