Tencent / Tencent-Hunyuan-7B
☆16Updated 3 months ago
Alternatives and similar repositories for Tencent-Hunyuan-7B:
Users that are interested in Tencent-Hunyuan-7B are comparing it to the libraries listed below
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆17Updated this week
- ☆29Updated 8 months ago
- ☆27Updated 2 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 7 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆33Updated 10 months ago
- Our 2nd-gen LMM☆33Updated 11 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆38Updated 10 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆10Updated 5 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- ☆17Updated last year
- ☆13Updated 8 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆26Updated 9 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆33Updated last month
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 8 months ago
- ☆32Updated 3 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆19Updated 2 months ago
- ☆44Updated last month
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆30Updated last month
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆24Updated 2 months ago
- Train, tune, and infer Bamba model☆88Updated 3 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- Fine-tune of Florence-2 for shot categorization.☆24Updated last month