Tencent / llm.hunyuan.turbo-s
☆79Updated 2 months ago
Alternatives and similar repositories for llm.hunyuan.turbo-s
Users that are interested in llm.hunyuan.turbo-s are comparing it to the libraries listed below
Sorting:
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 5 months ago
- ☆74Updated last week
- ☆16Updated 2 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated 2 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆83Updated this week
- FuseAI Project☆86Updated 3 months ago
- ☆84Updated last week
- EvaByte: Efficient Byte-level Language Models at Scale☆92Updated 3 weeks ago
- ☆74Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- faster parallel inference of mochi-1 video generation model☆119Updated 2 months ago
- ☆54Updated last month
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 2 months ago
- ☆64Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code"☆58Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆52Updated 3 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 5 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆37Updated 8 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated 3 weeks ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- ☆20Updated 11 months ago
- ☆64Updated last month
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆44Updated last month
- ☆76Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last month
- Lego for GRPO☆28Updated last month
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆32Updated 3 weeks ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- Official PyTorch implementation of TokenSet.☆118Updated last month