Tencent / llm.hunyuan.turbo-s

☆79

Alternatives and similar repositories for llm.hunyuan.turbo-s

Users who are interested in llm.hunyuan.turbo-s are comparing it to the repositories listed below.

neulab / MultiUI
Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding
☆51 Updated 5 months ago
si0wang / ThinkLite-VL
☆74 Updated last week
ZihanWang314 / coeCheck
☆16 Updated 2 months ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆30 Updated 2 months ago
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆83 Updated this week
18907305772 / FuseAI
FuseAI Project
☆86 Updated 3 months ago
gkamradt / SnakeBench
☆84 Updated last week
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆92 Updated 3 weeks ago
huggingface / fineVideo
☆74 Updated 7 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆108 Updated 2 months ago
xdit-project / mochi-xdit
Faster parallel inference of the mochi-1 video generation model
☆119 Updated 2 months ago
kyleliang919 / Super_Muon
☆54 Updated last month
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14 Updated 2 months ago
benchflow-ai / pokemon-gym
☆64 Updated last month
efficientscaling / Z1
Repo for "Z1: Efficient Test-time Scaling with Code"
☆58 Updated last month
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆52 Updated 3 months ago
sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52 Updated 5 months ago
xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆37 Updated 8 months ago
VITA-Group / WeLore
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…
☆47 Updated 3 weeks ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆44 Updated last year
LLM360 / k2-data-prep
☆20 Updated 11 months ago
du-nlp-lab / MLR-Copilot
☆64 Updated last month
RWKV / RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…
☆44 Updated last month
Tencent / llm.hunyuan.T1
☆76 Updated last month
penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆29 Updated last month
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated last month
open-compass / GPassK
Official repository of "Are Your LLMs Capable of Stable Reasoning?"
☆25 Updated last month
zjunlp / DynamicKnowledgeCircuits
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
☆32 Updated 3 weeks ago
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆76 Updated 11 months ago
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆118 Updated last month