Tencent / llm.hunyuan.T1View external linksLinks
☆82Apr 3, 2025Updated 10 months ago
Alternatives and similar repositories for llm.hunyuan.T1
Users that are interested in llm.hunyuan.T1 are comparing it to the libraries listed below
Sorting:
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 2 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆119Oct 9, 2025Updated 4 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39May 8, 2024Updated last year
- ☆113Sep 13, 2025Updated 5 months ago
- Useful resources for creating apps and working with flow.☆11Oct 28, 2024Updated last year
- ☆16Apr 1, 2025Updated 10 months ago
- the official code of DriveMonkey☆43May 24, 2025Updated 8 months ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated 11 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆175Dec 11, 2025Updated 2 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆220May 31, 2025Updated 8 months ago
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 3 months ago
- ☆74May 30, 2025Updated 8 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Apr 9, 2025Updated 10 months ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆31Feb 1, 2026Updated 2 weeks ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 8 months ago
- An iterative density based clustering algorithm for sparse data(text) clustering.☆16Feb 1, 2024Updated 2 years ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆743Jun 6, 2025Updated 8 months ago
- ☆129Jun 6, 2025Updated 8 months ago
- ☆18Apr 18, 2025Updated 9 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆16Jun 12, 2023Updated 2 years ago
- ☆11Updated this week
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated 11 months ago
- Creating the DeepSeek V3 model from scratch☆25Mar 28, 2025Updated 10 months ago
- ☆19Sep 19, 2024Updated last year
- ☆18Oct 26, 2024Updated last year
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 5 months ago
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference☆283May 1, 2025Updated 9 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆50Sep 21, 2024Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- This repository is a collection of legal instruction datasets☆26Jul 12, 2024Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,209Aug 27, 2025Updated 5 months ago
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 2 months ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Feb 8, 2025Updated last year
- ☆352Jul 29, 2025Updated 6 months ago