aliyun / qwen-dianjinView external linksLinks
Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)
☆421Feb 5, 2026Updated last week
Alternatives and similar repositories for qwen-dianjin
Users that are interested in qwen-dianjin are comparing it to the libraries listed below
Sorting:
- This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling finan…☆69Jun 23, 2025Updated 7 months ago
- ☆24Aug 19, 2025Updated 5 months ago
- ☆13Mar 16, 2025Updated 11 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated last week
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆22Jan 4, 2026Updated last month
- This work has been accepted to Findings of EMNLP 2025!☆48Sep 5, 2025Updated 5 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆16Oct 18, 2025Updated 3 months ago
- ☆13Jan 22, 2025Updated last year
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated last month
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.☆12May 17, 2025Updated 9 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- Fin-R1 is a large language model for complex financial reasoning developed and open-sourced with the joint efforts of the SUFE-AIFLM-Lab …☆741Mar 27, 2025Updated 10 months ago
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated last month
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Sep 19, 2025Updated 4 months ago
- ☆68Feb 5, 2026Updated last week
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆52Nov 4, 2025Updated 3 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆42Aug 7, 2025Updated 6 months ago
- ☆148Jul 31, 2025Updated 6 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆888Jul 31, 2025Updated 6 months ago
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- ☆15Jun 20, 2024Updated last year
- coze api to openai☆15Sep 1, 2024Updated last year
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆39Feb 10, 2026Updated last week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- a simple lightweight large language model pipeline framework.☆28Apr 25, 2025Updated 9 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆32May 30, 2025Updated 8 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- 金融多模态研究报告生成☆119Jul 2, 2025Updated 7 months ago
- ☆16Jul 23, 2024Updated last year
- Time-R1 is a two-stage reinforcement fine-tuning framework that trains large language models to perform slow-thinking, step-by-step reaso…☆91Jan 28, 2026Updated 2 weeks ago
- Agentic RAG R1 Framework via Reinforcement Learning☆387Jan 30, 2026Updated 2 weeks ago
- ☆96Dec 6, 2024Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 3 months ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆21Feb 17, 2025Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆34Aug 28, 2025Updated 5 months ago