The-FinAI / Fino1Links
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
☆70Updated 7 months ago
Alternatives and similar repositories for Fino1
Users that are interested in Fino1 are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆143Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Updated 8 months ago
- ☆104Updated last year
- ☆67Updated 10 months ago
- ☆118Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- ☆96Updated last year
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆74Updated 5 months ago
- ☆23Updated last year
- Designing Multi-Agent Systems with Zero Supervision☆113Updated 7 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆61Updated last year
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆118Updated 4 months ago
- ☆53Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆110Updated 4 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆130Updated 10 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆36Updated last year
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆49Updated 10 months ago
- ☆46Updated 8 months ago
- [NeurIPS 2025] A multi-agent framework that leverages LLMs to simulate socio-economic systems☆45Updated 3 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆87Updated 3 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆200Updated 3 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Updated last year
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 4 months ago
- ☆63Updated last year
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆33Updated 10 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆137Updated 5 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆81Updated last year
- ☆84Updated last year