YihongT / LLMSynthorLinks
☆21Updated 5 months ago
Alternatives and similar repositories for LLMSynthor
Users that are interested in LLMSynthor are comparing it to the libraries listed below
Sorting:
- ☆82Updated last year
- ☆67Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- ☆40Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- ☆24Updated last year
- ☆63Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Updated 2 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆106Updated 2 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆214Updated 2 months ago
- ☆46Updated 6 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 11 months ago
- ☆31Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- ☆95Updated last year
- Efficient Agent Training for Computer Use☆134Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 7 months ago
- ☆52Updated last year
- ☆18Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Updated 10 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆11Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 6 months ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- SSRL: Self-Search Reinforcement Learning☆158Updated 4 months ago
- ☆24Updated 6 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆45Updated 4 months ago