YihongT / LLMSynthorLinks
☆21Updated 6 months ago
Alternatives and similar repositories for LLMSynthor
Users that are interested in LLMSynthor are comparing it to the libraries listed below
Sorting:
- ☆84Updated last year
- ☆67Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- When Reasoning Meets Its Laws☆33Updated last week
- ☆63Updated last year
- ☆31Updated last year
- ☆39Updated last year
- ☆46Updated 6 months ago
- ☆23Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 11 months ago
- ☆96Updated last year
- ☆53Updated last year
- ☆41Updated 7 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆45Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Training setup for Langchain's Open Deep Research☆74Updated 4 months ago
- Reproducible Language Agent Research☆32Updated 6 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆110Updated 3 months ago
- ☆20Updated 9 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 6 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- ☆18Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆119Updated 7 months ago