thunlp / LLMxMapReduce
☆197Updated this week
Alternatives and similar repositories for LLMxMapReduce:
Users that are interested in LLMxMapReduce are comparing it to the libraries listed below
- ☆151Updated 2 weeks ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆443Updated 3 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆200Updated this week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆135Updated 3 weeks ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆234Updated 2 months ago
- ☆94Updated 4 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆294Updated 5 months ago
- ☆314Updated 6 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆356Updated last week
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆124Updated 3 months ago
- ☆265Updated 8 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆228Updated 7 months ago
- ☆218Updated 11 months ago
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆350Updated 11 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆130Updated 8 months ago
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆131Updated this week
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆317Updated 6 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆219Updated 3 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆297Updated 3 weeks ago
- This is the official repository for Auto-RAG.☆206Updated 3 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆171Updated 8 months ago
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆131Updated 2 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆170Updated this week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆486Updated 3 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper☆150Updated this week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆228Updated 5 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆230Updated 10 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆222Updated 3 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆149Updated this week