thunlp / LLMxMapReduce
☆123Updated last month
Related projects ⓘ
Alternatives and complementary repositories for LLMxMapReduce
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- ☆287Updated 2 months ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆71Updated last week
- Reformatted Alignment☆112Updated last month
- The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆99Updated 3 weeks ago
- ☆78Updated last month
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆220Updated 3 weeks ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆192Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆204Updated this week
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆217Updated 6 months ago
- Expert Specialized Fine-Tuning☆145Updated last month
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆61Updated last week
- ☆83Updated 2 weeks ago
- ☆217Updated 3 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆126Updated 5 months ago
- ☆154Updated 2 weeks ago
- FuseAI Project☆76Updated 3 months ago
- ☆49Updated 2 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆130Updated 3 months ago
- Mixture-of-Experts (MoE) Language Model☆180Updated 2 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆124Updated 4 months ago
- Awesome papers for role-playing with language models☆122Updated 2 weeks ago
- 🤠 Agent-as-a-Judge and DevAI dataset☆192Updated this week
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆111Updated last week
- ☆116Updated 5 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆106Updated last month
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆79Updated 2 weeks ago
- ☆78Updated 7 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆285Updated last month
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆491Updated 2 weeks ago