PCA-anonymous / PCALinks
☆19Updated 10 months ago
Alternatives and similar repositories for PCA
Users that are interested in PCA are comparing it to the libraries listed below
Sorting:
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆19Updated 7 months ago
- ☆14Updated 11 months ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆19Updated 5 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Updated 10 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Updated 2 years ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆25Updated 2 years ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Updated 2 years ago
- ☆12Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 11 months ago
- ☆18Updated last year
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆52Updated 2 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆58Updated last month
- ☆18Updated last year
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Updated 2 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆48Updated 4 months ago
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆49Updated 2 years ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆80Updated last year
- ☆16Updated 7 months ago
- ☆32Updated 2 years ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Updated 6 months ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆15Updated 6 months ago
- ☆16Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆20Updated last year
- ☆35Updated 3 months ago
- A collection of instruction data and scripts for machine translation.☆20Updated 2 years ago
- Trying to predict a movie's success based on the script (before filming)☆49Updated 5 years ago
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆15Updated last year