AkariAsai / OpenScholar_ExpertEval
This repository contains the expert evaluation interface and data evaluation script for the OpenScholar project.
☆25 Updated 7 months ago
Alternatives and similar repositories for OpenScholar_ExpertEval
Users who are interested in OpenScholar_ExpertEval are comparing it to the repositories listed below.
- This repository contains ScholarQABench data and evaluation pipeline. ☆73 Updated 3 months ago
- ☆66 Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated 10 months ago
- ☆36 Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated 9 months ago
- ☆20 Updated 4 months ago
- ☆45 Updated last month
- ☆69 Updated last month
- ☆47 Updated 9 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 8 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 5 months ago
- ☆40 Updated 7 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 5 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆41 Updated 4 months ago
- ☆22 Updated 3 weeks ago
- ☆20 Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆95 Updated last month
- ☆56 Updated 7 months ago
- ☆54 Updated last year
- ☆50 Updated 2 weeks ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆35 Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆115 Updated 8 months ago
- ☆62 Updated 11 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆29 Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM ☆80 Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆64 Updated 11 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆94 Updated 2 months ago
- Verifiers for LLM Reinforcement Learning ☆64 Updated 3 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆60 Updated last week
- ☆63 Updated last year