bethgelab / CiteMELinks
CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.
☆48Updated 8 months ago
Alternatives and similar repositories for CiteME
Users that are interested in CiteME are comparing it to the libraries listed below
Sorting:
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated 8 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆42Updated 2 weeks ago
- ☆69Updated last month
- Mixing Language Models with Self-Verification and Meta-Verification☆106Updated 7 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 9 months ago
- ☆20Updated 4 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆81Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆48Updated 3 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆58Updated 7 months ago
- ☆86Updated 6 months ago
- ☆51Updated 3 weeks ago
- PyTorch library for Active Fine-Tuning☆87Updated 5 months ago
- accompanying material for sleep-time compute paper☆97Updated 2 months ago
- ☆118Updated 10 months ago
- Discovering Data-driven Hypotheses in the Wild☆99Updated last month
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆86Updated last year
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆33Updated 5 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- ☆33Updated 2 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆71Updated 7 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆245Updated 9 months ago
- ☆22Updated 3 weeks ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆95Updated 3 months ago
- Code/data for MARG (multi-agent review generation)☆44Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆66Updated last month
- Evaluation of neuro-symbolic engines☆38Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 3 months ago