siyan-sylvia-li / arxivParserLinks
☆18Updated 2 years ago
Alternatives and similar repositories for arxivParser
Users that are interested in arxivParser are comparing it to the libraries listed below
Sorting:
- ☆25Updated 8 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆53Updated 6 months ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- An attribution library for LLMs☆46Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆24Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- Factored Cognition Primer: How to write compositional language model programs☆50Updated 2 years ago
- Reasoning by Communicating with Agents☆29Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆63Updated 9 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- Prototype advanced LLM algorithms for reasoning and planning.☆99Updated last year
- ☆59Updated last year
- Solve Geometric & Graph Problems with Large Language Models☆32Updated 2 years ago
- PyTorch implementation for MRL☆20Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Updated 10 months ago
- ☆100Updated last year
- ☆95Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆58Updated 10 months ago
- Get answers to research questions from 200M+ papers. Link to demo -☆207Updated 2 months ago
- ☆23Updated 2 years ago
- ☆105Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated 2 years ago
- ☆44Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- Verbosity control for AI agents☆66Updated last year
- Track the progress of LLM context utilisation☆55Updated 9 months ago