bnewm0609 / arxivDIGESTablesLinks
☆17Updated 9 months ago
Alternatives and similar repositories for arxivDIGESTables
Users that are interested in arxivDIGESTables are comparing it to the libraries listed below
Sorting:
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Updated last year
- Common tools for data processing☆17Updated 4 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆77Updated this week
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 10 months ago
- ☆19Updated 2 years ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset.☆16Updated 3 years ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆97Updated 2 months ago
- ☆57Updated 10 months ago
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines☆19Updated 11 months ago
- CodeRepoQA dataset☆12Updated 5 months ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆37Updated 7 months ago
- Scientific articles using or citing Common Crawl data☆26Updated this week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Updated 5 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆93Updated 8 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 8 months ago
- AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆24Updated 2 weeks ago
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆209Updated 2 weeks ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated 9 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆40Updated 4 months ago
- Autonomous Generalist Scientist / AI Scientist / Agent Scientist / Robot Scientist☆20Updated 2 months ago
- A repository of Juris-M style modules☆16Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- ☆32Updated 2 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Updated last year
- ☆17Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024☆26Updated 9 months ago
- ☆76Updated this week
- ☆30Updated 5 months ago