chatnoir-eu / web-content-extraction-benchmarkLinks
Web Content Extraction Benchmark
☆17Updated last year
Alternatives and similar repositories for web-content-extraction-benchmark
Users that are interested in web-content-extraction-benchmark are comparing it to the libraries listed below
Sorting:
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- ☆62Updated 10 months ago
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text…☆81Updated 3 weeks ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆87Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Updated last year
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated 7 months ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 7 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆142Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆135Updated 6 months ago
- ☆172Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Pre-training code for CrystalCoder 7B LLM☆54Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- ☆57Updated 8 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- SILO Language Models code repository☆81Updated last year
- ☆69Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆76Updated 6 months ago
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset☆24Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- ☆27Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆89Updated 6 months ago
- ☆68Updated 2 years ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆107Updated 2 weeks ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆95Updated last year
- ☆121Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- This repository contains ScholarQABench data and evaluation pipeline.☆72Updated last month
- ☆61Updated 7 months ago