chatnoir-eu / web-content-extraction-benchmark
Web Content Extraction Benchmark
☆19, updated last year
Alternatives and similar repositories for web-content-extraction-benchmark
Users who are interested in web-content-extraction-benchmark are comparing it to the repositories listed below.
- A robust web archive analytics toolkit ☆116, updated 5 months ago
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text… ☆85, updated last week
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆83, updated last year
- ☆62, updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users ☆241, updated 10 months ago
- Pretraining Efficiently on S2ORC! ☆168, updated 10 months ago
- 🚢 Data Toolkit for Sailor Language Models ☆94, updated 6 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆96, updated 9 months ago
- The Effect of Sampling Temperature on Problem Solving in Large Language Models ☆23, updated 9 months ago
- Pre-training code for Amber 7B LLM ☆167, updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆148, updated 10 months ago
- ☆23, updated 3 weeks ago
- SSRL: Self-Search Reinforcement Learning ☆131, updated 3 weeks ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent ☆78, updated 2 weeks ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking ☆90, updated last month
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆198, updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆143, updated 10 months ago
- I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment ☆17, updated 8 months ago
- A benchmark that challenges language models to code solutions for scientific problems ☆140, updated last week
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆80, updated last year
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆62, updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆262, updated 2 months ago
- The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models" ☆54, updated last year
- ☆89, updated 10 months ago
- URS Benchmark: Evaluating LLMs on User Reported Scenarios ☆30, updated 3 months ago
- ☆52, updated 10 months ago
- ☆188, updated 2 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆153, updated last year
- Code and Data for "Language Modeling with Editable External Knowledge" ☆34, updated last year
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment ☆76, updated last year