Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
☆23Jul 27, 2024Updated last year
Alternatives and similar repositories for M4LE
Users that are interested in M4LE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Oct 16, 2023Updated 2 years ago
- ☆14Nov 20, 2022Updated 3 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Notes on Deep Reinforcement Learning for Natural Language Processing papers☆30Jul 17, 2017Updated 8 years ago
- A PyTorch re-implementation of the persona-based neural conversation model proposed by Jiwei Li, Michel Galley, Chris Brockett, Georgios …☆26Apr 30, 2020Updated 6 years ago
- ☆11May 10, 2018Updated 7 years ago
- The code for “PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search”☆19Mar 13, 2024Updated 2 years ago
- ☆13Nov 2, 2025Updated 6 months ago
- ☆36Mar 25, 2024Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆38Sep 5, 2024Updated last year
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆18Feb 24, 2025Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Apr 16, 2021Updated 5 years ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆194Oct 8, 2024Updated last year
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- Qualifying Exam Preparing☆17May 7, 2025Updated last year
- [ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"☆14Aug 28, 2024Updated last year
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Mar 31, 2025Updated last year
- Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"☆27Oct 24, 2019Updated 6 years ago
- Paradigm shift in natural language processing☆42May 29, 2022Updated 3 years ago
- 逻辑回归和单层softmax的解析解☆12Jul 29, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆64Oct 3, 2024Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- 随机扒取古诗文词语作为git的commit msg☆11Jan 16, 2017Updated 9 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆23Sep 25, 2025Updated 7 months ago
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.☆12Aug 15, 2019Updated 6 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆54Sep 3, 2024Updated last year
- ☆14Jul 5, 2023Updated 2 years ago
- Does patch ordering affect context-limited vision transformers?☆17Oct 10, 2025Updated 6 months ago
- Graph homomorphism and retract searching☆11Aug 5, 2015Updated 10 years ago
- Seniment Analysis in Torchtext☆19Apr 29, 2018Updated 8 years ago
- Elasticsearch to CSV Python Script☆11Jan 28, 2014Updated 12 years ago