[EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science
☆34Oct 25, 2024Updated last year
Alternatives and similar repositories for BLADE
Users who are interested in BLADE are comparing it to the libraries listed below
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆40Mar 7, 2024Updated last year
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆182May 29, 2025Updated 9 months ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- A curated list of papers on LLMs and agents for scientific research and development☆86Dec 11, 2024Updated last year
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated last year
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆106Aug 17, 2025Updated 6 months ago
- ☆31Jun 24, 2024Updated last year
- ☆12Feb 19, 2024Updated 2 years ago
- ☆12Aug 21, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- Fast structured perceptron sequential labeler☆15Dec 8, 2015Updated 10 years ago
- Main repo for GIOROM☆18Sep 28, 2025Updated 5 months ago
- Repository containing the website for the EMNLP 2023 conference☆17Feb 12, 2025Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated last month
- ☆18Jan 3, 2025Updated last year
- A dataset of 80 million constraint-preserving transformations of CAD sketches☆13Nov 22, 2024Updated last year
- An LLM-powered self-studying app using retrieval-augmented generation prompting | Streamlit LLM Hackathon 2023☆17Oct 6, 2023Updated 2 years ago
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆19Dec 6, 2024Updated last year
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21May 4, 2025Updated 9 months ago
- ☆17Oct 22, 2024Updated last year
- [CVPR 2024] Robust Self-calibration of Focal Lengths from the Fundamental Matrix☆46Jan 1, 2025Updated last year
- Discovering Data-driven Hypotheses in the Wild☆130Jun 9, 2025Updated 8 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆18Dec 19, 2024Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 5 months ago
- ☆25Jul 23, 2021Updated 4 years ago
- 📚 Build knowledge bases for RAG☆31Jul 3, 2025Updated 7 months ago
- Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning☆19Jan 27, 2024Updated 2 years ago
- DBpedia Open Text Extraction Challenge - a never ending knowledge acquisition spiral☆19Aug 7, 2017Updated 8 years ago
- Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024☆27Nov 13, 2024Updated last year
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Oct 8, 2023Updated 2 years ago
- Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL 2024 Findings).☆28Feb 10, 2025Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆24Nov 6, 2024Updated last year
- Neuron Activation☆26Nov 21, 2024Updated last year
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆22Apr 13, 2022Updated 3 years ago
- PyTorch implementation of same-family Gaussian mixture models with guardrails. Features separable parameter optimization and singularity …☆26May 31, 2025Updated 9 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year