☆64Oct 6, 2025Updated 5 months ago
Alternatives and similar repositories for insight-bench
Users that are interested in insight-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16May 6, 2025Updated 10 months ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year
- ☆31Jul 3, 2025Updated 8 months ago
- python package for calculating famous measures in computational linguistics☆15Nov 5, 2024Updated last year
- Website☆12Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆11Dec 1, 2020Updated 5 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆27Mar 10, 2026Updated 2 weeks ago
- The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…☆11Nov 28, 2023Updated 2 years ago
- A Python library for variational inference with normalizing flow and annealing☆15Mar 13, 2025Updated last year
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- a source code for automatic data visualization and recommendation☆14Jul 12, 2018Updated 7 years ago
- The implementation for the work "Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems".☆22Jun 11, 2025Updated 9 months ago
- An Open-source Factuality Evaluation Demo for LLMs☆32Feb 23, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- GATSBI: Generative Adversarial Training for Simulation-Based Inference☆19Jul 13, 2023Updated 2 years ago
- An awesome & curated list of anything that might be useful for computer science students☆13Mar 27, 2023Updated 2 years ago
- Martingale Posteriors with Copulas☆23Mar 19, 2024Updated 2 years ago
- [ICLR 2025] EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing☆24Apr 1, 2025Updated 11 months ago
- ☆15Feb 18, 2021Updated 5 years ago
- Kernel Playground - A playground to run large scale experiments on the Linux Kernel☆18Nov 8, 2025Updated 4 months ago
- ☆10Jun 4, 2024Updated last year
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 4 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ☆10Feb 6, 2025Updated last year
- LangChain + LiteLLM that works☆50Sep 1, 2025Updated 6 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆183May 29, 2025Updated 9 months ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- L4: Practical loss-based stepsize adaptation for PyTorch☆18May 7, 2021Updated 4 years ago
- Learned User Representations in Online Social Networks (Twitter) using Temporal Dynamics of Information Diffusion.☆10Oct 15, 2018Updated 7 years ago
- ⚠️ ARCHIVED - All development moved to https://github.com/itbench-hub/ITBench/tree/main/scenarios☆15Feb 24, 2026Updated last month
- Codes for the paper "Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation"☆14Nov 24, 2022Updated 3 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 7 months ago
- Official implementation: Large Language Models are Interpretable Learners - Google☆13Jun 29, 2024Updated last year