Code, Data and Red Teaming for ZeroBench
☆59Dec 23, 2025Updated 3 months ago
Alternatives and similar repositories for zerobench
Users that are interested in zerobench are comparing it to the libraries listed below.
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated 10 months ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆14Nov 3, 2023Updated 2 years ago
- Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs☆27May 24, 2025Updated 10 months ago
- Accompanying repo for NeurIPSW'23: GPT4GEO: How a Language Model Sees the World's Geography☆27May 24, 2025Updated 10 months ago
- Unsupervised multi-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 2 years ago
- rsbuild svg loader☆13Nov 11, 2024Updated last year
- Baidu Maps coordinate picker tool☆12Jan 27, 2018Updated 8 years ago
- PostgreSQL SKILLs for AI Agent☆29Feb 5, 2026Updated last month
- VST that combines the classic mdaPiano and EPiano in a new plug-in☆22Oct 10, 2025Updated 5 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆49Jan 25, 2026Updated 2 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆128Jan 29, 2026Updated last month
- ☆18Sep 20, 2017Updated 8 years ago
- [CVPR 2025] Implementation of "Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models"☆37Apr 28, 2025Updated 10 months ago
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆52Oct 14, 2025Updated 5 months ago
- OllamaFX is a native, lightweight, and professional JavaFX desktop client for Ollama. Run Llama 3, Mistral, and Phi-3 locally with maximu…☆62Mar 6, 2026Updated 2 weeks ago
- ☆14Updated this week
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆61Dec 10, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- Cross Modal Retrieval with Querybank Normalisation☆57Nov 21, 2023Updated 2 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- 🍪 Qinglong Assistant: a Chrome extension that automatically syncs website cookies to the Qinglong panel, with multi-site configuration and full environment-variable management. [by qnloft]☆58Dec 31, 2025Updated 2 months ago
- [ICML 2023] Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆23Aug 19, 2025Updated 7 months ago
- Efficient Multi-Vehicle Trajectory Planning via Centralized Searching Decentralized Optimization☆26Jan 16, 2025Updated last year
- Time is limited, so stay focused: put on information noise-cancelling glasses. Notifies you proactively only when it matters.☆47Dec 26, 2025Updated 3 months ago
- SwooleWebRTC☆27Apr 3, 2020Updated 5 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- ☆23Apr 24, 2025Updated 11 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- Implementation of MixCE method described in ACL 2023 paper by Zhang et al.☆20May 29, 2023Updated 2 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆105Aug 22, 2023Updated 2 years ago
- [ECCV 2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- ☆19Jul 31, 2025Updated 7 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆95Feb 6, 2026Updated last month
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- An implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" in PyTorch☆21Jul 16, 2018Updated 7 years ago