GeoGuessr benchmark for language models
☆56Mar 17, 2026Updated last month
Alternatives and similar repositories for geobench
Users that are interested in geobench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 4 months ago
- Automated Safety Testing of Large Language Models☆18Jan 31, 2025Updated last year
- ☆11Apr 3, 2023Updated 3 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Apr 13, 2026Updated 3 weeks ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 4 years ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆15Apr 22, 2024Updated 2 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- A holistic framework for advancing LLMs as data science agents☆40Feb 3, 2026Updated 3 months ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Benchmark for agentic spatial data analysis☆27Apr 29, 2026Updated last week
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5☆13Updated this week
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- EdgeRag is a program that runs large language models and vector databases on your local device☆14May 29, 2024Updated last year
- First Latency-Aware Competitive LLM Agent Benchmark☆28Jun 3, 2025Updated 11 months ago
- ☆11Feb 11, 2026Updated 2 months ago
- Paper-reading notes for Berkeley OS prelim exam.☆14Aug 28, 2024Updated last year
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CLI that uses DSPy to interact with MCP servers.☆24Mar 10, 2025Updated last year
- ☆14Sep 22, 2020Updated 5 years ago
- ☆11Jul 7, 2021Updated 4 years ago
- ☆14Mar 1, 2021Updated 5 years ago
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆15Mar 31, 2023Updated 3 years ago
- Randomize txt2img generation params☆26Dec 2, 2023Updated 2 years ago
- Code repository for the paper: The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs☆23Mar 27, 2025Updated last year
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆20Aug 1, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…☆14Aug 13, 2024Updated last year
- Original reference implementation of "Analyzing the Internals of Neural Radiance Fields"☆11Apr 10, 2024Updated 2 years ago
- [SIGGRAPH ASIA 2025] PyTorch implementation of "Inbetweening from Two Single-View Images to 4D Generation"☆14Sep 24, 2025Updated 7 months ago
- AMR-Visualization Tools, show AMR graph strcucture☆12Jul 29, 2019Updated 6 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- A robot that can interpret sheet music, convert it to a midi, then play it through on the piano☆13Sep 22, 2024Updated last year
- The official implementation of our ICCV 2023 publication, C-VisDiT☆10Oct 23, 2024Updated last year