☆150Jan 4, 2024Updated 2 years ago
Alternatives and similar repositories for gemini-benchmark
Users that are interested in gemini-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Apache 2.0 fork of HuggingFace's Large Language Model Text Generation Inference☆19Mar 10, 2024Updated 2 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- ☆165Nov 23, 2024Updated last year
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆34Aug 10, 2021Updated 4 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆44Apr 30, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆102Dec 22, 2023Updated 2 years ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆39Oct 8, 2025Updated 6 months ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 4 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆79Nov 14, 2024Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆420Apr 4, 2025Updated last year
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆592Dec 9, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NeurIPS 2023] Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning☆16Apr 15, 2024Updated 2 years ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆64Mar 26, 2024Updated 2 years ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆116Sep 26, 2024Updated last year
- [CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning☆22Jun 11, 2023Updated 2 years ago
- Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.☆12May 26, 2023Updated 2 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- ☆55Apr 1, 2024Updated 2 years ago
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation☆25Jul 8, 2023Updated 2 years ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,115Jun 1, 2023Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- ☆134Dec 22, 2023Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆556Oct 28, 2023Updated 2 years ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆844Feb 3, 2025Updated last year
- Multimodal computer agent data collection program☆167Dec 5, 2025Updated 4 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆201Dec 8, 2025Updated 4 months ago
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆359Sep 29, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆106Nov 9, 2023Updated 2 years ago
- GPT-4V(ision) as A Social Media Analysis Engine☆39Dec 20, 2024Updated last year
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆129Jul 26, 2023Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Nov 13, 2023Updated 2 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago