☆33Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for modular-public
Users that are interested in modular-public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆135Feb 15, 2026Updated last month
- METR Task Standard☆178Feb 3, 2025Updated last year
- ☆134Oct 16, 2025Updated 5 months ago
- ☆121Jan 19, 2026Updated 2 months ago
- ☆13Jul 12, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28May 23, 2024Updated last year
- A blog on AI, personal development, and living a good life.☆36Updated this week
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiencyand Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25…☆23Dec 10, 2025Updated 3 months ago
- A Kubernetes sandbox environment for use with inspect_ai☆28Mar 19, 2026Updated last week
- Collection of evals for Inspect AI☆415Updated this week
- Measuring and Controlling Persona Drift in Language Model Dialogs☆22Feb 26, 2024Updated 2 years ago
- ☆23Jun 22, 2025Updated 9 months ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"☆19Jul 24, 2024Updated last year
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆164Mar 19, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Oct 27, 2024Updated last year
- Multi-Camera Direct Sparse Odometry☆20Jun 9, 2020Updated 5 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- ☆22May 25, 2024Updated last year
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆30Dec 8, 2025Updated 3 months ago
- The goal of this repo is to become a benchmark for pentesting☆22Oct 25, 2024Updated last year
- Finance Technical Indicators optimized with Numba☆11Mar 15, 2018Updated 8 years ago
- ☆38Oct 28, 2025Updated 4 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆24Jul 25, 2024Updated last year
- Compare how fine-tuned AI video models interpret the same prompts☆14Jan 29, 2025Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- using multion to find all the commenters under a given reddit post, and DMing a message to them.☆16Jul 21, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Feb 25, 2025Updated last year
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.☆13Jul 27, 2025Updated 7 months ago
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models☆47Feb 26, 2026Updated last month
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆37Feb 28, 2025Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Detect and defend against the nonce race exploit on Polymarket's CTF Exchange☆28Mar 17, 2026Updated last week
- Inspect: A framework for large language model evaluations☆1,851Updated this week
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year
- Code for Voice Jailbreak Attacks Against GPT-4o.☆37May 31, 2024Updated last year
- This crate is now part of the vm-virtio workspace: https://github.com/rust-vmm/vm-virtio☆15Mar 2, 2022Updated 4 years ago
- A framework for building a AI Forecasting Bot for Metaculus. Additionally AI Forecasting tools to help humans forecast the future.☆57Updated this week
- Research attempting to beat a naive dca with volatility forecasts and range position based weightings☆12Jul 9, 2021Updated 4 years ago