Fluid Language Model Benchmarking
☆29Sep 16, 2025Updated 8 months ago
Alternatives and similar repositories for fluid-benchmarking
Users that are interested in fluid-benchmarking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 9 months ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- ☆68Dec 29, 2025Updated 4 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆35Oct 28, 2025Updated 6 months ago
- ☆17Aug 5, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations☆11Oct 13, 2023Updated 2 years ago
- ☆18Sep 3, 2024Updated last year
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆101May 13, 2026Updated last week
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 10 months ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Implementing the OPRO paper☆16Sep 18, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- Code for the paper Don't Pay Attention☆59Sep 25, 2025Updated 8 months ago
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆73Apr 28, 2026Updated 3 weeks ago
- ☆10Jan 8, 2025Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆50Dec 22, 2023Updated 2 years ago
- 🔧🔌 Prototype for programmatically calling and composing MCP tools☆41Feb 23, 2026Updated 3 months ago
- Introduction to the Shared Computing Cluster☆10Jan 23, 2026Updated 4 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆14Nov 6, 2020Updated 5 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- Uses Processing and Perlin Noise to generate a procedural 2D rendering of different landscapes, which are then rendered into 3D☆16Aug 14, 2018Updated 7 years ago
- ☆18Nov 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for NeurIPS 2024 work "MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps"☆17Dec 11, 2024Updated last year
- This is the code which powers the Twitter Bot https://twitter.com/RGB_Colours☆15Apr 14, 2017Updated 9 years ago
- A Swiss Army Knife for computational social choice research☆21Updated this week
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- ☆45Apr 30, 2018Updated 8 years ago
- ☆18Mar 2, 2026Updated 2 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year