☆38Feb 18, 2025Updated last year
Alternatives and similar repositories for arithmetic-self-improve
Users that are interested in arithmetic-self-improve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- Implementation of approximate free-energy minimization in PyTorch☆21Oct 16, 2021Updated 4 years ago
- a few utilities to analyze Caffe prototxt files☆16Sep 27, 2017Updated 8 years ago
- Extending NERDA Library for Continual Learning☆11Mar 31, 2024Updated last year
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆22Mar 6, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆27Feb 1, 2023Updated 3 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆65Jan 26, 2026Updated 2 months ago
- Code for Columbia University COMS 3997 – LLM Ethics and Foundations☆14Jan 7, 2025Updated last year
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆29Feb 5, 2026Updated last month
- ☆32Oct 21, 2025Updated 5 months ago
- Advancing the frontier of efficient AI☆56Mar 20, 2026Updated last week
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆21Jul 13, 2025Updated 8 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Omnigrok: Grokking Beyond Algorithmic Data☆63Feb 24, 2023Updated 3 years ago
- Does all kind of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg☆10Jun 7, 2023Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Dec 12, 2021Updated 4 years ago
- rot13 version of claudd code☆41Mar 12, 2025Updated last year
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Demonstration of CMake for Imperial ACM student chapter tutorial☆18May 23, 2014Updated 11 years ago
- ☆11Aug 13, 2024Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆39Jun 11, 2025Updated 9 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- HumanLM: Simulating Users with State Alignment Beats Response Imitation☆69Feb 27, 2026Updated last month
- Zig regex experiment☆13Nov 6, 2025Updated 4 months ago
- KANs and MLPs☆12Jun 7, 2024Updated last year
- SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)☆26Nov 3, 2025Updated 4 months ago
- ☆16Mar 22, 2025Updated last year
- Pytorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- ☆156Feb 16, 2026Updated last month
- Simple MoE - Day 17 of 365 Days of Repos☆18Jan 17, 2025Updated last year
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆50Aug 20, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- A python implementation of delta debugging tool.☆26Feb 9, 2024Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 3 weeks ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 7 years ago
- ☆55Mar 18, 2026Updated last week