iliao2345 / CompressARC
☆215 Updated last month
Alternatives and similar repositories for CompressARC
Users who are interested in CompressARC are comparing it to the repositories listed below.
- Bootstrapping ARC ☆155 Updated last year
- Our solution for the arc challenge 2024 ☆188 Updated 7 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆107 Updated 2 months ago
- ☆167 Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆186 Updated 3 weeks ago
- Normalized Transformer (nGPT) ☆198 Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆137 Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆198 Updated last year
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆344 Updated 2 months ago
- Materials for ConceptARC paper ☆112 Updated last year
- ☆618 Updated 8 months ago
- Understand and test language model architectures on synthetic tasks. ☆252 Updated 3 weeks ago
- ☆111 Updated 6 months ago
- ☆34 Updated 11 months ago
- ☆191 Updated 2 weeks ago
- Jax Codebase for Evolutionary Strategies at the Hyperscale ☆218 Updated last month
- An AI benchmark for creative, human-like problem solving using Sudoku variants ☆158 Updated last month
- ☆134 Updated last year
- 🧱 Modula software package ☆322 Updated 5 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆292 Updated 2 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games ☆228 Updated 2 months ago
- ☆123 Updated last week
- ☆88 Updated last year
- 📄Small Batch Size Training for Language Models ☆80 Updated 4 months ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆293 Updated 8 months ago
- Training API and CLI ☆325 Updated last week
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆86 Updated 4 months ago
- Supporting code for the blog post on modular manifolds. ☆115 Updated 4 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆150 Updated 4 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆70 Updated last year