krishgoel / chronocept-baseline-modelsLinks
The official baseline implementations for Chronocept
☆10Updated last month
Alternatives and similar repositories for chronocept-baseline-models
Users that are interested in chronocept-baseline-models are comparing it to the libraries listed below
Sorting:
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆25Updated 11 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆104Updated last month
- alternative way to calculating self attention☆18Updated last year
- look how they massacred my boy☆63Updated last year
- Fine tune Gemma 3 on an object detection task☆87Updated 3 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 8 months ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆35Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- aesthetic tensor visualiser☆27Updated 6 months ago
- An introduction to LLM Sampling☆79Updated 10 months ago
- ☆68Updated 5 months ago
- ☆93Updated last month
- Training-Ready RL Environments + Evals☆164Updated this week
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆58Updated 6 months ago
- Repository to create traveling waves integrate special information through time☆55Updated 3 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 3 weeks ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆25Updated 5 months ago
- Source code for Activated LoRA☆22Updated last month
- Stream of my favorite papers and links☆43Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated last week
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆132Updated 2 weeks ago
- flexible search engine for video data☆66Updated last week
- Making folding experiments more accessible .☆85Updated 3 months ago
- Graph-Aware Attention for Adaptive Dynamics in Transformers☆65Updated 10 months ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 5 months ago
- ☆40Updated last year
- a tiny vectorstore implementation built with numpy.☆63Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆61Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆80Updated 7 months ago