goombalab / Gather-and-AggregateView external linksLinks
Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"
☆14Apr 30, 2025Updated 9 months ago
Alternatives and similar repositories for Gather-and-Aggregate
Users that are interested in Gather-and-Aggregate are comparing it to the libraries listed below
Sorting:
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆119Sep 13, 2024Updated last year
- ☆14Mar 2, 2025Updated 11 months ago
- ☆21Sep 16, 2025Updated 5 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated last month
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Oct 28, 2025Updated 3 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 8 months ago
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 5 months ago
- Code repository for the paper - "Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass"☆21Aug 22, 2024Updated last year
- ☆35Feb 26, 2024Updated last year
- Make reasoning models scalable☆47May 31, 2025Updated 8 months ago
- ☆35Apr 12, 2024Updated last year
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆89Oct 30, 2024Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- Building LLMs from scratch following the book from S. Raschka☆32Mar 27, 2025Updated 10 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators☆115Jun 14, 2025Updated 8 months ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆50Feb 6, 2026Updated last week
- The GraphBench package.☆24Jan 5, 2026Updated last month
- Example application for creating an MVC Express + Node + TypeScript app and deploying it to Azure☆10Nov 8, 2018Updated 7 years ago
- 📦 A collection of pastable code gathered from past projects☆12Sep 9, 2024Updated last year
- a react scrollable tabs component with many additional features☆36Jan 16, 2025Updated last year
- ☆35Mar 12, 2025Updated 11 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍☆45Nov 6, 2024Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Sep 29, 2025Updated 4 months ago
- ☆16Feb 22, 2025Updated 11 months ago
- [CVPR 2021] FMO Deblurring Benchmark☆13Jan 12, 2022Updated 4 years ago
- Python资源大全中文版,内容包括:Web框架、网络爬虫、网络内容提取、模板引擎、数据库、数据可视化、图片处理、文本处理、自然语言处理、机器学习、日志、代码分析等☆11May 24, 2016Updated 9 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated 8 months ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated 2 weeks ago
- Code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models"☆28Jan 27, 2026Updated 2 weeks ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- Implementation of Agent Attention in Pytorch☆93Jul 10, 2024Updated last year
- Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin,Yefen…☆66Jan 2, 2026Updated last month
- ☆10Jun 27, 2024Updated last year
- 🐛 Web Apps for Boilerplate☆10Dec 23, 2024Updated last year
- Canvas Element Recorder for React, with really simple API☆11Oct 16, 2023Updated 2 years ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11May 28, 2025Updated 8 months ago