GoldFinch and other hybrid transformer components
☆12Dec 9, 2025Updated 3 months ago
Alternatives and similar repositories for GoldFinch-paper
Users that are interested in GoldFinch-paper are comparing it to the libraries listed below
Sorting:
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- ☆18Dec 2, 2024Updated last year
- Some preliminary explorations of Mamba's context scaling.☆13Dec 18, 2024Updated last year
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Sep 7, 2023Updated 2 years ago
- ☆27Feb 26, 2026Updated last week
- RADLADS training code☆37May 7, 2025Updated 10 months ago
- Experiments on the impact of depth in transformers and SSMs.☆41Oct 23, 2025Updated 4 months ago
- Python script that converts PyTorch pth and pt files to safetensors format☆43Jul 18, 2025Updated 7 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Feb 2, 2025Updated last year
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu…☆13Sep 1, 2024Updated last year
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆86Jul 14, 2024Updated last year
- The purpose of this repository is for devs and non devs to carry out tests on the precompiled botanix artifacts. It contains an easy rpc …☆13Feb 23, 2026Updated 2 weeks ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Lightweight 4chan board archive software (like Foolfuuka) written in Rust☆39Jul 17, 2024Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆34Aug 7, 2025Updated 7 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- A convenient primitive for creating, structing and throwing errors☆13Oct 26, 2025Updated 4 months ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Apr 9, 2023Updated 2 years ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin☆19Feb 26, 2026Updated last week
- Telegram bot made with Python to get notified when visa slots are available☆14Nov 24, 2021Updated 4 years ago
- GGUF implementation for the ComfyUI Ultimate SD Upscale node.☆15Aug 22, 2025Updated 6 months ago
- Learning to Combine Local and Global Image Information for Contactless Palmprint Recognition☆11Dec 7, 2021Updated 4 years ago
- ☆16Jul 23, 2023Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- An extension for the GitHub Cli application that displays your current contribution graph☆14Aug 3, 2021Updated 4 years ago
- A modern style viewer for Dannbooru or other Booru API base site.☆14May 23, 2025Updated 9 months ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- Cherry Flowers everywhere☆11Jul 19, 2024Updated last year
- ☆13Apr 27, 2025Updated 10 months ago
- ☆13Mar 1, 2026Updated last week
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated last month
- ☆10Sep 29, 2024Updated last year
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- Documentation of the Circles UBI system☆10Jun 25, 2025Updated 8 months ago
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information☆14Dec 2, 2021Updated 4 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago