The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"
☆19Jul 24, 2024Updated last year
Alternatives and similar repositories for StructuredFFN
Users that are interested in StructuredFFN are comparing it to the libraries listed below
Sorting:
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- ☆70Nov 15, 2024Updated last year
- Mastodon server running for the Doubanius Tertius project☆10Apr 4, 2022Updated 3 years ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Aug 7, 2024Updated last year
- Bayesian adaptive stimulus placement of psychometric function for MATLAB.☆10Nov 7, 2018Updated 7 years ago
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 2 years ago
- A nonparametric variational information bottleneck (NVIB) layer in Pytorch☆11Apr 15, 2025Updated 10 months ago
- Source code accompanying the NeurIPS 2022 paper "Learning Partial Equivariances From Data"☆10Nov 18, 2022Updated 3 years ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- Code to accompany the paper Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice☆10Aug 10, 2021Updated 4 years ago
- Generate v4 UUIDs using libsodium's RNG☆11Jun 16, 2020Updated 5 years ago
- Python API for Amplitude Analytics Logging - https://amplitude.com☆14Jun 4, 2020Updated 5 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- Calculate Mahalanobis distances for multivariate data.☆12Mar 23, 2020Updated 5 years ago
- PCM audio sample rate conversion for Node.js☆15May 13, 2013Updated 12 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Incremental Consistent Topological Sort for Append-only Logs☆14Jun 28, 2022Updated 3 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Face2Faceの実装とか☆13Jun 11, 2016Updated 9 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆13Nov 6, 2020Updated 5 years ago
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 7 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated last year
- Node.js Logical reasoning machine (WIP)☆10Dec 18, 2014Updated 11 years ago
- Automatically create an importmap script.☆14Oct 20, 2024Updated last year
- xast utility to build feeds (rss, atom)☆10Jul 19, 2023Updated 2 years ago
- A node module for allowing programmatic control of the useful Packer.IO tool☆16Nov 18, 2016Updated 9 years ago
- Open AI Gym Environment for the Dobot Magician Robotic Arm☆12Jul 9, 2018Updated 7 years ago
- Poetic static site generator for Node.js.☆82Jun 19, 2024Updated last year
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR)☆11Feb 2, 2018Updated 8 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 9 months ago
- ☆12May 30, 2025Updated 9 months ago
- Implementation of accurate coresets for known problems from the field of machine learning.☆11Nov 21, 2019Updated 6 years ago
- ☆11Jul 21, 2024Updated last year
- Quantities in Typescript, Idris influenced☆10Dec 19, 2025Updated 2 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago