gregorbachmann/scaling_mlps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gregorbachmann/scaling_mlps)

gregorbachmann / scaling_mlps

☆53

Alternatives and similar repositories for scaling_mlps

Users that are interested in scaling_mlps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenSun3D / cvpr24-challenge
View on GitHub
OpenSUN3D Workshop Challenge - CVPR '24
☆16May 31, 2024Updated 2 years ago
enisimsar / LIME
View on GitHub
[WACV 2025] Official Implementation of LIME: Localized Image Editing via Attention Regularization in Diffusion Models
☆10Apr 7, 2025Updated last year
OpenMask3D / openmask3d.github.io
View on GitHub
☆11May 8, 2024Updated 2 years ago
human-3d / SyntheticHumanDataset
View on GitHub
This repo contains the code to generate synthetic human-scene interaction dataset used for Human3D 🧑‍🤝‍🧑, accepted at ICCV 2023.
☆15Sep 27, 2023Updated 2 years ago
aycatakmaz / search3d
View on GitHub
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
☆24May 20, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
aycatakmaz / wsvd_dataset_loader
View on GitHub
Loader for WSVD dataset (+ eliminating invalid/deleted youtube videos from the video id list)
☆13Jun 2, 2020Updated 6 years ago
tml-epfl / sharpness-vs-generalization
View on GitHub
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆44Sep 11, 2023Updated 2 years ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
abhishekpanigrahi1996 / transformer_in_transformer
View on GitHub
☆47Oct 11, 2023Updated 2 years ago
princeton-nlp / LM-Kernel-FT
View on GitHub
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Sep 4, 2023Updated 2 years ago
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
facebookresearch / interaction-exploration
View on GitHub
Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20)
☆38Jul 6, 2023Updated 3 years ago
renll / SeqBoat
View on GitHub
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆40Dec 2, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
MadryLab / bias-transfer
View on GitHub
☆15Jul 24, 2022Updated 4 years ago
ttbrunner / biased_boundary_attack_avc
View on GitHub
Implementation of the Biased Boundary Attack for the NeurIPS 2018 Adversarial Vision Challenge
☆13Jan 29, 2020Updated 6 years ago
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
tml-epfl / why-weight-decay
View on GitHub
Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]
☆73Sep 25, 2024Updated last year
CurvSurf / ARKitPointCloudRecorder
View on GitHub
ARKitPointCloudRecorder is an demo application developed by CurvSurf for recording and filtering the point cloud generated by Apple ARKit…
☆40Apr 5, 2019Updated 7 years ago
sustcsonglin / disco-pointer
View on GitHub
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14Aug 25, 2023Updated 2 years ago
js-d / sim_metric
View on GitHub
☆36Oct 3, 2023Updated 2 years ago
Noahs-ARK / PaLM
View on GitHub
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Jan 7, 2020Updated 6 years ago
epfml / REQ
View on GitHub
☆19Jun 10, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
johanwind / wind_rwkv
View on GitHub
☆27Feb 26, 2026Updated 4 months ago
srush / triton-autodiff
View on GitHub
Experiment of using Tangent to autodiff triton
☆81Jan 22, 2024Updated 2 years ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
View on GitHub
☆22Dec 15, 2023Updated 2 years ago
davidstutz / robust-generalization-flatness
View on GitHub
Implementation of average- and worst-case robust flatness measures for adversarial training.
☆15Nov 5, 2021Updated 4 years ago
irhum / hyena
View on GitHub
JAX/Flax implementation of the Hyena Hierarchy
☆35Apr 27, 2023Updated 3 years ago
affjljoo3581 / starcoder-jax
View on GitHub
a Jax/Flax inference code of StarCoder
☆12Jun 12, 2023Updated 3 years ago
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
MadryLab / dataset-replication-analysis
View on GitHub
☆25May 20, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
Tushar-N / interaction-hotspots
View on GitHub
Learning interaction hotspots from egocentric video
☆52Dec 12, 2022Updated 3 years ago
aw31 / empirical-ntks
View on GitHub
Efficient empirical NTKs in PyTorch
☆22Jun 13, 2022Updated 4 years ago
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
rycolab / aflt-f2023
View on GitHub
Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)
☆10Feb 21, 2023Updated 3 years ago
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
zetayue / CPA
View on GitHub
Source code for "Improving Attention Mechanism in Graph Neural Networks via Cardinality Preservation" (IJCAI 2020)
☆17Jul 25, 2024Updated last year