KindXiaoming / grow-crystals
Getting crystal-like representations with harmonic loss
☆177 Updated last month
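The harmonic loss referenced in the description replaces the usual linear-plus-softmax readout with a distance-based one: each class has a learned center, the score for a class is the Euclidean distance from the input representation to that center, and class probabilities follow an inverse-power (harmonic) weighting of those distances. Below is a minimal PyTorch sketch of that idea, assuming the formulation from the harmonic-loss paper; the class name, `exponent`, `eps`, and the random initialization are illustrative assumptions, not the grow-crystals API.

```python
import torch
import torch.nn as nn

class HarmonicLoss(nn.Module):
    """Sketch of a harmonic-loss readout: class scores are distances to
    learned class centers, probabilities follow an inverse-power law.
    Names and defaults are assumptions, not the repository's actual API."""

    def __init__(self, dim, num_classes, exponent=2.0, eps=1e-8):
        super().__init__()
        self.centers = nn.Parameter(torch.randn(num_classes, dim))  # one learned center per class
        self.exponent = exponent  # harmonic exponent n
        self.eps = eps            # avoids log(0) when an input coincides with a center

    def forward(self, x, targets):
        # Euclidean distance from each input to each class center: (batch, num_classes)
        d = torch.cdist(x, self.centers) + self.eps
        # Harmonic probabilities p_i ∝ 1 / d_i^n, computed in log space for stability
        log_p = -self.exponent * torch.log(d)
        log_p = log_p - torch.logsumexp(log_p, dim=-1, keepdim=True)
        # Negative log-likelihood of the target classes
        return -log_p.gather(1, targets.unsqueeze(1)).mean()

# Usage sketch: loss = HarmonicLoss(dim=64, num_classes=10)(features, labels)
```

The intended effect, per the repository description, is that the learned class centers form geometrically structured ("crystal-like") representations rather than the unconstrained weight vectors of a standard softmax layer.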
Alternatives and similar repositories for grow-crystals:
Users interested in grow-crystals are comparing it to the repositories listed below.
- DeMo: Decoupled Momentum Optimization ☆185 Updated 3 months ago
- Explorations into the proposal from the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆98 Updated 3 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆84 Updated last month
- ☆91 Updated 2 months ago
- smolLM with Entropix sampler on pytorch ☆150 Updated 4 months ago
- Efficient optimizers ☆184 Updated 2 weeks ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆91 Updated 2 weeks ago
- Focused on fast experimentation and simplicity ☆69 Updated 3 months ago
- supporting pytorch FSDP for optimizers ☆79 Updated 3 months ago
- The AdEMAMix Optimizer: Better, Faster, Older. ☆179 Updated 6 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆221 Updated 3 weeks ago
- ☆169 Updated 3 months ago
- ☆36 Updated 3 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆135 Updated 2 weeks ago
- ☆79 Updated 11 months ago
- Visualizations of the theory behind diffusion models. ☆151 Updated 11 months ago
- look how they massacred my boy ☆63 Updated 5 months ago
- ☆119 Updated 3 weeks ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆108 Updated 3 months ago
- Simple Transformer in Jax ☆136 Updated 9 months ago
- When it comes to optimizers, it's always better to be safe than sorry ☆214 Updated last month
- ☆149 Updated 7 months ago
- ☆105 Updated 3 months ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆276 Updated this week
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆63 Updated last month
- σ-GPT: A New Approach to Autoregressive Models ☆62 Updated 7 months ago
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead ☆521 Updated 2 weeks ago
- ☆301 Updated 9 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆85 Updated this week
- 🧱 Modula software package ☆173 Updated 2 weeks ago