AnesBenmerzoug / langsfer
A library for language transfer methods and algorithms.
โ16Updated 5 months ago
Alternatives and similar repositories for langsfer:
Users that are interested in langsfer are comparing it to the libraries listed below
- Collection of design patterns that came to me in feverish nights - so bad they are good!โ9Updated 11 months ago
- A fire-tested template for production grade python libraries and packages.โ16Updated last month
- Serialize JAX, Flax, Haiku, or Objax model params with ๐ค`safetensors`โ44Updated 10 months ago
- This is a port of Mistral-7B model in JAXโ32Updated 9 months ago
- A library for calibrating classifiers and computing calibration metricsโ13Updated 2 years ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizersโ92Updated 9 months ago
- TorchFix - a linter for PyTorch-using code with autofix supportโ140Updated 2 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Scheduleโ60Updated last year
- PyTorch interface for TrueGrad Optimizersโ41Updated last year
- Common Python utilities and GitHub Actions in Lightning Ecosystemโ56Updated this week
- HomebrewNLP in JAX flavour for maintable TPU-Trainingโ49Updated last year
- Check if you have training samples in your test setโ64Updated 2 years ago
- A metrics library for the JAX ecosystemโ40Updated 2 years ago
- nanoGPT-like codebase for LLM trainingโ94Updated 3 weeks ago
- โ54Updated 7 months ago
- Automatically take good care of your preemptible TPUsโ36Updated last year
- A functional training loops library for JAXโ86Updated last year
- The Python library for sensible AI.โ41Updated 3 weeks ago
- โ114Updated last week
- minGPT in JAXโ48Updated 3 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.โ33Updated last year
- graphpatch is a library for activation patching on PyTorch neural network models.โ14Updated 2 months ago
- Autoregressive transformer in JAX from scratchโ22Updated 3 years ago
- An implementation of the Llama architecture, to instruct and delightโ21Updated 3 months ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)โ21Updated 5 months ago
- Neural Networks for JAXโ84Updated 7 months ago
- Mobile Viewer for W&B, built on top of Flutter.โ33Updated last year
- A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.โ63Updated 3 weeks ago
- โ60Updated 3 years ago
- some common Huggingface transformers in maximal update parametrization (ยตP)โ80Updated 3 years ago