AnesBenmerzoug / langsfer
A library for language transfer methods and algorithms.
☆15Updated 4 months ago
Alternatives and similar repositories for langsfer:
Users that are interested in langsfer are comparing it to the libraries listed below
- Collection of design patterns that came to me in feverish nights - so bad they are good!☆9Updated 10 months ago
- A library for calibrating classifiers and computing calibration metrics☆13Updated 2 years ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆88Updated 8 months ago
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.☆74Updated this week
- A fire-tested template for production grade python libraries and packages.☆16Updated last week
- Simple-to-use scoring function for arbitrarily tokenized texts.☆38Updated last month
- nanoGPT-like codebase for LLM training☆91Updated last week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 3 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆49Updated last year
- ☆73Updated 11 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆30Updated 5 months ago
- ☆124Updated this week
- ☆20Updated last year
- Efficient optimizers☆185Updated this week
- ☆33Updated last month
- Common Python utilities and GitHub Actions in Lightning Ecosystem☆54Updated this week
- TorchFix - a linter for PyTorch-using code with autofix support☆136Updated last month
- Supercharge huggingface transformers with model parallelism.☆76Updated 5 months ago
- Official implementation of "GPT or BERT: why not both?"☆49Updated 2 weeks ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated 10 months ago
- ☆78Updated last week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆58Updated 5 months ago
- Various transformers for FSDP research☆37Updated 2 years ago
- ☆30Updated 11 months ago
- ☆171Updated 3 months ago
- A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.☆63Updated last month
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 9 months ago
- The Python library for sensible AI.☆41Updated this week
- supporting pytorch FSDP for optimizers☆80Updated 3 months ago
- Sparse and discrete interpretability tool for neural networks☆59Updated last year