microsoft / PyMarlin
Lightweight Deep Learning Model Training library based on PyTorch
☆32Updated 2 years ago
Alternatives and similar repositories for PyMarlin:
Users that are interested in PyMarlin are comparing it to the libraries listed below
- A library to create and manage configuration files, especially for machine learning projects.☆76Updated 2 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated last year
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- ☆30Updated last month
- Parallel data preprocessing for NLP and ML.☆34Updated 2 months ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆42Updated 7 months ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆45Updated 2 years ago
- PyTorch Lightning implementation of Barlow Twins: Self-Supervised Learning via Redundancy Reduction.☆12Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- pytest plugin for a better developer experience when working with the PyTorch test suite☆44Updated 3 years ago
- Amos optimizer with JEstimator lib.☆81Updated 8 months ago
- ☆28Updated last year
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated last year
- PyTorch implementation of GLOM☆21Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆14Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- GPT, but made only out of MLPs☆88Updated 3 years ago
- Hyperparameter tuning via uncertainty modeling☆46Updated 8 months ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 2 years ago
- ☆102Updated 4 years ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated last year
- ☆72Updated 8 months ago
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- ☆17Updated 4 years ago
- Torch Distributed Experimental☆115Updated 5 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆24Updated 2 years ago