Standalone pre-training recipe with JAX+Flax
☆35 · Apr 3, 2023 · Updated 3 years ago
Alternatives and similar repositories for sabertooth
Users that are interested in sabertooth are comparing it to the libraries listed below.
- Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference ☆14 · Jul 6, 2020 · Updated 5 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280 ☆25 · Apr 27, 2017 · Updated 9 years ago
- Scalable distributed reinforcement learning agents on kubernetes ☆57 · Jul 5, 2023 · Updated 2 years ago
- ☆18 · Mar 20, 2022 · Updated 4 years ago
- Hal Daume's hbc ☆20 · Jan 23, 2010 · Updated 16 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+ ☆37 · Mar 10, 2021 · Updated 5 years ago
- Inference code for LLaMA models in JAX ☆120 · May 21, 2024 · Updated 2 years ago
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language ☆16 · Oct 11, 2020 · Updated 5 years ago
- ☆18 · Aug 14, 2024 · Updated last year
- PyTorch bindings for openai-gemm ☆20 · Feb 6, 2017 · Updated 9 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L… ☆22 · Nov 14, 2022 · Updated 3 years ago
- ☆11 · Sep 20, 2016 · Updated 9 years ago
- ☆28 · Jan 12, 2022 · Updated 4 years ago
- ☆99 · Jul 25, 2023 · Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆39 · Feb 23, 2023 · Updated 3 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models. ☆72 · Jul 24, 2022 · Updated 3 years ago
- ☆15 · Apr 12, 2023 · Updated 3 years ago
- metaprogramming for Julia arrays ☆13 · Sep 26, 2020 · Updated 5 years ago
- ☆13 · Oct 23, 2018 · Updated 7 years ago
- Julia package for xtensor-julia ☆43 · Jul 2, 2022 · Updated 3 years ago
- Autograd extension for forward mode autodiff ☆31 · Oct 31, 2017 · Updated 8 years ago
- Repo for the work on hierarchical state space models for disentanglement ☆21 · Mar 15, 2021 · Updated 5 years ago
- Companion code in JAX for the paper Parallel Iterated Extended and Sigma-Point Kalman Smoothers. ☆27 · Aug 9, 2024 · Updated last year
- Run Pytorch graphs inside Theano graph (and pytorch wrapper for AIS for generative models). ☆18 · Oct 19, 2017 · Updated 8 years ago
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- Info for NLP1 2017 projects ☆23 · Dec 12, 2017 · Updated 8 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ? ☆18 · Jan 31, 2025 · Updated last year
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines ☆13 · Aug 16, 2022 · Updated 3 years ago
- Algorithms for Mining Frequent Trees (in Tree Structured Datasets) ☆10 · Mar 28, 2020 · Updated 6 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Jul 28, 2022 · Updated 3 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus. ☆12 · May 11, 2021 · Updated 5 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning" ☆11 · May 5, 2021 · Updated 5 years ago
- Benchmarks for DyNet ☆55 · Sep 22, 2025 · Updated 8 months ago
- Tensorflow Implementation of Multi-Function Recurrent Unit ☆23 · Jun 13, 2016 · Updated 9 years ago
- clock_plot provides a simple way to visualize timeseries data, mapping 24 hours onto the 360 degrees of a polar plot ☆15 · Apr 5, 2022 · Updated 4 years ago
- python package implementing a multivariate Horner scheme for efficiently evaluating multivariate polynomials ☆33 · Mar 8, 2025 · Updated last year
- Efficient teacher-student models and scripts to make them ☆57 · Dec 16, 2023 · Updated 2 years ago
- ☆25 · Jan 19, 2023 · Updated 3 years ago
- Code base for SRSGD. ☆27 · Mar 5, 2020 · Updated 6 years ago