Standalone pre-training recipe with JAX+Flax
☆35Apr 3, 2023Updated 3 years ago
Alternatives and similar repositories for sabertooth
Users that are interested in sabertooth are comparing it to the libraries listed below.
- Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference☆14Jul 6, 2020Updated 5 years ago
- Tool for managing deep learning experiments☆13Dec 22, 2017Updated 8 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280☆25Apr 27, 2017Updated 9 years ago
- Scalable distributed reinforcement learning agents on kubernetes☆57Jul 5, 2023Updated 2 years ago
- ☆18Mar 20, 2022Updated 4 years ago
- Hal Daume's hbc☆20Jan 23, 2010Updated 16 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Mar 10, 2021Updated 5 years ago
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language☆16Oct 11, 2020Updated 5 years ago
- PyTorch bindings for openai-gemm☆20Feb 6, 2017Updated 9 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- ☆11Sep 20, 2016Updated 9 years ago
- Contains code used to conduct experiments on dependency parsing with the Tensor-LSTM model developed for our paper "Cross-Lingual Depende…☆13Jan 5, 2017Updated 9 years ago
- ☆28Jan 12, 2022Updated 4 years ago
- fun visualization scripts☆10Mar 1, 2026Updated 3 months ago
- playing with gpt4☆13Mar 17, 2023Updated 3 years ago
- metaprogramming for Julia arrays☆13Sep 26, 2020Updated 5 years ago
- ☆13Oct 23, 2018Updated 7 years ago
- Dive into Jax, Flax, XLA and C++☆32Apr 1, 2020Updated 6 years ago
- Autograd extension for forward mode autodiff☆31Oct 31, 2017Updated 8 years ago
- Repo for the work on hierarchical state space models for disentanglement☆21Mar 15, 2021Updated 5 years ago
- Companion code in JAX for the paper Parallel Iterated Extended and Sigma-Point Kalman Smoothers.☆27Aug 9, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Info for NLP1 2017 projects☆23Dec 12, 2017Updated 8 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆18Mar 5, 2017Updated 9 years ago
- Algorithms for Mining Frequent Trees (in Tree Structured Datasets)☆10Mar 28, 2020Updated 6 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 5 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11May 5, 2021Updated 5 years ago
- Benchmarks for DyNet☆55Sep 22, 2025Updated 8 months ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Jun 13, 2016Updated 9 years ago
- clock_plot provides a simple way to visualize timeseries data, mapping 24 hours onto the 360 degrees of a polar plot☆15Apr 5, 2022Updated 4 years ago
- python package implementing a multivariate Horner scheme for efficiently evaluating multivariate polynomials☆33Mar 8, 2025Updated last year
- Efficient teacher-student models and scripts to make them☆57Dec 16, 2023Updated 2 years ago
- Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning (AAAI'19)☆28Oct 31, 2019Updated 6 years ago
- A client for Isabelle server (https://isabelle.in.tum.de)☆14May 3, 2026Updated last month
- The MobSTr dataset provides artifacts that demonstrate Model-based Safety Assurance and Traceability for a safety-critical automotive sys…☆10Mar 18, 2022Updated 4 years ago
- A text-based adventure game that uses a neural network to generate audio☆16Jan 31, 2020Updated 6 years ago