tanmoyio / sahajBERT
☆14Updated 3 years ago
Alternatives and similar repositories for sahajBERT:
Users who are interested in sahajBERT are comparing it to the repositories listed below.
- An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data☆37Updated 4 years ago
- Memory-efficient transformer. Work in progress.☆19Updated 2 years ago
- PyTorch implementation of GLOM☆22Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- HomebrewNLP in JAX flavour for maintainable TPU training☆49Updated last year
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆25Updated 2 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s `precompiled_charsmap` in `tokenizers`☆18Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- PyTorch Lightning implementation of Barlow Twins: Self-Supervised Learning via Redundancy Reduction.☆12Updated 4 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- A collection of reusable, high-performance, well-documented, thoroughly tested layers and models in JAX☆13Updated last week
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Version control for software 2.0☆64Updated 4 years ago
- A deep learning library based on PyTorch, focused on low-resource language research and robustness☆70Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- Python Research Framework☆106Updated 2 years ago
- A queue service for quickly developing scripts that use all your GPUs efficiently☆83Updated 2 years ago
- A Simple but Powerful CNN Trainer For PyTorch☆26Updated 4 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 3 years ago
- Helper scripts and notes that were used while porting various NLP models☆46Updated 3 years ago
- Tensorboard parser☆22Updated 5 months ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Updated 3 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆15Updated 3 years ago