sayakpaul / count-tokens-hf-datasetsView external linksLinks
This project shows how to derive the total number of training tokens from a large text dataset from π€ datasets with Apache Beam and Dataflow.
β27Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for count-tokens-hf-datasets
Users that are interested in count-tokens-hf-datasets are comparing it to the libraries listed below
Sorting:
- 3D car detection competition on Kaggle : https://www.kaggle.com/c/pku-autonomous-driving/overviewβ14Feb 5, 2020Updated 6 years ago
- This repository shows how to efficiently process variable-length sequences in TensorFlow.β14Apr 26, 2022Updated 3 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Jun 9, 2023Updated 2 years ago
- small examples to test shared layerβ11Dec 31, 2020Updated 5 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.β15Sep 29, 2021Updated 4 years ago
- Transfer Learning in Dialogue Benchmarking Toolkitβ14Mar 31, 2023Updated 2 years ago
- Neural Arithmetic Logic Units by Trask et al.β12Apr 10, 2019Updated 6 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learningβ13Jan 22, 2022Updated 4 years ago
- Project demonstrating dual model deployment scenarios using Vertex AI (GCP).β34Dec 28, 2021Updated 4 years ago
- Implementing the OPRO paperβ16Sep 18, 2023Updated 2 years ago
- β24Sep 2, 2022Updated 3 years ago
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.β102Mar 27, 2022Updated 3 years ago
- Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al.β29Aug 21, 2020Updated 5 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.β20Aug 4, 2021Updated 4 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.β22Jan 16, 2023Updated 3 years ago
- A web interface to understand language-specific BERT-modelsβ18Apr 16, 2024Updated last year
- RetinaNet with different loss function typesβ17Sep 18, 2019Updated 6 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.β21Oct 26, 2021Updated 4 years ago
- Personal site of Sayak Paul. Deployed here πβ24Feb 8, 2026Updated last week
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β28Apr 17, 2024Updated last year
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.β20Apr 28, 2021Updated 4 years ago
- TensorFlow implementation of Barlow Twins (https://arxiv.org/abs/2103.03230).β41Jun 11, 2021Updated 4 years ago
- [UNMAINTAINED] A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN) for Graph Classificationβ20Mar 19, 2019Updated 6 years ago
- β24Oct 30, 2019Updated 6 years ago
- [ICLR 2025] π CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.β29Apr 21, 2025Updated 9 months ago
- Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fineβ¦β59Jul 31, 2022Updated 3 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from π€ Transformers.β30Aug 22, 2022Updated 3 years ago
- Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).β122Dec 3, 2022Updated 3 years ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.β33Mar 23, 2022Updated 3 years ago
- β32Feb 8, 2019Updated 7 years ago
- β33Dec 9, 2022Updated 3 years ago
- Rotated Box SSD detection Framework with FPN support, next generation object detection frameworkβ31Jul 29, 2019Updated 6 years ago
- π§ A sample app to integrate react-native and open aiβ11Jan 1, 2023Updated 3 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Explorationβ10Dec 24, 2023Updated 2 years ago
- Go SDK for the Bare Metal Cloud APIβ14Dec 20, 2025Updated last month
- Understanding how features learned by neural networks evolve throughout trainingβ39Oct 24, 2024Updated last year
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.β88Sep 24, 2021Updated 4 years ago
- Game engine for website version avalon card-board gameβ12Aug 2, 2025Updated 6 months ago
- β13Nov 5, 2024Updated last year