This project shows how to derive the total number of training tokens from a large text dataset from π€ datasets with Apache Beam and Dataflow.
β27Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for count-tokens-hf-datasets
Users that are interested in count-tokens-hf-datasets are comparing it to the libraries listed below
Sorting:
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.β14Dec 21, 2021Updated 4 years ago
- 3D car detection competition on Kaggle : https://www.kaggle.com/c/pku-autonomous-driving/overviewβ14Feb 5, 2020Updated 6 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Jun 9, 2023Updated 2 years ago
- small examples to test shared layerβ11Dec 31, 2020Updated 5 years ago
- Transfer Learning in Dialogue Benchmarking Toolkitβ14Mar 31, 2023Updated 2 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learningβ13Jan 22, 2022Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.β15Sep 29, 2021Updated 4 years ago
- β16May 9, 2023Updated 2 years ago
- Neural Arithmetic Logic Units by Trask et al.β12Apr 10, 2019Updated 6 years ago
- Implementing the OPRO paperβ16Sep 18, 2023Updated 2 years ago
- β24Sep 2, 2022Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.β22Jan 16, 2023Updated 3 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.β20Aug 4, 2021Updated 4 years ago
- Showcases the use of deep learning to detect wheat heads from crops. The project is based on: https://www.kaggle.com/c/global-wheat-detecβ¦β18May 30, 2020Updated 5 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.β21Oct 26, 2021Updated 4 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β28Apr 17, 2024Updated last year
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.β20Apr 28, 2021Updated 4 years ago
- [UNMAINTAINED] A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN) for Graph Classificationβ20Mar 19, 2019Updated 6 years ago
- Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fineβ¦β59Jul 31, 2022Updated 3 years ago
- [ICLR 2025] π CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.β29Apr 21, 2025Updated 10 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from π€ Transformers.β30Aug 22, 2022Updated 3 years ago
- In-IDE Code Searchβ29Apr 29, 2022Updated 3 years ago
- β33Dec 9, 2022Updated 3 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch πβ138Jan 31, 2026Updated last month
- Rotated Box SSD detection Framework with FPN support, next generation object detection frameworkβ31Jul 29, 2019Updated 6 years ago
- Go SDK for the Bare Metal Cloud APIβ14Dec 20, 2025Updated 2 months ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Explorationβ10Dec 24, 2023Updated 2 years ago
- openASO is a project designed to identify regulatory regions of an RNA that can be targeted by antisense oligonucleotides.β10Sep 30, 2021Updated 4 years ago
- Understanding how features learned by neural networks evolve throughout trainingβ41Oct 24, 2024Updated last year
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.β88Sep 24, 2021Updated 4 years ago
- Game engine for website version avalon card-board gameβ12Aug 2, 2025Updated 7 months ago
- β12Dec 26, 2023Updated 2 years ago
- This software uses a config file (config.py), which is a settings file, to build and run SWAT+ models. Users can share the config along wβ¦β16Mar 9, 2022Updated 4 years ago
- μ΄λ¦°μ΄λ₯Ό μν λν μ μ μλΉμ€, My AI Fairy-Taleβ11Apr 7, 2023Updated 2 years ago
- TensorFlow 2 / Lite implementation of Ultra-Fast Structure-Aware Lane Detectionβ12Aug 19, 2020Updated 5 years ago
- A simple camera board using GMAX3412 1" 4K@30fps global shutter sensorβ19Dec 21, 2025Updated 2 months ago
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated last year
- My personal websiteβ11Dec 22, 2024Updated last year
- Implementation of a Systolic Array based sorting engine on an FPGA using Verilogβ11May 11, 2017Updated 8 years ago