A numpy implementation of the Transformer model in "Attention is All You Need"
☆58Jul 21, 2024Updated last year
Alternatives and similar repositories for numpy-transformer
Users that are interested in numpy-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Сustom torch style machine learning framework with automatic differentiation implemented on numpy, allows build GANs, VAEs, etc.☆81Feb 28, 2026Updated last month
- Implement Transformers (and Deep Learning) from scratch in NumPy☆28Oct 3, 2023Updated 2 years ago
- Python implementations of basic machine learning algorithms☆14Feb 17, 2024Updated 2 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 9 months ago
- ☆14Oct 24, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- bitfusion verilog implementation☆12Feb 21, 2022Updated 4 years ago
- Diffusion-based korean text-to-image generation model☆12Aug 16, 2023Updated 2 years ago
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- T5-based (russian) text normalization☆26Jan 25, 2024Updated 2 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆27Jun 16, 2025Updated 9 months ago
- Deploy & Monetize Agents in a Hour☆16Jun 28, 2025Updated 9 months ago
- Unofficial PyTorch Reimplementation of UniformAugment.☆15Sep 7, 2020Updated 5 years ago
- This repository contains the training and evaluation code for llm-jp-modernbert-base.☆16Jun 17, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆19Nov 7, 2023Updated 2 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆24Oct 25, 2023Updated 2 years ago
- Library for high level model ensembling☆12Jan 27, 2023Updated 3 years ago
- ☆12Jul 8, 2023Updated 2 years ago
- ☆18Mar 25, 2024Updated 2 years ago
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 9 months ago
- ☆20Jul 24, 2024Updated last year
- Object Detection with Transformers : DETR, Conditional DETR, Deformable DETR, Dynamic Head☆12Jan 22, 2023Updated 3 years ago
- ☆13Aug 29, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PL Reading Group Website☆14Jan 12, 2026Updated 3 months ago
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated 2 years ago
- ☆21Jan 11, 2023Updated 3 years ago
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- Talks about vaex☆36Dec 2, 2022Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Jun 18, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Dec 6, 2023Updated 2 years ago
- Hierarchical entity typing via multi-level learning to rank☆12Oct 13, 2020Updated 5 years ago
- Procedural data generators suite for synthetic pretraining and formal reasoning☆36Updated this week
- ☆17Oct 31, 2023Updated 2 years ago
- A Minimum Working Example of the Dissertation Template for UW-Madison.☆13May 4, 2024Updated last year
- My personal work on the numerical projects of a book called "A First Course in Stochastic Calculus".☆16Apr 29, 2022Updated 3 years ago
- Single-prompt pptx generation framework☆36Nov 13, 2024Updated last year