A gMLP (gated MLP) implementation in TensorFlow 1.x, as described in the paper "Pay Attention to MLPs" (arXiv:2105.08050).
☆16 · Aug 31, 2021 · Updated 4 years ago
Alternatives and similar repositories for g-mlp-tensorflow
Users that are interested in g-mlp-tensorflow are comparing it to the libraries listed below.
- RWKV6 in native pytorch and triton:) ☆11 · Aug 4, 2024 · Updated last year
- ☆10 · Feb 21, 2023 · Updated 3 years ago
- This repository contains data and analysis scripts to reproduce the figures as well as source code and simulation scripts to perform the … ☆13 · Apr 13, 2021 · Updated 5 years ago
- ☆10 · Jun 10, 2023 · Updated 3 years ago
- ☆10 · May 1, 2023 · Updated 3 years ago
- VQ-TR repository ☆12 · Apr 18, 2024 · Updated 2 years ago
- Use ChatGPT for free without registration; follow the WeChat official account 【胖竹同学】. ☆10 · Apr 4, 2023 · Updated 3 years ago
- ☆13 · May 18, 2021 · Updated 5 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data ☆13 · Jul 21, 2024 · Updated last year
- ☆12 · Apr 19, 2024 · Updated 2 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration" ☆10 · Jul 30, 2024 · Updated last year
- Codes for our ICLR2020 paper: Knowledge Consistency between Neural Networks and Beyond ☆16 · Jan 11, 2020 · Updated 6 years ago
- Ice segment plugin for Bluge ☆12 · Jul 4, 2022 · Updated 3 years ago
- Go Based Lightweight RAG / LLM Tool with CLI + API ☆14 · Sep 28, 2023 · Updated 2 years ago
- A tool to find all duplicates in large sets of text documents. ☆16 · Sep 29, 2021 · Updated 4 years ago
- ☆15 · Aug 15, 2023 · Updated 2 years ago
- ☆14 · Oct 24, 2024 · Updated last year
- Variance Covariance Regularization ☆14 · Jun 22, 2023 · Updated 2 years ago
- Feature Importance Analysis of Models ☆11 · Mar 23, 2022 · Updated 4 years ago
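- Scraper for the five-level administrative addresses of China (province, city, county, township, village) published by the National Bureau of Statistics, http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html ☆12 · Jan 8, 2020 · Updated 6 years ago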
- ☆12 · May 14, 2025 · Updated last year
- Time series data contribution via influence functions ☆17 · Jan 18, 2025 · Updated last year
- ☆10 · May 30, 2024 · Updated 2 years ago
- Go library for accessing the Paddle API ☆10 · Apr 14, 2022 · Updated 4 years ago
- ☆22 · Oct 28, 2024 · Updated last year
- Gamera 4 for Python 3 ☆14 · May 16, 2025 · Updated last year
- Best way to use ChatGPT/GPT-3 with Go: zero dependencies, tokenizer, under 1500 LOC ☆14 · Jul 18, 2024 · Updated last year
- Deploy Yolo series algorithms on Hisilicon platform hi3516, including yolov3, yolov5, yolox, etc ☆11 · Mar 25, 2022 · Updated 4 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithms ☆13 · Jun 15, 2018 · Updated 8 years ago
- Source code for Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers ☆19 · May 29, 2024 · Updated 2 years ago
- Python scripts to facilitate easy working ☆11 · Mar 23, 2026 · Updated 2 months ago
- ☆24 · Nov 22, 2022 · Updated 3 years ago
- ☆21 · Jul 19, 2024 · Updated last year
- Source code of FDNet: Focal Decomposed Network for Efficient, Robust and Practical Time Series Forecasting ☆14 · Mar 6, 2024 · Updated 2 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch ☆31 · May 11, 2026 · Updated last month
- Psychology-inspired Eye Movement Synthesis for Gaze-based Activity Recognition ☆28 · Aug 9, 2023 · Updated 2 years ago
- Computer Vision Segmentation for Document Layout Analysis ☆10 · Sep 26, 2022 · Updated 3 years ago
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023] ☆26 · Jan 10, 2025 · Updated last year
- WIP: Full text search engine library written in Go with 1.18+ Generics, heavily inspired by Tantivy ☆14 · Apr 5, 2023 · Updated 3 years ago