MatFormer repo
☆76 · Dec 9, 2024 · Updated last year
Alternatives and similar repositories for matformer
Users that are interested in matformer are comparing it to the libraries listed below.
- Auditing agents for fine-tuning safety ☆21 · Oct 21, 2025 · Updated 7 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆19 · Mar 7, 2025 · Updated last year
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- Official repo for BWLer: Barycentric Weight Layer ☆30 · Mar 20, 2026 · Updated 2 months ago
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation." ☆21 · Jun 4, 2023 · Updated 2 years ago
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens" ☆13 · Jul 23, 2023 · Updated 2 years ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆31 · Nov 14, 2023 · Updated 2 years ago
- A RAG that can scale 🧑🏻💻 ☆11 · May 28, 2024 · Updated last year
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021. ☆15 · Mar 28, 2021 · Updated 5 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆67 · Oct 25, 2024 · Updated last year
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models" ☆21 · Feb 23, 2025 · Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 · May 26, 2025 · Updated 11 months ago
- ☆10 · Oct 2, 2024 · Updated last year
- ☆54 · Sep 26, 2025 · Updated 7 months ago
- ☆40 · Apr 15, 2024 · Updated 2 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings ☆13 · May 22, 2025 · Updated last year
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers" ☆13 · Oct 20, 2024 · Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ☆35 · Sep 20, 2025 · Updated 8 months ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod… ☆15 · Nov 25, 2024 · Updated last year
- Run TFLITE models on the web ☆13 · Jan 2, 2022 · Updated 4 years ago
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆21 · Jan 8, 2025 · Updated last year
- Code repository for the paper - "Matryoshka Representation Learning" ☆634 · Feb 19, 2024 · Updated 2 years ago
- ☆17 · Jan 5, 2023 · Updated 3 years ago
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub ☆11 · Aug 8, 2021 · Updated 4 years ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models" ☆22 · Jan 16, 2025 · Updated last year
- A summarizer for Japanese articles (but ChatGPT is better) ☆10 · Aug 1, 2022 · Updated 3 years ago
- PyTorch implementation of StableMask (ICML'24) ☆15 · Jun 27, 2024 · Updated last year
- ☆137 · Jun 6, 2025 · Updated 11 months ago
- ☆92 · Aug 18, 2024 · Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch ☆18 · Sep 15, 2023 · Updated 2 years ago
- ☆13 · Nov 15, 2021 · Updated 4 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 · May 8, 2023 · Updated 3 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆35 · Nov 21, 2025 · Updated 6 months ago
- Code for paper https://arxiv.org/abs/2501.00522 ☆15 · Apr 28, 2025 · Updated last year
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i… ☆29 · Mar 8, 2026 · Updated 2 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆17 · May 1, 2023 · Updated 3 years ago
- ☆28 · Oct 7, 2025 · Updated 7 months ago
- ☆33 · Oct 15, 2025 · Updated 7 months ago