getorca / mamba_for_sequence_classificationLinks
☆17Updated last year
Alternatives and similar repositories for mamba_for_sequence_classification
Users that are interested in mamba_for_sequence_classification are comparing it to the libraries listed below
Sorting:
- Text Classification using Mamba Model☆26Updated last year
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆35Updated 2 years ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆213Updated last week
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆206Updated 3 weeks ago
- a curated list of the role of small models in the LLM era☆111Updated last year
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆134Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN)☆41Updated 11 months ago
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆25Updated last year
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆21Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated last week
- ☆19Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆55Updated 2 years ago
- ☆42Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆48Updated 11 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆231Updated 3 months ago
- Research on Tabular Foundation Models☆69Updated last year
- ☆24Updated 3 years ago
- Text classification with Foundation Language Model LLaMA☆113Updated 2 years ago
- The source code of "Improved Graph Contrastive Learning for Short Text Classification"☆11Updated 5 months ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆25Updated 2 years ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Updated 3 months ago
- ☆12Updated last year
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆118Updated 3 weeks ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆31Updated 7 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year