getorca / mamba_for_sequence_classificationLinks
☆17Updated last year
Alternatives and similar repositories for mamba_for_sequence_classification
Users that are interested in mamba_for_sequence_classification are comparing it to the libraries listed below
Sorting:
- Text Classification using Mamba Model☆24Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆206Updated last week
- a curated list of the role of small models in the LLM era☆104Updated 11 months ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆24Updated 2 years ago
- ☆22Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆52Updated last year
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆33Updated last year
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆19Updated last year
- Research on Tabular Foundation Models☆56Updated 9 months ago
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆45Updated 6 months ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆131Updated last year
- [ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation☆99Updated 3 months ago
- This is the code that went into our practical dive using mamba as information extraction☆55Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆25Updated last month
- ☆91Updated 7 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆187Updated last week
- Resources about xLSTM by Sepp Hochreiter☆316Updated 10 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Code and data of HRGraph, accepted to KaLLM workshop at ACL 2024.☆16Updated 11 months ago
- ☆154Updated last year
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆296Updated last year
- ☆136Updated last year
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod…☆11Updated 4 years ago
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Updated last week
- ☆101Updated 3 years ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆62Updated 10 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆18Updated 3 months ago
- Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.☆68Updated last year
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆65Updated last year