getorca / mamba_for_sequence_classification

☆17

Alternatives and similar repositories for mamba_for_sequence_classification

Users who are interested in mamba_for_sequence_classification are comparing it to the repositories listed below.

VuBacktracking / mamba-text-classification
Text Classification using Mamba Model
☆22 Updated last year
tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆102 Updated 9 months ago
kyegomez / MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆197 Updated 3 months ago
eleonorapoeta / benchmarking-KAN
This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…
☆17 Updated last year
kyegomez / xLSTM
Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"
☆119 Updated 3 months ago
EPFLiGHT / MultiModN
MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)
☆33 Updated last year
AI-Guru / xlstm-resources
Resources about xLSTM by Sepp Hochreiter
☆317 Updated 8 months ago
andrewgcodes / xlstm
my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture
☆129 Updated last year
zhzhengit / ENER
E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition
☆23 Updated last year
kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆173 Updated 3 months ago
alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…
☆77 Updated last year
riedlerm / multimodal_rag_for_industry
Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications
☆54 Updated 8 months ago
Atik-Ahamed / MambaTab
MambaTab: A Plug-and-Play Model for Learning Tabular Data
☆22 Updated last month
IIT-DM / Fin-Fact
A Benchmark Dataset for Multimodal Scientific Fact Checking
☆17 Updated 10 months ago
FKarl / short-text-classification
This repository contains code to reproduce the results in our paper "Transformers are Short Text Classifiers: A Study of Inductive Short …
☆43 Updated 2 years ago
RManLuo / ChatRule
Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.
☆66 Updated last year
fshnkarimi / Fine-tuning-an-LLM-using-LoRA
📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…
☆50 Updated last year
ShaderManager / RetNet
PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models
☆14 Updated last year
XZhang97666 / AlpaCare
☆91 Updated 5 months ago
whitepurple / HBM-loss-for-HTC
[ACL 2024 Findings] Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classification
☆15 Updated 8 months ago
pranftw / openreview_scraper
Scrape papers from OpenReview using OpenReview API
☆50 Updated 4 months ago
muditbhargava66 / PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
☆293 Updated last year
zjunlp / knowledge-rumination
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
☆17 Updated 2 years ago
IkeYang / ViTime
The official repository of paper "ViTime: A Visual Intelligence-based Foundation Model for Time Series Forecasting"
☆95 Updated 8 months ago
uclaml / MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
☆31 Updated last year
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆55 Updated 3 months ago
Dousia / MetricPrompt
Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification
☆19 Updated 11 months ago
VanekPetr / flan-t5-text-classifier
Fine-tuning of Flan-T5 LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…
☆39 Updated 8 months ago
claCase / Attention-as-RNN
Unofficial implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…
☆27 Updated 11 months ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64 Updated 11 months ago