getorca / mamba_for_sequence_classificationLinks
☆17Updated last year
Alternatives and similar repositories for mamba_for_sequence_classification
Users that are interested in mamba_for_sequence_classification are comparing it to the libraries listed below
Sorting:
- Text Classification using Mamba Model☆22Updated last year
- a curated list of the role of small models in the LLM era☆102Updated 9 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆197Updated 3 months ago
- This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…☆17Updated last year
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Updated 3 months ago
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆33Updated last year
- Resources about xLSTM by Sepp Hochreiter☆317Updated 8 months ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆129Updated last year
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆23Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆173Updated 3 months ago
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆54Updated 8 months ago
- MambaTab: A Plug-and-Play Model for Learning Tabular Data☆22Updated last month
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆17Updated 10 months ago
- This repository contains code to reproduce the results in our paper "Transformers are Short Text Classifiers: A Study of Inductive Short …☆43Updated 2 years ago
- Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.☆66Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆50Updated last year
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Updated last year
- ☆91Updated 5 months ago
- [ACL 2024 Findings] Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classification☆15Updated 8 months ago
- Scrape papers from OpenReview using OpenReview API☆50Updated 4 months ago
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆293Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated 2 years ago
- The official repository of paper "ViTime: A Visual Intelligence-based Foundation Model for Time Series Forecasting"☆95Updated 8 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated 3 months ago
- Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification☆19Updated 11 months ago
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆39Updated 8 months ago
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆27Updated 11 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated 11 months ago