getorca / mamba_for_sequence_classification
☆16Updated last year
Alternatives and similar repositories for mamba_for_sequence_classification
Users interested in mamba_for_sequence_classification are comparing it to the libraries listed below.
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆211Updated last month
- Text Classification using Mamba Model☆24Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆200Updated last month
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆24Updated 2 years ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆134Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆30Updated 6 months ago
- a curated list of the role of small models in the LLM era☆111Updated last year
- ☆80Updated last year
- A RL env with procedurally generated symbolic reasoning data☆29Updated last month
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆33Updated 2 years ago
- Research on Tabular Foundation Models☆65Updated last year
- ☆67Updated 2 years ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆111Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago
- ☆94Updated 10 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆133Updated last year
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆68Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆95Updated 2 years ago
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆55Updated 2 years ago
- [EMNLP 2022] ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks☆17Updated last year
- ☆22Updated last year
- ☆69Updated last year
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆35Updated 2 years ago
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆302Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated 2 years ago
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Updated 2 years ago
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆55Updated 4 months ago
- ☆12Updated 2 years ago
- ☆327Updated 2 years ago