getorca / mamba_for_sequence_classification
☆16Updated last year
Alternatives and similar repositories for mamba_for_sequence_classification
Users that are interested in mamba_for_sequence_classification are comparing it to the libraries listed below
- Text Classification using Mamba Model☆26Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆212Updated this week
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆23Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆55Updated 2 years ago
- a curated list of the role of small models in the LLM era☆111Updated last year
- Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.☆73Updated last year
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆31Updated 6 months ago
- ☆94Updated 11 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Updated 3 weeks ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆203Updated last week
- This repository contains code to reproduce the results in our paper "Transformers are Short Text Classifiers: A Study of Inductive Short …☆45Updated 3 years ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆25Updated 2 years ago
- TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields☆73Updated this week
- This is the code that went into our practical dive using mamba for information extraction☆57Updated 2 years ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated 3 weeks ago
- ☆129Updated last year
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆134Updated last year
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN)☆41Updated 10 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆35Updated 2 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated 2 years ago
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆23Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- Official implementation of 'Advancing Graph Convolutional Networks via General Spectral Wavelets'☆34Updated 6 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- The official repo of TimeLlama, an instruction-finetuned Llama2 series that improves complex temporal reasoning ability.☆41Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆126Updated 2 months ago
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆302Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year