A HuggingFace compatible Small Language Model trainer.
☆77Feb 2, 2025Updated last year
Alternatives and similar repositories for helibrunna
Users that are interested in helibrunna are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Resources about xLSTM by Sepp Hochreiter☆317Nov 13, 2024Updated last year
- Official repository of the xLSTM.☆2,177May 28, 2026Updated last month
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆18Mar 21, 2025Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated last year
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆19Sep 25, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- docker for HF wav2vec2-sprint☆13Mar 26, 2021Updated 5 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- ☆13Jun 16, 2021Updated 5 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- (updated 2026, We will soon release new version of dataset and a playground) An electric guitar transcription model for the real world so…☆15Jan 11, 2023Updated 3 years ago
- ☆10Oct 2, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- ☆13May 29, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆12Feb 9, 2021Updated 5 years ago
- Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024)☆16Aug 6, 2024Updated last year
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference☆25Jan 27, 2025Updated last year
- Implementation of Cascaded Head-colliding Attention (ACL'2021)☆11Sep 16, 2021Updated 4 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆28Jun 7, 2024Updated 2 years ago
- ☆75Jun 25, 2026Updated last week
- Code for the paper "A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature…☆19Dec 14, 2022Updated 3 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆89Feb 10, 2026Updated 4 months ago
- Implementation of the table detection and table structure recognition deep learning model described in the paper "ClusterTabNet: Supervis…☆13Mar 15, 2025Updated last year
- A repository for research on medium sized language models.☆78May 23, 2024Updated 2 years ago
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- A CLI for generating synthetic data☆43May 14, 2025Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆56Mar 25, 2025Updated last year
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆89Mar 27, 2026Updated 3 months ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- ⚡ Simplest and most professional TimeSeries forecasting + AutoML. Preprocess → Predict.☆11May 20, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Repository for ICASSP 2024 Paper "SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription"☆31Dec 6, 2024Updated last year
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆306Jun 28, 2024Updated 2 years ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆66Mar 12, 2026Updated 3 months ago
- docker-gc is a microservice to cleanup docker images automatically based on recycling strategy. Dockerfile and helm charts are provided f…☆21Apr 19, 2023Updated 3 years ago
- ☆71Jul 8, 2025Updated 11 months ago
- An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"☆33Jul 17, 2025Updated 11 months ago
- ☆20Apr 26, 2026Updated 2 months ago