AI-Guru / helibrunna
A HuggingFace compatible xLSTM trainer.
☆57 Updated last week
Related projects:
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆38 Updated 3 weeks ago
- ☆29 Updated 3 weeks ago
- ☆73 Updated 5 months ago
- German dataset for DPR model training ☆16 Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆34 Updated last week
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆54 Updated last month
- Set of scripts to finetune LLMs ☆36 Updated 5 months ago
- ☆75 Updated 3 weeks ago
- ☆58 Updated 3 weeks ago
- Collection of autoregressive model implementation ☆62 Updated 2 weeks ago
- Tokun to can tokens ☆13 Updated this week
- ☆42 Updated 3 weeks ago
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆78 Updated 6 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆82 Updated 3 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆58 Updated 2 weeks ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers. ☆57 Updated 4 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆43 Updated 2 weeks ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow ☆55 Updated 10 months ago
- SaLSa Optimizer implementation (No learning rates needed) ☆27 Updated last week
- Implementation of GateLoop Transformer in Pytorch and Jax ☆86 Updated 3 months ago
- NLP with Rust for Python 🦀🐍 ☆57 Updated 3 months ago
- Code for the MTEB Arena ☆14 Updated this week
- ☆43 Updated 7 months ago
- Library to facilitate pruning of LLMs based on context ☆31 Updated 7 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆40 Updated 9 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆51 Updated 3 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆57 Updated 7 months ago
- ☆50 Updated last month
- Code for NeurIPS LLM Efficiency Challenge ☆52 Updated 5 months ago
- Reads arXiv papers using Text-to-Speech ☆55 Updated last year