swapUniba / LLaMAntino
☆36Updated 9 months ago
Related projects: ⓘ
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆124Updated 9 months ago
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆78Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 3 months ago
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…☆14Updated last week
- The home of Stambecco 🦌: Italian Instruction-following LLaMA Model☆20Updated last year
- Knowledge pills on Neural Search☆24Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆17Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆207Updated 2 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated this week
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆104Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆51Updated last month
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).☆24Updated 2 months ago
- Let's build better datasets, together!☆195Updated last month
- A HuggingFace compatible xLSTM trainer.☆57Updated last week
- Generalist and Lightweight Model for Text Classification☆29Updated 2 weeks ago
- Interpretability for sequence generation models 🐛 🔍☆361Updated 3 weeks ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆100Updated 5 months ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆36Updated last week
- A large scale dataset for Question Answering in Italian☆24Updated 5 years ago
- ☆75Updated 3 weeks ago
- ☆56Updated 7 months ago
- Bi-encoder entity linking architecture☆40Updated last week
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- ☆111Updated last week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆180Updated last month
- ☆56Updated this week
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆237Updated last week
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆131Updated 3 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated last year
- Prune transformer layers☆60Updated 3 months ago