teelinsan / camoscio
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127Updated last year
Alternatives and similar repositories for camoscio:
Users that are interested in camoscio are comparing it to the libraries listed below
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆80Updated last year
- ☆37Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 8 months ago
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…☆17Updated 5 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆26Updated 5 months ago
- The home of Stambecco 🦌: Italian Instruction-following LLaMA Model☆20Updated last year
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆104Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 7 months ago
- Knowledge pills on Neural Search☆25Updated last year
- Interpretability for sequence generation models 🐛 🔍☆405Updated 3 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 7 months ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆118Updated 3 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆175Updated last month
- Semantic search engine indexing 95 million academic publications☆78Updated this week
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- ☆92Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆104Updated 9 months ago
- Experiments with generating opensource language model assistants☆97Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- Tune MPTs☆84Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 5 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆188Updated 4 months ago
- 📚 Datasets and models for instruction-tuning☆234Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year
- Late Interaction Models Training & Retrieval☆246Updated this week