teelinsan / camoscioLinks
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127Updated last year
Alternatives and similar repositories for camoscio
Users that are interested in camoscio are comparing it to the libraries listed below
Sorting:
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆84Updated 2 years ago
- ☆39Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated last year
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆108Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆115Updated 2 years ago
- 📚 Datasets and models for instruction-tuning☆239Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆738Updated 3 weeks ago
- Place where folks can contribute to 🤗 community events☆425Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated 3 weeks ago
- MAFAND-MT☆59Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆288Updated 7 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆187Updated 3 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Updated last year
- SpanMarker for Named Entity Recognition☆453Updated 9 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆328Updated 11 months ago
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- Tools for managing datasets for governance and training.☆85Updated this week
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 9 months ago
- Neural information retrieval / Semantic search / Bi-encoders☆174Updated 2 years ago
- The robust European language model benchmark.☆129Updated this week
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- ☆124Updated 11 months ago