teelinsan / camoscioLinks
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127Updated 2 years ago
Alternatives and similar repositories for camoscio
Users that are interested in camoscio are comparing it to the libraries listed below
Sorting:
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆85Updated 2 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- ☆39Updated last year
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆110Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆115Updated 2 years ago
- A repository containing the code for translating popular LLM benchmarks to German.☆31Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆63Updated last year
- ☆94Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆214Updated 2 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 5 months ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆338Updated 11 months ago
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- SpanMarker for Named Entity Recognition☆462Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated 2 years ago
- ☆84Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍☆449Updated last week
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Let's build better datasets, together!☆265Updated 11 months ago
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆740Updated 2 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆175Updated 3 weeks ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- ☆65Updated 2 years ago
- Tools for managing datasets for governance and training.☆87Updated 2 weeks ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- The robust European language model benchmark.☆142Updated this week