teelinsan / camoscio
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127Updated last year
Alternatives and similar repositories for camoscio:
Users that are interested in camoscio are comparing it to the libraries listed below
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆81Updated last year
- ☆37Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 10 months ago
- The home of Stambecco 🦌: Italian Instruction-following LLaMA Model☆20Updated 2 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆105Updated 2 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 7 months ago
- A collection of Italian benchmarks for LLM evaluation☆30Updated last week
- Experiments with generating opensource language model assistants☆97Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆58Updated 8 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated 6 months ago
- Knowledge pills on Neural Search☆26Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 10 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 11 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- A large scale dataset for Question Answering in Italian☆27Updated 6 years ago
- ☆92Updated last year
- Semantic search engine indexing 110 million academic publications☆80Updated last month
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Fact checking baseline combining dense retrieval and textual entailment☆28Updated 3 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆92Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- ☆148Updated 4 years ago
- ☆11Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆180Updated 3 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 2 weeks ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- Pre-training BART model for the Italian Language☆15Updated 2 years ago