teelinsan / camoscio
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127 · Updated last year
Alternatives and similar repositories for camoscio:
Users that are interested in camoscio are comparing it to the libraries listed below
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome. ☆80 · Updated last year
- ☆37 · Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹 ☆30 · Updated 7 months ago
- The home of Stambecco: Italian Instruction-following LLaMA Model ☆20 · Updated last year
- UmBERTo: an Italian Language Model trained with Whole Word Masking. ☆104 · Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers. ☆213 · Updated 4 months ago
- Completion After Prompt Probability. Make your LLM make a choice ☆73 · Updated 2 months ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation) ☆18 · Updated 2 years ago
- The ANITA project *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im… ☆15 · Updated 4 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆25 · Updated 4 months ago
- ☆92 · Updated last year
- Generalist and Lightweight Model for Text Classification ☆59 · Updated last week
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs ☆114 · Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web ☆175 · Updated last year
- A large scale dataset for Question Answering in Italian ☆26 · Updated 6 years ago
- Knowledge pills on Neural Search ☆25 · Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling. ☆56 · Updated 6 months ago
- Experiments with generating open-source language model assistants ☆97 · Updated last year
- [WIP] A 🔥 interface for running code in the cloud ☆86 · Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆115 · Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 · Updated last year
- A Word Level Transformer layer based on PyTorch and 🤗 Transformers. ☆34 · Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆173 · Updated 3 weeks ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆93 · Updated last year
- ☆11 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆225 · Updated 2 months ago
- ☆76 · Updated last year
- Interpretability for sequence generation models ☆394 · Updated 2 months ago
- CLIP (Contrastive Language–Image Pre-training) for Italian ☆185 · Updated last year
- Let's build better datasets, together! ☆250 · Updated last month