marcopoli / LLaMAntino-3-ANITA
The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an improved model the for Italian Language 🇮🇹 use cases.
☆15Updated 4 months ago
Alternatives and similar repositories for LLaMAntino-3-ANITA:
Users that are interested in LLaMAntino-3-ANITA are comparing it to the libraries listed below
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- ☆37Updated last year
- Annotated corpus + evaluation metrics for text anonymisation☆53Updated 11 months ago
- Knowledge pills on Neural Search☆25Updated last year
- A software for transferring pre-trained English models to foreign languages☆18Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 7 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆47Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- ☆22Updated 2 years ago
- A collection of Italian benchmarks for LLM evaluation☆26Updated last month
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated 4 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- ☆15Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- ☆83Updated 5 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆104Updated 9 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated last year
- ☆30Updated 3 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 weeks ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 6 months ago
- Benchmarking Large Language Models☆86Updated this week
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- ☆51Updated last year
- ☆35Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆51Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆22Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 5 months ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆107Updated 2 years ago
- ☆38Updated last month
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago