CoderPat / croissant-llm-training
Repository containing the code for training the CroissantLLM
☆21Updated last year
Alternatives and similar repositories for croissant-llm-training:
Users who are interested in croissant-llm-training are comparing it to the libraries listed below.
- French instruction-following and chat models☆503Updated 4 months ago
- The robust European language model benchmark.☆99Updated last week
- Preconfiguration page for the OpenLLM-France community☆46Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆120Updated this week
- Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆121Updated 2 weeks ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆168Updated 10 months ago
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆58Updated 8 months ago
- Let's build better datasets, together!☆259Updated 4 months ago
- Late Interaction Models Training & Retrieval☆276Updated last week
- A library for working with prompt templates locally or on the Hugging Face Hub.☆45Updated last month
- ☆67Updated last year
- ☆113Updated 2 weeks ago
- Repository for the EM German Model☆109Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany☆43Updated 7 months ago
- Code for the MTEB Arena☆19Updated 7 months ago
- Data set of Finnish grey literature, containing curated Dublin Core style metadata and links to original PDF publications☆24Updated last week
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Benchmarking library for RAG☆193Updated last week
- Generalist and Lightweight Model for Text Classification☆121Updated 2 weeks ago
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆20Updated last week
- SpanMarker for Named Entity Recognition☆426Updated 3 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆127Updated 4 months ago
- Awesome list of resources about NLP applied to French☆58Updated 4 years ago
- Enhancing Translation with RAG-Powered Large Language Models☆77Updated last month
- ☆54Updated 2 weeks ago
- A French sequence-to-sequence pretrained model☆59Updated 2 years ago
- A Scandinavian Benchmark for sentence embeddings☆36Updated 2 months ago