CoderPat / croissant-llm-trainingLinks
Repository containing the code for training the CroissantLLM
☆21Updated last year
Alternatives and similar repositories for croissant-llm-training
Users that are interested in croissant-llm-training are comparing it to the libraries listed below
Sorting:
- Page de préconfiguration de la communauté OpenLLM-France☆47Updated last year
- The robust European language model benchmark.☆114Updated last week
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆147Updated 2 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated last year
- Repository for the EM German Model☆111Updated last year
- Easily embed, cluster and semantically label text datasets☆561Updated last year
- Let's build better datasets, together!☆260Updated 7 months ago
- Code for the MTEB Arena☆22Updated last month
- ☆104Updated 7 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆268Updated last year
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆45Updated 11 months ago
- A Scandinavian Benchmark for sentence embeddings☆40Updated 2 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆27Updated last year
- French instruction-following and chat models☆507Updated 8 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆381Updated this week
- German Alpaca Dataset (Cleaned + Translated)☆26Updated 2 years ago
- ☆177Updated last month
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆183Updated 2 months ago
- A library for working with prompt templates locally or on the Hugging Face Hub.☆49Updated 5 months ago
- ☆529Updated 8 months ago
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆39Updated this week
- ☆677Updated 3 months ago
- ☆129Updated 4 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆284Updated 5 months ago
- ☆51Updated 6 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆66Updated last month
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆332Updated last month
- German Language Understanding Evaluation Benchmark @NAACL24☆12Updated 2 weeks ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.☆16Updated 10 months ago