CoderPat / croissant-llm-training
Repository containing the code for training the CroissantLLM
☆21Updated last year
Alternatives and similar repositories for croissant-llm-training
Users that are interested in croissant-llm-training are comparing it to the libraries listed below
- The robust European language model benchmark.☆120Updated this week
- Preconfiguration page of the OpenLLM-France community☆47Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆285Updated 5 months ago
- Repository for the EM German Model☆112Updated last year
- 🤗 Benchmark Large Language Models Reliably On Your Data☆389Updated last week
- A library for easily merging multiple LLM experts and efficiently training the merged LLM.☆491Updated last year
- A framework for few-shot evaluation of language models, including the Turkish sets used in the TurkishopenLLM leaderboard on Hugging Face☆18Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆149Updated 2 months ago
- Easily embed, cluster and semantically label text datasets☆566Updated last year
- Interpretability for sequence generation models 🐛 🔍☆437Updated 4 months ago
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.☆52Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆270Updated last year
- French instruction-following and chat models☆507Updated 8 months ago
- Let's build better datasets, together!☆262Updated 8 months ago
- Code for the MTEB Arena☆22Updated 2 months ago
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆41Updated this week
- Automatically evaluate your LLMs in Google Colab☆655Updated last year
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆348Updated last month
- Late Interaction Models Training & Retrieval☆532Updated this week
- Dataset collection and preprocessing framework for NLP extreme multitask learning☆186Updated last month
- Code for training & evaluating Contextual Document Embedding models☆197Updated 3 months ago
- My personal site☆78Updated last year
- ☆134Updated last week
- A Scandinavian Benchmark for sentence embeddings☆40Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated last month
- A repository containing the code for translating popular LLM benchmarks to German.☆28Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated)☆26Updated 2 years ago
- Guideline following Large Language Model for Information Extraction☆392Updated 10 months ago
- A library for working with prompt templates locally or on the Hugging Face Hub.☆50Updated 5 months ago