CoderPat / croissant-llm-training
Repository containing the code for training the CroissantLLM
☆21Updated 11 months ago
Alternatives and similar repositories for croissant-llm-training:
Users that are interested in croissant-llm-training are comparing it to the libraries listed below
- Page de préconfiguration de la communauté OpenLLM-France☆43Updated 11 months ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆42Updated 4 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆114Updated 2 weeks ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 6 months ago
- Repository for the EM German Model☆105Updated last year
- Evaluation of language models on mono- or multilingual tasks.☆81Updated this week
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆111Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆249Updated 6 months ago
- ☆66Updated last month
- A framework for few-shot evaluation of autoregressive language models.☆13Updated 11 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆63Updated 11 months ago
- Late Interaction Models Training & Retrieval☆229Updated last week
- Let's build better datasets, together!☆250Updated last month
- ☆79Updated last month
- A Scandinavian Benchmark for sentence embeddings☆32Updated last week
- Generalist and Lightweight Model for Text Classification☆59Updated last week
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆119Updated last week
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆260Updated last week
- ☆110Updated 4 months ago
- ☆108Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- Fork du code de LMSYS (FastChat) pour l'arène de comparaison de LLM francophones Compar:IA☆14Updated this week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆182Updated 3 months ago
- An introduction to LLM Sampling☆75Updated last month
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆92Updated last year
- A repository of instructions in French to fine-tune LLMs☆17Updated last year
- 🌱 EcoLogits tracks the energy consumption and environmental footprint of using generative AI models through APIs.☆117Updated last week