CoderPat / croissant-llm-training
Repository containing the code for training the CroissantLLM
☆21Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for croissant-llm-training
- Page de préconfiguration de la communauté OpenLLM-France☆42Updated 9 months ago
- Repository for the EM German Model☆104Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Updated 9 months ago
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆116Updated this week
- French instruction-following and chat models☆501Updated last week
- Let's build better datasets, together!☆206Updated this week
- A repository containing the code for translating popular LLM benchmarks to German.☆24Updated last year
- 🌱 EcoLogits tracks the energy consumption and environmental footprint of using generative AI models through APIs.☆88Updated last week
- Manage scalable open LLM inference endpoints in Slurm clusters☆237Updated 4 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆183Updated last month
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆146Updated 5 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆246Updated 2 weeks ago
- ☆54Updated last year
- The Fastest State-of-the-Art Static Embeddings in the World☆473Updated this week
- awesome synthetic (text) datasets☆242Updated 3 weeks ago
- Late Interaction Models Training & Retrieval☆165Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆61Updated 2 weeks ago
- Efficiently find the best-suited language model (LM) for your NLP task☆91Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆51Updated 3 weeks ago
- ☆106Updated 2 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆53Updated 3 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆61Updated 9 months ago
- ☆64Updated 9 months ago
- ☆93Updated last month
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆196Updated 6 months ago
- ☆131Updated 4 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆386Updated 9 months ago
- Start a server from the MLX library.☆161Updated 3 months ago
- ☆66Updated this week
- ☆433Updated 10 months ago