CoderPat/croissant-llm-training



Repository containing the code for training the CroissantLLM

☆21

Alternatives and similar repositories for croissant-llm-training

Users that are interested in croissant-llm-training are comparing it to the libraries listed below.


elsatch / daily_hf_papers_abstracts
This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file
☆16 · Jul 26, 2024 · Updated 2 years ago
Nicolas-BZRD / llm-distillation
☆11 · Feb 3, 2025 · Updated last year
GeorgeVern / smala
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
☆13 · Sep 17, 2021 · Updated 4 years ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆70 · Nov 17, 2025 · Updated 8 months ago
hesamsheikh / dataset_git_commands
☆13 · Aug 5, 2024 · Updated last year
plutonium-239 / memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation
☆12 · Aug 29, 2024 · Updated last year
hpi-swa-teaching / Scamper
A Smalltalk Web Browser for Squeak/Smalltalk
☆18 · Apr 18, 2022 · Updated 4 years ago
aiintelligentsystems / next-level-bert
☆16 · Jun 14, 2024 · Updated 2 years ago
thisisanshgupta / Senna
Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…
☆19 · Sep 5, 2024 · Updated last year
sarkiisov / startpage-frontend
☆10 · Aug 11, 2025 · Updated 11 months ago
emnetter / codeislow
Searches a text file for references to articles of French legal codes, then queries the Légifrance API
☆19 · Dec 11, 2023 · Updated 2 years ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
☆22 · Dec 15, 2023 · Updated 2 years ago
zextras / tech-doc
Zextras Technical Documentation
☆13 · Jul 10, 2026 · Updated 2 weeks ago
egozverev / aside
ASIDE: Architectural Separation of Instructions and Data in Language Models [ICLR 2026]
☆16 · Jun 10, 2026 · Updated last month
Tufalabs / textbook-to-rl
☆29 · Aug 27, 2025 · Updated 11 months ago
Magirad / F2_k_Spectral_Graft
This is a vibe coded node for Flux.2 Klein 9b model for adding/altering objects, clothes swapping, face swapping etc.
☆16 · Jun 5, 2026 · Updated last month
ahars / spark-jhipster
A JHipster app reporting to Spark Streaming
☆14 · Dec 23, 2014 · Updated 11 years ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30 · May 27, 2025 · Updated last year
antoinejeannot / jurisprudence
French Jurisprudences at your fingertips @ every 72h
☆19 · Nov 18, 2025 · Updated 8 months ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29 · Apr 17, 2024 · Updated 2 years ago
302ai / 302_lipsync
📷🤖 AI Talking Photo 🚀✨
☆16 · Aug 26, 2025 · Updated 11 months ago
RPC2 / AutoInject
☆20 · Jun 12, 2026 · Updated last month
MikeWangWZHL / VDLM
Repo for paper: https://arxiv.org/abs/2404.06479
☆30 · Oct 3, 2024 · Updated last year
Lawfam / LLM-Collab
LLM Collaboration
☆12 · Aug 23, 2024 · Updated last year
osoleve / glitchlings
Enemies for your LLM
☆38 · Jan 20, 2026 · Updated 6 months ago
HamzaG737 / legal-code-rag
Repo for advanced RAG evaluation on French legal code data
☆26 · Apr 7, 2024 · Updated 2 years ago
peterkickasspeter-civit / ImageMetadataViewer
To view metadata
☆19 · Feb 18, 2026 · Updated 5 months ago
shootthesound / ComfyUI-SequentialImageLoader
Load a folder of images one at a time into ComfyUI — a fresh image on every Queue. Natural sort, filetype picker (png/jpg/webp/.../all), …
☆22 · Updated this week
Nero10578 / LLM-Inference-Benchmark
☆14 · Aug 25, 2024 · Updated last year
mkevenaar / FarmPi
OctoFarm on Raspberry Pi
☆23 · Feb 7, 2024 · Updated 2 years ago
NOTMEE12 / AIPT
AI model Prompt Tester (AIPT for short) is a simple app that will check how suitable each model is for a given prompt.
☆15 · Jul 7, 2024 · Updated 2 years ago
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆46 · Aug 16, 2025 · Updated 11 months ago
ai-joe-git / comfyui-webui-generator
Generate beautiful, standalone HTML interfaces for your ComfyUI workflows with just a few clicks!
☆21 · Updated this week
MartinuzziFrancesco / CellularAutomata.jl
Cellular automata creation and analysis tools
☆28 · Jul 17, 2026 · Updated last week
PlugOvr-ai / PlugOvr
AI Assistant
☆21 · Feb 21, 2026 · Updated 5 months ago
vaibhav-systango / react-contact-number-input
☆14 · Dec 5, 2023 · Updated 2 years ago
sunkencity999 / ollama_shell
A powerful AI-integrated Terminal Shell powered by the Ollama LLM interface.
☆15 · Jul 9, 2026 · Updated 2 weeks ago
o-l-l-i / simple-captioner
Simple image and video captioning app with a Gradio UI, powered by Qwen2.5/3 VL Instruct.
☆25 · Apr 1, 2026 · Updated 3 months ago
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆37 · Jun 7, 2025 · Updated last year