OpenLLM-France / Lit-Claire
Continual pretraining of foundation LLM using ⚡ Lightning Fabric
☆37Updated 11 months ago
Alternatives and similar repositories for Lit-Claire
Users who are interested in Lit-Claire are comparing it to the libraries listed below.
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆52Updated this week
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 8 months ago
- MTEB: Massive Text Embedding Benchmark French extended☆19Updated last year
- simple to use, pretrained/training-less models for speaker diarization☆21Updated 2 years ago
- Coqui Inference Engine☆41Updated 4 years ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- A polite and user-friendly downloader for Common Crawl data☆59Updated 3 months ago
- ☆67Updated last year
- Code for continual pretraining of LUCIE☆49Updated last month
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- A repository of instructions in French to fine-tune LLMs☆17Updated 2 years ago
- The application performs real-time inference on audio from an ALSA capture device☆34Updated 5 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆59Updated 11 months ago
- Openfst mirror with some fixes☆14Updated last year
- ☆16Updated 4 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- Massive Multimodal Open RAG & Extraction: a scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆161Updated 3 weeks ago
- ☆43Updated last month
- Pretraining data reconstruction scripts for Apertus☆102Updated 3 weeks ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- A small rust-based data loader☆32Updated last week
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆42Updated 2 years ago
- The Foundation Model Transparency Index☆83Updated last year
- text-to-speech alignment java software☆20Updated 6 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated 3 weeks ago
- 🔀 Deployment of LLMs at large scale using a vLLM server for inference☆27Updated last month
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpo…☆116Updated last year