OpenLLM-France / Lit-Claire
Continual pretraining of foundation LLM using ⚡ Lightning Fabric
☆37Updated last year
Alternatives and similar repositories for Lit-Claire
Users who are interested in Lit-Claire are comparing it to the libraries listed below
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆56Updated last week
- simple to use, pretrained/training-less models for speaker diarization☆21Updated 2 years ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 9 months ago
- ☆45Updated 2 months ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps☆63Updated 2 years ago
- cologne-phonetics implementation in python☆17Updated last year
- Massive Multimodal Open RAG & Extraction: a scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆179Updated this week
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆34Updated 3 weeks ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 3 weeks ago
- Softcatalà neural translation models☆20Updated last week
- Chunk Dedupe Estimation☆20Updated last year
- Datamodels for hugging face tokenizers☆86Updated last month
- Synthetic Dialog Generation and Analysis with LLMs☆118Updated 2 weeks ago
- MediaWiki Categories Model☆13Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 3 years ago
- MTEB: Massive Text Embedding Benchmark French extended☆19Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- The Foundation Model Transparency Index☆84Updated 3 weeks ago
- A polite and user-friendly downloader for Common Crawl data☆63Updated 4 months ago
- 🔀 Deployment of LLMs at large scale using a vLLM server for inference☆28Updated 2 weeks ago
- OpenFst mirror with some fixes☆14Updated last year
- Pretraining data reconstruction scripts for Apertus☆111Updated 2 months ago
- ☆17Updated 5 months ago
- Tooling for producing French dataset for Common Voice☆101Updated 11 months ago
- Website and documentation☆22Updated 2 months ago
- Your buddy in the (L)LM space.☆64Updated last year
- Seed Machine Translation Data☆33Updated last year