OpenLLM-France / Lit-Claire
Continual pretraining of foundation LLM using ⚡ Lightning Fabric
☆34Updated 2 months ago
Alternatives and similar repositories for Lit-Claire:
Users that are interested in Lit-Claire are comparing it to the libraries listed below
- Tools to do lexicometry on media☆41Updated last year
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆113Updated 5 months ago
- Scripts for training Kaldi for German speech recognition (ASR).☆24Updated 4 years ago
- MTEB: Massive Text Embedding Benchmark French extended☆19Updated 8 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Softcatalà neural translation models☆18Updated 3 weeks ago
- Tooling for producing French dataset for Common Voice☆101Updated 3 weeks ago
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆120Updated 3 weeks ago
- docker for HF wav2vec2-sprint☆13Updated 3 years ago
- A python package for whisper normalizer☆47Updated 2 months ago
- simple to use, pretrained/training-less models for speaker diarization☆21Updated last year
- The code that runs my blog: https://blog.gpt4.org/☆10Updated 3 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Ragtime🎹 is an LLMOps framework to automate testing and comparison for text to text large language models☆10Updated 5 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Simplify stop motion animation with machine learning.☆29Updated 3 years ago
- Tunable pipelines☆31Updated last week
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆58Updated 2 years ago
- Speaker diarization service☆21Updated last month
- GGML implementation of BERT model with Python bindings and quantization.☆27Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆37Updated this week
- A repository of instructions in French to fine-tune LLMs☆17Updated last year
- Audio tokenization, in the fastest way possible!☆48Updated 5 months ago
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Website and documentation☆19Updated last month
- Speaker diarization and speech to text☆14Updated 4 years ago