aixplain / aiXplainLinks
aiXplain enables python programmers to add AI functions to their software.
β48Updated this week
Alternatives and similar repositories for aiXplain
Users that are interested in aiXplain are comparing it to the libraries listed below
Sorting:
- MAFAND-MTβ57Updated last year
- π¬ Language Identification with Support for More Than 2000 Labels -- EMNLP 2023β153Updated 3 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.β51Updated last month
- β109Updated 8 months ago
- Efficient few-shot learning with cross-encoders.β57Updated last year
- The FLORES+ Machine Translation Benchmarkβ108Updated 9 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"β60Updated 10 months ago
- NTREX -- News Test References for MT Evaluationβ85Updated last year
- Crosslingual Question Answering for African Languagesβ31Updated 11 months ago
- Pre-train Static Word Embeddingsβ85Updated this week
- Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)β130Updated 11 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.β60Updated last year
- Efficiently find the best-suited language model (LM) for your NLP taskβ127Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β80Updated 2 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentatiβ¦β41Updated 2 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languagesβ75Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.β32Updated 5 months ago
- β17Updated 2 years ago
- Tools for managing datasets for governance and training.β83Updated 3 weeks ago
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ186Updated last month
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"β33Updated 2 months ago
- β14Updated last month
- Open information and community for machine translationβ80Updated last week
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.β335Updated 8 months ago
- Seed Machine Translation Dataβ33Updated 9 months ago
- π« SpaCy wrapper for ConceptNet π«β94Updated 2 years ago
- The pipeline for the OSCAR corpusβ171Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β110Updated last year
- Curriculum trainingβ18Updated 2 months ago