aixplain / aiXplainLinks
aiXplain enables python programmers to add AI functions to their software.
☆49Updated this week
Alternatives and similar repositories for aiXplain
Users that are interested in aiXplain are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆59Updated last year
- ☆116Updated 11 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆171Updated 5 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated last year
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆52Updated last month
- Pre-train Static Word Embeddings☆90Updated 2 months ago
- ☆15Updated 3 weeks ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆88Updated 4 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Crosslingual Question Answering for African Languages☆31Updated last year
- Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)☆132Updated last year
- Seed Machine Translation Data☆33Updated last year
- Open information and community for machine translation☆79Updated 2 weeks ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆65Updated this week
- Tools for managing datasets for governance and training.☆86Updated last week
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆165Updated 5 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- ☆58Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last month
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆85Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆188Updated 4 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆62Updated last year
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.☆49Updated 3 years ago
- The FLORES+ Machine Translation Benchmark☆109Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year