aisingapore / BHASA
☆17Updated 6 months ago
Alternatives and similar repositories for BHASA
Users who are interested in BHASA are comparing it to the repositories listed below
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆25Updated 9 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆83Updated 5 months ago
- High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper…☆101Updated 2 years ago
- ☆30Updated last year
- The first large-scale summarization corpus for the Indonesian language. AACL 2020.☆37Updated 4 years ago
- Benchmarking Multidomain English-Indonesian Machine Translation☆16Updated 4 years ago
- Multilingual Speech Recognition for Indonesian Languages☆64Updated 2 years ago
- A curated list of research papers and resources on Indonesian languages☆39Updated last year
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word …☆22Updated 3 years ago
- Embedding Representation for Indonesian Sentences!☆18Updated 10 months ago
- IndoNLI☆19Updated 3 years ago
- ☆47Updated 4 months ago
- The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, …☆74Updated 7 months ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated last year
- Truly flash implementation of the DeBERTa disentangled attention mechanism.☆58Updated last month
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆19Updated 11 months ago
- ☆10Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆78Updated 2 weeks ago
- Chatbot for The Carbon Almanac book or a climate change related topic☆14Updated 2 years ago
- ☆20Updated 2 months ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Updated 4 years ago
- A collaborative project to collect datasets in Indonesian languages.☆269Updated last year
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Build Web Datasets with Ease☆33Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- ☆14Updated 8 months ago
- A collection of building blocks for building fine-tunable metric learning models☆32Updated 2 months ago
- Demo example of consumer goods categorization☆28Updated last year