Toolkit for Indobenchmark
☆24Jan 5, 2024Updated 2 years ago
Alternatives and similar repositories for indobenchmark-toolkit
Users that are interested in indobenchmark-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, …☆79Nov 16, 2024Updated last year
- Aspect and opinion terms extraction for hotel's review from AiryRooms in Bahasa Indonesia☆16Jul 3, 2019Updated 6 years ago
- DAC UNPAD IFEST 2021 Competition Selection Phase, predicting E-Wallet Sentiment and Topic Classification Using CNN - LSTM models☆14Sep 22, 2021Updated 4 years ago
- A benchmark dataset for Indonesian text summarization.☆76Mar 20, 2019Updated 7 years ago
- Indonesian Language Models and its Usage☆162May 22, 2023Updated 2 years ago
- A report crawler for kampus merdeka website consist of Intern and Studi Independent report generator☆21Jul 20, 2022Updated 3 years ago
- This repo is about how-to-use Indonesian NER with spaCy☆17Mar 27, 2022Updated 3 years ago
- Dependency Parser and NER model for Bahasa Indonesia Spacy 2.1☆20Jul 17, 2020Updated 5 years ago
- The first large-scale summarization corpus for the Indonesian language. AACL 2020.☆38Mar 4, 2021Updated 5 years ago
- ☆14Mar 8, 2019Updated 7 years ago
- IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented…☆107Dec 14, 2020Updated 5 years ago
- A fine tuned IndoBERT model for University Sentiment On Social Media☆14Jun 3, 2025Updated 9 months ago
- Script to simulates a phishing attack by making concurrent HTTP requests to phishing links (Dana Kaget)☆15Dec 27, 2023Updated 2 years ago
- Repository ini berisikan kumpulan data mentah berupa artikel dari berbagai media online di Indonesia. (Raw dataset of Indonesian news art…☆42Mar 24, 2019Updated 7 years ago
- DAC Unpad 2021 Final, predicting government sentiment analytics topic of PPKM COVID-19 Policy☆17Oct 9, 2021Updated 4 years ago
- ☆20Nov 7, 2024Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- My personal website (hopefully)☆21May 20, 2025Updated 10 months ago
- Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec☆90Jan 28, 2026Updated last month
- A dataset for Indonesian Named Entity Recognizer☆30Dec 10, 2020Updated 5 years ago
- ☆10Dec 6, 2021Updated 4 years ago
- Synthetically generate random text document images with ground-truth☆12Jul 20, 2021Updated 4 years ago
- The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained Indo…☆638Nov 16, 2024Updated last year
- IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)☆72Sep 13, 2021Updated 4 years ago
- Converter for images to panoramic viewer☆11Jun 22, 2022Updated 3 years ago
- Latest user agent strings for major browsers and OSs☆26Sep 2, 2024Updated last year
- Tensorflow ML API Template for Text and Image Input using FastAPI 🚀☆36Jun 14, 2023Updated 2 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- A collaborative project to collect datasets in Indonesian languages.☆281Jun 2, 2024Updated last year
- SafeCity: Understanding Diverse Forms of Sexual Harassment Personal Stories, EMNLP 2018☆10Sep 14, 2018Updated 7 years ago
- Developing NLP Applications Using NLTK in Python by Packt Publishing☆11Jan 15, 2021Updated 5 years ago
- SemEval 2019 Task 4: Hyperpartisan News Detection☆10Nov 9, 2019Updated 6 years ago
- NLP Datasets for Indonesian☆126Feb 11, 2023Updated 3 years ago
- ☆14Jan 12, 2014Updated 12 years ago
- Repo that contains code to automate data ingestion from Drive to GCS☆10Jun 30, 2022Updated 3 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- ☆11Apr 23, 2024Updated last year
- Scrape South African news☆12May 22, 2023Updated 2 years ago
- Indonesia Sentiment Analysis Dataset☆43Jul 14, 2022Updated 3 years ago