ABHISHEKVALSAN / Malayalam-Newspaper-Article-Dataset

The project scraps articles from a malayalam newspaper website to create a corpus. A set of queries is created and corresponding ground truth answers is retrieved. This can be used as a dataset that can check new tools in future like malaylam stemmer, stopwords removal, lemmatizers, etc...
21Updated 2 years ago

Related projects

Alternatives and complementary repositories for Malayalam-Newspaper-Article-Dataset