google-research-datasets/Hinglish-TOP-Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research-datasets/Hinglish-TOP-Dataset)

google-research-datasets / Hinglish-TOP-Dataset

Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentation technique. Queries are derived from TOPv2, a multi-domain task oriented semantic parsing dataset. Tests suggest that with CST5, up to 20x less labeled data can achieve the same semantic parsing performance.

☆41

Alternatives and similar repositories for Hinglish-TOP-Dataset

Users that are interested in Hinglish-TOP-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

philschmid / deep-learning-remote-runner
View on GitHub
☆16Aug 10, 2022Updated 3 years ago
ShareChatAI / MACD
View on GitHub
☆19Feb 22, 2024Updated 2 years ago
AI4Bharat / IndicInstruct
View on GitHub
Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"
☆65Oct 26, 2024Updated last year
AI4Bharat / indic-bart
View on GitHub
Pre-trained, multilingual sequence-to-sequence models for Indian languages
☆51Jul 20, 2022Updated 4 years ago
oriyor / turning_tables
View on GitHub
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22Nov 2, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
karthikncode / MorphoChain
View on GitHub
A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.
☆13Oct 10, 2023Updated 2 years ago
najeebkhan / text-to-speech-synthesis
View on GitHub
Hidden Markov model based text to speech synthesis system similar to HTS implemented in C#
☆11Dec 16, 2016Updated 9 years ago
azpoliak / eco
View on GitHub
Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)
☆15Apr 6, 2017Updated 9 years ago
neubig / lader
View on GitHub
A reordering tool for machine translation.
☆15May 3, 2019Updated 7 years ago
eugeneyan / text-to-image
View on GitHub
☆20Oct 24, 2022Updated 3 years ago
midas-research / hindi-nli-data
View on GitHub
a repository containing the details of natural language inference dataset in Hindi
☆14Dec 28, 2020Updated 5 years ago
EleutherAI / best-download
View on GitHub
URL downloader supporting checkpointing and continuous checksumming.
☆19Nov 29, 2023Updated 2 years ago
akikoe / nmtrnng
View on GitHub
C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"
☆21May 8, 2017Updated 9 years ago
MicrosoftTranslator / ToShipOrNotToShip
View on GitHub
☆19Dec 16, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
robmarkcole / text-insights-app
View on GitHub
Upload an image of a document and extract text, names, facts and figures
☆21Aug 12, 2024Updated last year
jcolano / transformer_step_by_step
View on GitHub
This repository contains the notebooks of the series 'transformers by doing - leaving no rock unturned'
☆13Sep 24, 2023Updated 2 years ago
TTS-cdac-mumbai / TBT
View on GitHub
☆14May 7, 2019Updated 7 years ago
philschmid / huggingface-container
View on GitHub
☆10Dec 15, 2022Updated 3 years ago
nateraw / spaces-docker-templates
View on GitHub
🚀🤗 A collection of templates for Hugging Face Spaces
☆35Oct 9, 2023Updated 2 years ago
devaansh100 / CLIPTrans
View on GitHub
Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…
☆20Jun 3, 2024Updated 2 years ago
codestoryai / typescript_parsing
View on GitHub
Small package for parsing typescript (js and variants) using ts-morph for powering code graph
☆10Aug 1, 2023Updated 2 years ago
circle-hit / MuCDN
View on GitHub
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10Jul 21, 2023Updated 3 years ago
HLTCHKUST / Perplexity-FactChecking
View on GitHub
Towards Few-Shot Fact-Checking via Perplexity
☆13Jun 11, 2021Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
CVxTz / distill-llm
View on GitHub
☆21Apr 6, 2024Updated 2 years ago
Prasanna1991 / LPR
View on GitHub
License Plate Recognition (LPR) dataset for Nepali motorbike license plate.
☆12Jul 27, 2018Updated 7 years ago
xlhex / dpe
View on GitHub
☆22Oct 26, 2020Updated 5 years ago
kinesiatricssxilm14 / CodeRepoQA
View on GitHub
CodeRepoQA dataset
☆15Feb 19, 2025Updated last year
lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
agentsea / toolfuse
View on GitHub
A common protocol for AI agent tools
☆10Oct 21, 2024Updated last year
yamato0811 / streamlit-langgraph-HITL-copy-generator
View on GitHub
StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション
☆11Feb 15, 2025Updated last year
sawa-zen / simple-nodejs-mcp-client
View on GitHub
This is a study repository for implementing a Model Context Protocol (MCP) client. It features a simple interactive MCP client implemente…
☆11Apr 26, 2025Updated last year
manandhar01 / StarUML_cracker
View on GitHub
Python script to crack StarUML for Linux and Windows.
☆14Jan 24, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mobarski / alpaca-libre
View on GitHub
Reimplementation of the task generation part from the Alpaca paper
☆118Apr 4, 2023Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
UKPLab / EACL21-personalized-conversational-system
View on GitHub
☆12Nov 19, 2022Updated 3 years ago
fywalter / label-bias
View on GitHub
A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning
☆10Aug 4, 2023Updated 2 years ago
INK-USC / FiD-ICL
View on GitHub
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆15Jul 24, 2023Updated 2 years ago
HomeroRR / rmm
View on GitHub
This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation
☆10Nov 22, 2022Updated 3 years ago
dmitry / yandex_mystem
View on GitHub
Yandex Mystem makes morphological analysis of a russian text
☆29Feb 15, 2018Updated 8 years ago