Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentation technique. Queries are derived from TOPv2, a multi-domain task oriented semantic parsing dataset. Tests suggest that with CST5, up to 20x less labeled data can achieve the same semantic parsing performance.
☆41Feb 1, 2023Updated 3 years ago
Alternatives and similar repositories for Hinglish-TOP-Dataset
Users that are interested in Hinglish-TOP-Dataset are comparing it to the libraries listed below
Sorting:
- Experimental webapp for creating music using graphs☆21Aug 27, 2015Updated 10 years ago
- We present algorithms to extract data from annual reports (PDF files) using the Python programming language with the aim to automate the …☆16Jul 1, 2021Updated 4 years ago
- ☆17Feb 22, 2024Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆51Jul 20, 2022Updated 3 years ago
- A collection of trading settings for the Galileo FX trading robot. These settings are designed to optimize trading strategies across vari…☆13Jan 27, 2025Updated last year
- MetaQA: Combining Expert Agents for Multi-Skill Question Answering☆23Mar 13, 2022Updated 3 years ago
- This repository is dedicated to development of code-mixed language resources.☆27Jul 22, 2023Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆64Oct 26, 2024Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆25Aug 31, 2025Updated 6 months ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning☆32Jun 8, 2023Updated 2 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- OxLM: Oxford Neural Language Modelling Toolkit☆38Nov 6, 2015Updated 10 years ago
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated last year
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Public repository to host our Checker IP written in SVA that is ported to run on open-source Verilator.☆12Mar 31, 2023Updated 2 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆11Feb 28, 2022Updated 4 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- Feature extraction from sound signals along with complete CNN model and evaluations using tensorflow, keras and, librosa for MFCC generat…☆10Jan 1, 2022Updated 4 years ago
- ☆10Sep 15, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Walks through building different HTML5 layouts for AV systems☆12Oct 15, 2021Updated 4 years ago
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…☆10Jul 21, 2023Updated 2 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆12Dec 12, 2019Updated 6 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- ☆13Feb 27, 2026Updated last week
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- EUDAMED API reference (unofficial)☆18Apr 26, 2025Updated 10 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago