ALM-LAB / PACE
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
☆13Updated last year
Related projects ⓘ
Alternatives and complementary repositories for PACE
- ITALIC: An ITALian Intent Classification Dataset☆11Updated 11 months ago
- Repository for the LLM course☆11Updated last week
- ☆61Updated 3 months ago
- ☆51Updated last week
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆132Updated 9 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated last year
- ☆152Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 5 months ago
- ☆33Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆52Updated 3 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆46Updated last month
- Code for Zero-Shot Tokenizer Transfer☆115Updated 2 weeks ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆45Updated this week
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆14Updated 9 months ago
- ☆14Updated 3 weeks ago
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms"☆23Updated last year
- babyLM WhisBERT code☆17Updated 5 months ago
- a curated list of the role of small models in the LLM era☆72Updated last month
- ☆20Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 7 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆83Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆89Updated last week
- ☆40Updated 5 months ago
- Tokun to can tokens☆15Updated last month
- Collection of Open Source Speech Data☆143Updated this week
- A repository containing the code for translating popular LLM benchmarks to German.☆23Updated last year
- ☆20Updated this week
- Experiments for efforts to train a new and improved t5☆76Updated 6 months ago