idiap / dialog2flow
Dialog2Flow: convert your dialogs to flows. This repository accompanies the paper "Dialog2Flow: Pre-training Soft-Contrastive Sentence Embeddings for Automatic Dialog Flow Extraction", accepted to the EMNLP 2024 main conference.
☆14 · Updated last week
Alternatives and similar repositories for dialog2flow
Users interested in dialog2flow are comparing it to the repositories listed below.
- ☆10 · Updated 8 months ago
- A collection of various LLM sampling methods implemented in pure PyTorch ☆24 · Updated 5 months ago
- Entailment self-training ☆25 · Updated last year
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆36 · Updated 2 months ago
- Code for the paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 · Updated 3 years ago
- Implementations of growing and pruning in neural networks ☆22 · Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆24 · Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆66 · Updated 7 months ago
- Official code repository for the paper "Key-value memory in the brain" ☆25 · Updated 2 months ago
- ☆32 · Updated last year
- ☆10 · Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆26 · Updated 6 months ago
- User-friendly implementation of Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou… ☆13 · Updated 2 weeks ago
- Official code for the paper "Attention as a Hypernetwork" ☆33 · Updated 10 months ago
- Code for "Deep invariant networks with differentiable augmentation layers" ☆18 · Updated 2 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆51 · Updated last month
- https://footprints.baulab.info ☆17 · Updated 7 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆29 · Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆56 · Updated 8 months ago
- We study toy models of skill learning. ☆26 · Updated 3 months ago
- Code and pretrained models for the paper "MatMamba: A Matryoshka State Space Model" ☆59 · Updated 5 months ago
- Official project for our paper "Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers" ☆30 · Updated last year
- Goldfish: Monolingual language models for 350 languages. ☆17 · Updated 8 months ago
- Personal implementation of ASIF by Antonio Norelli ☆25 · Updated 11 months ago
- Code for "Emergent Analogical Reasoning in Large Language Models" ☆49 · Updated last year
- Official code for MIMETIC^2 ☆12 · Updated 5 months ago
- Hrrformer: A Neuro-symbolic Self-attention Model (ICML 2023) ☆54 · Updated last year
- My explorations into editing the knowledge and memories of an attention network ☆34 · Updated 2 years ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers ☆63 · Updated last year
- ☆17 · Updated last year