Conversational AI Design and Practice
GitHubAuthor
  • Preface
    • Syllabus
    • Schedule
  • 0. Getting Started
    • 0.1. Environment Setup
    • 0.2. Quiz
  • 1. Exploration
    • 1.1. Overview
    • 1.2. Project Ideas
    • 1.3. Quiz
  • 2. Dialogue Graph
    • 2.1. Emora STDM
    • 2.2. State Transition
    • 2.3. Matching Strategy
    • 2.4. Multi-turn Dialogue
    • 2.5. Quiz
  • 3. Contextual Understanding
    • 3.1. Natex
    • 3.2. Ontology
    • 3.4. Regular Expression
    • 3.5. Macro
    • 3.5. Quiz
  • 4. Interaction Design
    • 4.1. State Referencing
    • 4.2. Advanced Interaction
    • 4.3. Compound States
    • 4.4. Global Transition
    • 4.5. Saving and Loading
    • 4.6. Quiz
  • 5. LM-based Matching
    • 5.1. Language Models
    • 5.2. Quickstart with GPT
    • 5.3. Information Extraction
    • 5.4. Quiz
  • 6. Conversational Analysis
    • 6.1. H2H vs. H2M
    • 6.2. Team Evaluation
    • 6.3. Quiz
  • Project
    • Projects
    • Proposal Guidelines
    • Final Report Guidelines
  • Supplements
    • LINC Course
    • Page 1
Powered by GitBook

©2023 Emory University - All rights reserved

On this page
  • Components
  • Genre
  • Application
  • Intelligence
  • Properties
  • Unit
  • Context
  • Scopes
  • Task-oriented
  • Open-domain
  • Techniques
  • State Machine
  • End-to-End
  • Assessments

Was this helpful?

Export as PDF
  1. 1. Exploration

1.1. Overview

Explain components, properties, scopes, techniques, and assessments of dialogue systems.

Components

Genre

  • Conversation: interactive communication between two or more people.

  • Dialogue: a conversation, often between two people, with a specific goal in mind.

Dialog: a window that appears on a screen in computing contexts (e.g., dialog box).

Application

  • Dialogue System: a computer system that interacts with humans in natural language.

  • Conversational Agent: a dialogue system that interprets and responds to user statements.

  • Virtual Assistant: a dialogue system that performs tasks or services for user requests.

  • Chatbot: a dialogue system that simulates and processes human conversation.

Chatbots are typically understood to follow pre-defined dialogue flows for open-domain conversations without using sophisticated artificial intelligence technology.

Intelligence

  • Dialogue Management: a process of controlling the state and flow of the dialogue to conduct contextual communications.

  • Conversational AI: a type of Artificial Intelligence (AI) for a dialogue system to understand user inputs and respond properly to them, often processed by machine learning models.

  1. What are examples of dialogue systems currently used in practical applications?

  2. Are there applications that would greatly benefit from adopting dialogue systems?

Properties

Unit

  • Turn: a single contribution from one speaker to the dialogue.

  • Utterance: a natural unit of speech bounded by breaths or pauses.

For a text-based conversation, each turn is often considered an utterance.

Context

  • Intent: the user's goal expressed by an utterance within the context of a conversation (e.g., making an appointment, requesting information).

  • Topic: the matter dealt with in an utterance (e.g., movie, family, midterm).

It is possible that one utterance expresses multiple speech acts and intents and also deals with various topics.

Ross: Hi.

Joey: This guy says hello, I wanna kill myself.

Monica: Are you okay, sweetie?

Ross: I just feel like someone reached down my throat, grabbed my small intestine, pulled it out of my mouth and tied it around my neck...

Chandler: Cookie?

Monica: Carol moved her stuff out today.

Joey: Ohh.

Monica: Let me get you some coffee.

Ross: Thanks.

Scopes

Task-oriented

Task-oriented dialogue systems have specific tasks to be accomplished:

Open-domain

Open-domain dialogue systems aim to talk about any topics without specific end goals:

  1. What kind of tasks are presented in the above task-oriented datasets?

  2. Try the demos of BlenderBot and ChatGPT. What are their limitations?

  3. What are the challenges in building task-oriented vs. open-domain dialogue systems?

Techniques

State Machine

A dialogue flow can be designed into a fine-state machine. Most commercial dialogue systems take this approach because it gives greater control over how the systems behave. Several platforms are available to facilitate the development of state machine-based dialogue systems:

End-to-End

Recent researches focus on developing end-to-end dialogue systems using sequence-to-sequence (S2S) models, which is a type of encoder-decoder model:

The current state-of-the-art S2S models use transformers such as BERT as their encoders:

Assessments

The primary objective of both task-oriented and open-domain dialogue systems is to satisfy users by communicating with them. For task-oriented, users are generally satisfied if the tasks are accomplished efficiently. For open-domain, however, user satisfaction is often highly subjective, so proper conversational analysis may need to be involved.

Previous1. ExplorationNext1.2. Project Ideas

Last updated 2 years ago

Was this helpful?

Speech Act: the action, either explicitly or implicitly, expressed by an utterance (e.g., answering, advising, greeting; see ).

Classify each of the following utterances from Friends S1E1 using the dialogue acts:

, Henderson et al., SIGDIAL, 2014 ().

, Wen et al., EMNLP 2016 ().

, Bordes et al., ICLR, 2017 ().

, Eric et al., SIGDIAL, 2017 ().

, Budzianowski et al., EMNLP, 2018 ().

, Qin et al., EMNLP, 2019 ().

, Rastogi et al., AAAI, 2020 ().

()

()

(; requires login)

(; )

, Sutskever et al., NeurIPS, 2014.

, Vaswani et al., NeurIPS, 2017.

, Devlin et al., NAACL, 2019.

Three of the above, Meta BlenderBot, OpenAI ChatGPT, and Google LaMDA, are end-to-end systems based on S2S models.

Implementing an end-to-end system is beyond the scope of this course. Thus, we will use the state machine approach to develop dialogue systems, starting from .

, Finch and Choi, SIGDIAL, 2020.

, Mehri et al., arXiv, 2022.

, Finch et al., arXiv, 2022.

Switchboard Dialog Act Corpus
http://compprag.christopherpotts.net/swda.html
The Second Dialog State Tracking Challenge
dataset
Conditional Generation and Snapshot Learning in Neural Dialogue Systems
dataset
Learning End-to-End Goal-Oriented Dialog
dataset
Key-Value Retrieval Networks for Task-Oriented Dialogue
dataset
MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
dataset
Entity-Consistent End-to-end Task-Oriented Dialogue System with KB Retriever
dataset
Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset
dataset
Alexa Prize Socialbot Grand Challenge
Emora demo
Meta BlenderBot
demo
OpenAI ChatGPT
demo
Google LaMDA
article
interview
Amazon Lex
Google Dialogflow
IBM Watson Assistant
Microsoft Azure Bot Service
Emora STDM
Sequence to Sequence Learning with Neural Networks
Attention is All you Need
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Chapter 2
Towards Unified Dialogue System Evaluation: A Comprehensive Analysis of Current Evaluation Protocols
Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems
open-domain dialogue systems