This dataset correspond to the results presented in Computer Speech and Language article Dialogue manager domain adaptation using Gaussian process reinforcement learning and relates to Figure 7. Two contrasts were presented: Prior and NoPrior. NoPrior[1,2,3] is the data obtained in interaction with Amazon MTurk while training three policies for SFR domain. Prior[1,2,3] is the data obtained while training policy for SFR domain that uses a generic policy as a prior. In each directory there is a call directory with a time stamp in the name which contains session.xml file with the dialogue log and feedback.xml file with the user feedback. Figure 8 is obtained using data previously published at https://www.repository.cam.ac.uk/handle/1810/251169 and Figure 9 is obtained using data previously published at https://www.repository.cam.ac.uk/handle/1810/252636 . This data is released under a Creative Commons CC-BY licence (see https://creativecommons.org/licenses/by/4.0/)
Dialogue manager domain adaptation using Gaussian process reinforcement learning
Milica Gasic,N. Mrksic,L. Rojas-Barahona,Pei-hao Su,Stefan Ultes,David Vandyke,Tsung-Hsien Wen,S. Young
Published 2016 in Computer Speech and Language
ABSTRACT
PUBLICATION RECORD
- Publication year
2016
- Venue
Computer Speech and Language
- Publication date
2016-09-05
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-28 of 28 references · Page 1 of 1
CITED BY
Showing 1-41 of 41 citing papers · Page 1 of 1