AAAI/ACM conference on Ethics and Safety (in submission), 2017
For the bias model we use the pretrained model provided by Hutto et al. For the HRED and VHRED models, we use the Twitter set as in Lowe et al. and Ritter et al.. Similarly, we use the same training methodology as in Lowe et al. for training the VHRED and HRED models. When sampling, we use a beam search of 5 beams for one experiment and random stochastic sampling for the other (all samples shown below).
Detailed statistics for bias detection (including min/max bias scale samples, etc.) can be found here:
1000 sampled evaluations of the bias model for all datasets can be found:
Similarly, for HRED and VHRED bias statistics can be found here:
And HRED and VHRED 1000 sampled evaluations can be found here: