Blenderbot Small

Note that BlenderbotSmallModel and BlenderbotSmallForConditionalGeneration are only used in combination with the checkpoint facebook/blenderbot-90M. Larger Blenderbot checkpoints should instead be used with BlenderbotModel and BlenderbotForConditionalGeneration

Overview

The Blender chatbot model was proposed in Recipes for building an open-domain chatbot Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric M. Smith, Y-Lan Boureau, Jason Weston on 30 Apr 2020.

The abstract of the paper is the following:

Building open-domain chatbots is a challenging area for machine learning research. While prior work has shown that scaling neural models in the number of parameters and the size of the data they are trained on gives improved results, we show that other ingredients are important for a high-performing chatbot. Good conversation requires a number of skills that an expert conversationalist blends in a seamless way: providing engaging talking points and listening to their partners, and displaying knowledge, empathy and personality appropriately, while maintaining a consistent persona. We show that large scale models can learn these skills when given appropriate training data and choice of generation strategy. We build variants of these recipes with 90M, 2.7B and 9.4B parameter models, and make our models and code publicly available. Human evaluations show our best models are superior to existing approaches in multi-turn dialogue in terms of engagingness and humanness measurements. We then discuss the limitations of this work by analyzing failure cases of our models.

Tips:

Blenderbot Small is a model with absolute position embeddings so it’s usually advised to pad the inputs on the right rather than the left.

This model was contributed by patrickvonplaten. The authors’ code can be found here.

hashtagBlenderbot Small

hashtagOverview

hashtagDocumentation resources

hashtagBlenderbotSmallConfig

hashtagclass transformers.BlenderbotSmallConfig

hashtagBlenderbotSmallTokenizer

hashtagclass transformers.BlenderbotSmallTokenizer

hashtagBlenderbotSmallTokenizerFast

hashtagclass transformers.BlenderbotSmallTokenizerFast

hashtagBlenderbotSmallModel

hashtagclass transformers.BlenderbotSmallModel

hashtagBlenderbotSmallForConditionalGeneration

hashtagclass transformers.BlenderbotSmallForConditionalGeneration

hashtagBlenderbotSmallForCausalLM

hashtagclass transformers.BlenderbotSmallForCausalLM

hashtagTFBlenderbotSmallModel

hashtagclass transformers.TFBlenderbotSmallModel

hashtagTFBlenderbotSmallForConditionalGeneration

hashtagclass transformers.TFBlenderbotSmallForConditionalGeneration

hashtagFlaxBlenderbotSmallModel

hashtagclass transformers.FlaxBlenderbotSmallModel

hashtagFlaxBlenderbotForConditionalGeneration

hashtagclass transformers.FlaxBlenderbotSmallForConditionalGeneration

Blenderbot Small

Overview

Documentation resources

BlenderbotSmallConfig

class transformers.BlenderbotSmallConfig

BlenderbotSmallTokenizer

class transformers.BlenderbotSmallTokenizer

BlenderbotSmallTokenizerFast

class transformers.BlenderbotSmallTokenizerFast

BlenderbotSmallModel

class transformers.BlenderbotSmallModel

BlenderbotSmallForConditionalGeneration

class transformers.BlenderbotSmallForConditionalGeneration

BlenderbotSmallForCausalLM

class transformers.BlenderbotSmallForCausalLM

TFBlenderbotSmallModel

class transformers.TFBlenderbotSmallModel

TFBlenderbotSmallForConditionalGeneration

class transformers.TFBlenderbotSmallForConditionalGeneration

FlaxBlenderbotSmallModel

class transformers.FlaxBlenderbotSmallModel

FlaxBlenderbotForConditionalGeneration

class transformers.FlaxBlenderbotSmallForConditionalGeneration