Model outputs

All models have outputs that are instances of subclasses of ModelOutput. Those are data structures containing all the information returned by the model, but that can also be used as tuples or dictionaries.

Let’s see how this looks in an example:

Copied

from transformers import BertTokenizer, BertForSequenceClassification
import torch

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")

inputs = tokenizer("Hello, my dog is cute", return_tensors="pt")
labels = torch.tensor([1]).unsqueeze(0)  # Batch size 1
outputs = model(**inputs, labels=labels)

The outputs object is a SequenceClassifierOutput, as we can see in the documentation of that class below, it means it has an optional loss, a logits, an optional hidden_states and an optional attentions attribute. Here we have the loss since we passed along labels, but we don’t have hidden_states and attentions because we didn’t pass output_hidden_states=True or output_attentions=True.

When passing output_hidden_states=True you may expect the outputs.hidden_states[-1] to match outputs.last_hidden_states exactly. However, this is not always the case. Some models apply normalization or subsequent process to the last hidden state when it’s returned.

You can access each attribute as you would usually do, and if that attribute has not been returned by the model, you will get None. Here for instance outputs.loss is the loss computed by the model, and outputs.attentions is None.

When considering our outputs object as tuple, it only considers the attributes that don’t have None values. Here for instance, it has two elements, loss then logits, so

Copied

outputs[:2]

will return the tuple (outputs.loss, outputs.logits) for instance.

When considering our outputs object as dictionary, it only considers the attributes that don’t have None values. Here for instance, it has two keys that are loss and logits.

We document here the generic model outputs that are used by more than one model type. Specific output types are documented on their corresponding model page.

Model outputs

ModelOutput

class transformers.utils.ModelOutput

BaseModelOutput

class transformers.modeling_outputs.BaseModelOutput

BaseModelOutputWithPooling

class transformers.modeling_outputs.BaseModelOutputWithPooling

BaseModelOutputWithCrossAttentions

class transformers.modeling_outputs.BaseModelOutputWithCrossAttentions

BaseModelOutputWithCrossAttentions

class transformers.modeling_outputs.BaseModelOutputWithCrossAttentions

BaseModelOutputWithPoolingAndCrossAttentions

class transformers.modeling_outputs.BaseModelOutputWithPoolingAndCrossAttentions

BaseModelOutputWithPast

class transformers.modeling_outputs.BaseModelOutputWithPast

BaseModelOutputWithPastAndCrossAttentions

class transformers.modeling_outputs.BaseModelOutputWithPastAndCrossAttentions

Seq2SeqModelOutput

class transformers.modeling_outputs.Seq2SeqModelOutput

CausalLMOutput

class transformers.modeling_outputs.CausalLMOutput

CausalLMOutputWithCrossAttentions

class transformers.modeling_outputs.CausalLMOutputWithCrossAttentions

CausalLMOutputWithPast

class transformers.modeling_outputs.CausalLMOutputWithPast

MaskedLMOutput

class transformers.modeling_outputs.MaskedLMOutput

Seq2SeqLMOutput

class transformers.modeling_outputs.Seq2SeqLMOutput

NextSentencePredictorOutput

class transformers.modeling_outputs.NextSentencePredictorOutput

SequenceClassifierOutput

class transformers.modeling_outputs.SequenceClassifierOutput

Seq2SeqSequenceClassifierOutput

class transformers.modeling_outputs.Seq2SeqSequenceClassifierOutput

MultipleChoiceModelOutput

class transformers.modeling_outputs.MultipleChoiceModelOutput

TokenClassifierOutput

class transformers.modeling_outputs.TokenClassifierOutput

QuestionAnsweringModelOutput

class transformers.modeling_outputs.QuestionAnsweringModelOutput

Seq2SeqQuestionAnsweringModelOutput

class transformers.modeling_outputs.Seq2SeqQuestionAnsweringModelOutput

Seq2SeqSpectrogramOutput

class transformers.modeling_outputs.Seq2SeqSpectrogramOutput

SemanticSegmenterOutput

class transformers.modeling_outputs.SemanticSegmenterOutput

ImageClassifierOutput

class transformers.modeling_outputs.ImageClassifierOutput

ImageClassifierOutputWithNoAttention

class transformers.modeling_outputs.ImageClassifierOutputWithNoAttention

DepthEstimatorOutput

class transformers.modeling_outputs.DepthEstimatorOutput

Wav2Vec2BaseModelOutput

class transformers.modeling_outputs.Wav2Vec2BaseModelOutput

XVectorOutput

class transformers.modeling_outputs.XVectorOutput

Seq2SeqTSModelOutput

class transformers.modeling_outputs.Seq2SeqTSModelOutput

Seq2SeqTSPredictionOutput

class transformers.modeling_outputs.Seq2SeqTSPredictionOutput

SampleTSPredictionOutput

class transformers.modeling_outputs.SampleTSPredictionOutput

TFBaseModelOutput

class transformers.modeling_tf_outputs.TFBaseModelOutput

TFBaseModelOutputWithPooling

class transformers.modeling_tf_outputs.TFBaseModelOutputWithPooling

TFBaseModelOutputWithPoolingAndCrossAttentions

class transformers.modeling_tf_outputs.TFBaseModelOutputWithPoolingAndCrossAttentions

TFBaseModelOutputWithPast

class transformers.modeling_tf_outputs.TFBaseModelOutputWithPast

TFBaseModelOutputWithPastAndCrossAttentions

class transformers.modeling_tf_outputs.TFBaseModelOutputWithPastAndCrossAttentions

TFSeq2SeqModelOutput

class transformers.modeling_tf_outputs.TFSeq2SeqModelOutput

TFCausalLMOutput

class transformers.modeling_tf_outputs.TFCausalLMOutput

TFCausalLMOutputWithCrossAttentions

class transformers.modeling_tf_outputs.TFCausalLMOutputWithCrossAttentions

TFCausalLMOutputWithPast