Transformers documentation
FLAN-T5
Get started
Base classes
Inference
Training
Quantization
Export to production
Resources
Contribute
API
Main Classes
Models
Text models
ALBERTArceeBambaBARTBARThezBARTphoBERTBertGenerationBertJapaneseBERTweetBigBirdBigBirdPegasusBioGptBitNetBlenderbotBlenderbot SmallBLOOMBORTByT5CamemBERTCANINECodeGenCodeLlamaCohereCohere2ConvBERTCPMCPMANTCTRLDBRXDeBERTaDeBERTa-v2DeepSeek-V3DialoGPTDiffLlamaDistilBERTdots1DPRELECTRAEncoder Decoder ModelsERNIEErnieMESMFalconFalcon3FalconH1FalconMambaFLAN-T5FLAN-UL2FlauBERTFNetFSMTFunnel TransformerFuyuGemmaGemma2GLMglm4GPTGPT NeoGPT NeoXGPT NeoX JapaneseGPT-JGPT2GPTBigCodeGPTSAN JapaneseGPTSw3GraniteGraniteMoeGraniteMoeHybridGraniteMoeSharedHeliumHerBERTHGNet-V2I-BERTJambaJetMoeJukeboxLEDLLaMALlama2Llama3LongformerLongT5LUKEM2M100MADLAD-400MambaMamba2MarianMTMarkupLMMBart and MBart-50MEGAMegatronBERTMegatronGPT2MiniMaxMistralMixtralmLUKEMobileBERTModernBertMPNetMPTMRAMT5MVPmyt5NemotronNEZHANLLBNLLB-MoENyströmformerOLMoOLMo2OLMoEOpen-LlamaOPTPegasusPEGASUS-XPersimmonPhiPhi-3PhiMoEPhoBERTPLBartProphetNetQDQBertQwen2Qwen2MoEQwen3Qwen3MoERAGREALMRecurrentGemmaReformerRemBERTRetriBERTRoBERTaRoBERTa-PreLayerNormRoCBertRoFormerRWKVSplinterSqueezeBERTStableLmStarcoder2SwitchTransformersT5T5GemmaT5v1.1TAPEXTransformer XLUL2UMT5X-MODXGLMXLMXLM-ProphetNetXLM-RoBERTaXLM-RoBERTa-XLXLM-VXLNetYOSOZambaZamba2
Vision models
Audio models
Video models
Multimodal models
Reinforcement learning models
Time series models
Graph models
Internal helpers
Reference
You are viewing v4.53.3 version. A newer version v5.8.1 is available.
FLAN-T5
Overview
FLAN-T5 was released in the paper Scaling Instruction-Finetuned Language Models - it is an enhanced version of T5 that has been finetuned in a mixture of tasks.
One can directly use FLAN-T5 weights without finetuning the model:
>>> from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
>>> model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")
>>> tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-small")
>>> inputs = tokenizer("A step by step recipe to make bolognese pasta:", return_tensors="pt")
>>> outputs = model.generate(**inputs)
>>> print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
['Pour a cup of bolognese into a large bowl and add the pasta']FLAN-T5 includes the same improvements as T5 version 1.1 (see here for the full details of the model’s improvements.)
Google has released the following variants:
The original checkpoints can be found here.
Refer to T5’s documentation page for all API reference, code examples and notebooks. For more details regarding training and evaluation of the FLAN-T5, refer to the model card.