1 2 3 4 5 6 7
_target_: text_recognizer.models.transformer.TransformerLitModel interval: step monitor: val/loss max_output_len: 451 start_token: <s> end_token: <e> pad_token: <p>