目录

flags

data_dir

The input data dir. Should contain the .tsv files (or other data files) for the task.

bert_config_file

The config json file corresponding to the pre-trained BERT model. This specifies the model architecture.

task_name

The name of the task to train.

vocab_file

The vocabulary file that the BERT model was trained on.

output_dir

The output directory where the model checkpoints will be written.

init_checkpoint

Initial checkpoint (usually from a pre-trained BERT model).

do_lower_case

Whether to lower case the input text. Should be True for uncased models and False for cased models.

max_seq_length

The maximum total input sequence length after WordPiece tokenization. Sequences longer than this will be truncated, and sequences shorter than this will be padded.