目录
flags
data_dir
The input data dir. Should contain the .tsv files (or other data files) for the task.
bert_config_file
The config json file corresponding to the pre-trained BERT model. This specifies the model architecture.
task_name
The name of the task to train.
vocab_file
The vocabulary file that the BERT model was trained on.
output_dir
The output directory where the model checkpoints will be written.
init_checkpoint
Initial checkpoint (usually from a pre-trained BERT model).
do_lower_case
Whether to lower case the input text. Should be True for uncased models and False for cased models.
max_seq_length
The maximum total input sequence length after WordPiece tokenization. Sequences longer than this will be truncated, and sequences shorter than this will be padded.