CONTENTS

Introduction

BERT: Bidirectional Encoder Representations from Transformers.

The authors propose to extract context-sensitive features from a pre-trained language model.

Two existing strategies for applying pre-trained language representations to downstream tasks:

Feature-based Approaches

Fine-tuning Approaches
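The difference between the two strategies comes down to which parameters are updated on the downstream task: feature-based approaches freeze the pre-trained encoder and train only a task-specific model on top of its output features, while fine-tuning approaches update the encoder's weights as well. A minimal sketch, using a toy one-weight "encoder" as a stand-in for a large pre-trained model like BERT (all names and values here are illustrative assumptions, not the paper's setup):

```python
# Toy "pretrained encoder": a single weight mapping input -> feature.
# A stand-in for a large pre-trained model; not BERT itself.
PRETRAINED_ENC_W = 0.5

# Tiny regression task: target y = 2 * x.
data = [(x, 2.0 * x) for x in [1.0, 2.0, 3.0]]

def train(fine_tune, steps=200, lr=0.01):
    enc_w = PRETRAINED_ENC_W  # encoder weight, pre-trained
    clf_w = 0.0               # task head, trained from scratch
    for _ in range(steps):
        for x, y in data:
            feature = enc_w * x          # "contextual feature" extraction
            pred = clf_w * feature
            err = pred - y               # gradient of 0.5 * err**2
            clf_w -= lr * err * feature  # task head always updated
            if fine_tune:
                # Fine-tuning: gradients flow into the encoder too.
                enc_w -= lr * err * (clf_w * x)
            # Feature-based: enc_w stays frozen at its pre-trained value.
    return enc_w, clf_w

enc_feat, clf_feat = train(fine_tune=False)  # feature-based
enc_ft, clf_ft = train(fine_tune=True)       # fine-tuning
```

In the feature-based run `enc_feat` remains at its pre-trained value and only the head adapts; in the fine-tuning run both weights move, and together they fit the task. The same distinction holds for real encoders, just over millions of parameters instead of one.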