Skip to main content
Cornell University
We gratefully acknowledge support from
the Simons Foundation and member institutions.
arxiv logo > cs > arXiv:1907.10529

Help | Advanced Search

Computer Science > Computation and Language

(cs)
[Submitted on 24 Jul 2019 (v1), last revised 18 Jan 2020 (this version, v3)]

Title:SpanBERT: Improving Pre-training by Representing and Predicting Spans

Authors:Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, Omer Levy
Download PDF
Abstract: We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random tokens, and (2) training the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it. SpanBERT consistently outperforms BERT and our better-tuned baselines, with substantial gains on span selection tasks such as question answering and coreference resolution. In particular, with the same training data and model size as BERT-large, our single model obtains 94.6% and 88.7% F1 on SQuAD 1.1 and 2.0, respectively. We also achieve a new state of the art on the OntoNotes coreference resolution task (79.6\% F1), strong performance on the TACRED relation extraction benchmark, and even show gains on GLUE.
Comments: Accepted at TACL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:1907.10529 [cs.CL]
  (or arXiv:1907.10529v3 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.1907.10529
arXiv-issued DOI via DataCite

Submission history

From: Mandar Joshi [view email]
[v1] Wed, 24 Jul 2019 15:43:40 UTC (1,623 KB)
[v2] Wed, 31 Jul 2019 17:13:57 UTC (1,623 KB)
[v3] Sat, 18 Jan 2020 03:53:04 UTC (999 KB)
Full-text links:

Download:

  • PDF
  • Other formats
(license)
Current browse context:
cs.CL
< prev   |   next >
new | recent | 1907
Change to browse by:
cs
cs.LG

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar

1 blog link

(what is this?)

DBLP - CS Bibliography

listing | bibtex
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
…
a export bibtex citation Loading...

Bookmark

BibSonomy logo Mendeley logo Reddit logo ScienceWISE logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack