@codertimo the BERT positional embedding method is simply to learn an embedding for each position. So you can use nn.Embedding with a constant input sequence [0, 1, 2, ..., L-1], where L is the maximum sequence length.
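For illustration, here is a minimal sketch of what that could look like in PyTorch; the module name `LearnedPositionalEmbedding` and the parameters `max_len` / `d_model` are just placeholders for this example, not names from this repo:

```python
import torch
import torch.nn as nn

class LearnedPositionalEmbedding(nn.Module):
    """BERT-style learned positional embeddings (illustrative sketch)."""

    def __init__(self, max_len: int, d_model: int):
        super().__init__()
        # One learnable vector per position, trained jointly with the rest of the model.
        self.pos_embedding = nn.Embedding(max_len, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch_size, seq_len, d_model) token embeddings
        seq_len = x.size(1)
        # Constant position ids [0, 1, ..., seq_len - 1], broadcast over the batch.
        positions = torch.arange(seq_len, device=x.device).unsqueeze(0)  # (1, seq_len)
        return x + self.pos_embedding(positions)
```

This just replaces the fixed sinusoidal encoding of the original Transformer with a trainable lookup table, which is the difference being discussed here.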
@codertimo
Since BERT uses learned positional embeddings, and this is one of the biggest differences between the original Transformer and BERT, I think it is quite urgent to modify the positional embedding part.
The position embedding in BERT is not the same as in the original Transformer. Why not use the form from BERT?