Attention Is All You Need

Reviewed Nov 3, 2025 · 8m 20s · 2 checks run

3 desk-reject risks

25 reviewer flags

5 polish

Language quality: A-

All findings

Desk-reject risk

3

#1

No keywords were provided in the document.

Keywords

#2

No corresponding author was found.

Authors

#3

The table 'tab:parsing-results' must be cited in the text.

Results > English Constituency Parsing

Reviewer will likely flag

25

#1

A personal email address (@gmail.com) is used.

Authors

#2

The institutional affiliation for Aidan N.

Authors

#3

The institutional affiliation for several authors (Niki Parmar, Jakob Uszkoreit, Llion Jones, Illia Polosukhin…

Authors

#4

The institutional affiliation 'Google Brain' is incomplete for multiple authors (Ashish Vaswani, Noam Shazeer,…

Authors

#5

The acronym 'Transformer' is defined multiple times.

Conclusion

#6

The 'Background' section appears before the 'Introduction'.

Background

#7

The 'Model Architecture' section details the model's components, including attention mechanisms.

Model Architecture

#8

The 'Training' section is placed after 'Why Self-Attention'.

Training

#9

The 'Attention Visualizations' section is currently a top-level section with no content and appears after the…

Attention Visualizations

#10

Added a figure number (Fig.

Attention Visualizations

#11

Added missing essential information: sample size (n=X) for statistical data.

Attention Visualizations

#12

The figure 'fig:model-arch' needs to be cited in the text.

Model Architecture

#13

The table 'tab:op_complexities' should be cited in the text.

Why Self-Attention

#14

The figure 'fig:multi-head-att' must be cited in the text.

Multi-Head Attention

#15

The table 'tab:wmt-results' should be cited in the text.

Results > Machine Translation

#16

The table 'tab:variations' should be cited in the text.

Results > Model Variations

#17

Missing article 'the' before 'fact'.

Encoder and Decoder Stacks

#18

Changed 'structure' to 'structures' to agree with the plural subjects 'syntactic and semantic'.

Why Self-Attention

#19

The text mentions 'section~\ref{sec:reg}' and 'Section 22' separately.

English Constituency Parsing

#20

Removed redundant word 'from'.

English Constituency Parsing

#21

Corrected subject-verb agreement and phrasing: 'is another research goals of ours' to 'is another of our resea…

Conclusion

#22

No funding statement was found.

Funding Statement

#23

The title should be more descriptive.

Title

#24

Define the acronym WMT upon first use.

Abstract

#25

Define the acronym 'WMT' upon first use.

Abstract

Polish

5

#1

The acronym 'ConvS2S' is undefined and used only 2 times.

Background

#2

The acronym 'ConvS2S' is undefined and used only 2 times.

Background

#3

The acronym 'ReLU' is undefined and used only 1 time.

Position-wise Feed-Forward Networks

#4

The acronym 'Adam' is undefined and used only 1 time.

Optimizer

#5

Added figure number (Fig.

Attention Visualizations
