Click on the buttons to select which experiments will be included in the table (each experiment is one row); you can filter by task, encoder type, decoder type, attention type, learning rate, and hidden size.
Select which columns to include in the table
First, aspects of the experiment:
Second, full-sentence metrics of accuracy: Test full and Gen full are full-sentence accuracy on the test and generalization set, while POS indicates accuracy on getting the correct sequence of parts-of-speech
Second, first-word metrics of accuracy: Gen main is the percent of generalization set outputs that started with the main auxiliary; Gen first is the percent that started with the first auxiliary; Gen other aux is the percent that started with some other auxiliary; and Gen other is the percent that started with something that wasn't an auxiliary.
Generalization set first-word accuracy where the relative clause on the subject was of a particular type
Percent of outputs that belonged to a particular form along the lines of Crain and Nakayama
Example outputs given by the trained model, whether on the test set or generalization set; and the command that can be used to re-run this experiment.