
Loading config from:

/dccstor/jhlau1/workspace/ibm_repo/sonnet/tmp/output_test2_rm5_wmin3_sd3_bat32_kp0.7_eph30_grd5_wdim100_lmedim200_lmddim600_lmdlayer1_lmadim25_lmlr0.2_cdim150_pmedim50_pmddim200_pmadim50_pmlr1.0E-03_loss1.0-1.0-0.7_sm1.00_rmdim100_rmn5_rmd0.5_rmlr1.0E-03_son2-14/config.py

Loading word embedding model...

First pass to collect word and character vocabulary...

Word type size = 8398

Char type size = 77

Loading train and valid data...

Train statistics:
Number of documents = 2685
Number of rhyme examples = 32220
Total number of word tokens = 367281
Mean/min/max words per line = 9.77/5/16
Total number of char tokens = 1552659
Mean/min/max chars per line = 41.31/25/59

Valid statistics:
Number of documents = 335
Number of rhyme examples = 4020
Total number of word tokens = 45922
Mean/min/max words per line = 9.79/6/16
Total number of char tokens = 193929
Mean/min/max chars per line = 41.35/28/58

Test statistics:
Number of documents = 335
Number of rhyme examples = 4020
Total number of word tokens = 45904
Mean/min/max words per line = 9.79/6/15
Total number of char tokens = 193851
Mean/min/max chars per line = 41.33/24/57
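
The per-line means in the statistics above can be cross-checked against the logged totals. The following is a minimal sketch, assuming every document has 14 lines (the log does not state the line count; this is an assumption based on the sonnet setting suggested by the config path). The names `LINES_PER_DOC`, `splits` and `per_line_means` are hypothetical and used only for illustration.

```python
# Cross-check the logged per-line means from the logged totals.
# Assumption (not stated in the log): every document has 14 lines.
LINES_PER_DOC = 14

# (documents, word tokens, char tokens) copied from the statistics above.
splits = {
    "train": (2685, 367281, 1552659),
    "valid": (335, 45922, 193929),
    "test": (335, 45904, 193851),
}


def per_line_means(docs, words, chars, lines_per_doc=LINES_PER_DOC):
    """Return (mean words per line, mean chars per line), rounded to 2 places."""
    n_lines = docs * lines_per_doc
    return round(words / n_lines, 2), round(chars / n_lines, 2)


for name, (docs, words, chars) in splits.items():
    print(name, per_line_means(docs, words, chars))
```

Under that assumption the computed means agree with the logged ones for all three splits. The rhyme example counts also divide evenly: 32220 / 2685 and 4020 / 335 both give exactly 12 examples per document.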

Epoch = 1
TRAIN 2761/2761: lm ppl = 538.3; pm loss = 7.62; rm loss = 0.27; batch/sec = 9.9
VALID 341/341: lm ppl = 186.1; pm loss = 4.72; rm loss = 0.21; batch/sec = 23.0
Stress acc [0] = 0.671 (23219)
Stress acc [1] = 0.615 (11572)
Stress acc [2] = 0.663 (643)
Stress acc [3] = 0.653 (35434)
Rhyme P/R/F@0.9 = 0.729 / 0.758 / 0.743
Rhyme P/R/F@0.8 = 0.703 / 0.811 / 0.753
Rhyme P/R/F@0.7 = 0.689 / 0.828 / 0.753
Rhyme P/R/F@0.6 = 0.672 / 0.839 / 0.746
TEST 341/341: lm ppl = 182.4; pm loss = 4.69; rm loss = 0.19; batch/sec = 23.1
Stress acc [0] = 0.673 (23440)
Stress acc [1] = 0.623 (11501)
Stress acc [2] = 0.679 (619)
Stress acc [3] = 0.657 (35560)
Rhyme P/R/F@0.9 = 0.785 / 0.740 / 0.762
Rhyme P/R/F@0.8 = 0.747 / 0.799 / 0.772
Rhyme P/R/F@0.7 = 0.728 / 0.828 / 0.775
Rhyme P/R/F@0.6 = 0.716 / 0.841 / 0.773
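
Two relationships in the evaluation lines above can be checked by hand. The sketch below assumes that F is the usual harmonic mean of precision and recall, and that `Stress acc [3]` is the count-weighted aggregate of rows `[0]` to `[2]`. Both are inferences from the numbers (the three counts sum to the fourth), not something the log states. The helper names are hypothetical.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1)."""
    return 2 * precision * recall / (precision + recall)


def weighted_accuracy(rows):
    """Count-weighted mean of (accuracy, count) pairs."""
    total = sum(count for _, count in rows)
    return sum(acc * count for acc, count in rows) / total


# VALID, epoch 1, threshold 0.9: logged P/R/F = 0.729 / 0.758 / 0.743
print(round(f_score(0.729, 0.758), 3))

# VALID, epoch 1: Stress acc [0..2] with their counts.
# The log reports row [3] as 0.653 (35434).
valid_stress = [(0.671, 23219), (0.615, 11572), (0.663, 643)]
print(sum(count for _, count in valid_stress))
print(round(weighted_accuracy(valid_stress), 3))
```

Because the logged inputs are already rounded to three decimals, a recomputed value may differ from the logged one in the last digit for other epochs.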

Epoch = 2
TRAIN 2761/2761: lm ppl = 176.2; pm loss = 3.61; rm loss = 0.22; batch/sec = 9.9
VALID 341/341: lm ppl = 134.4; pm loss = 2.04; rm loss = 0.20; batch/sec = 23.2
Stress acc [0] = 0.690 (23219)
Stress acc [1] = 0.711 (11572)
Stress acc [2] = 0.739 (643)
Stress acc [3] = 0.698 (35434)
Rhyme P/R/F@0.9 = 0.806 / 0.756 / 0.780
Rhyme P/R/F@0.8 = 0.735 / 0.810 / 0.771
Rhyme P/R/F@0.7 = 0.711 / 0.825 / 0.764
Rhyme P/R/F@0.6 = 0.695 / 0.838 / 0.760
TEST 341/341: lm ppl = 134.1; pm loss = 2.04; rm loss = 0.18; batch/sec = 22.0
Stress acc [0] = 0.696 (23440)
Stress acc [1] = 0.716 (11501)
Stress acc [2] = 0.774 (619)
Stress acc [3] = 0.704 (35560)
Rhyme P/R/F@0.9 = 0.843 / 0.749 / 0.793
Rhyme P/R/F@0.8 = 0.773 / 0.808 / 0.790
Rhyme P/R/F@0.7 = 0.745 / 0.822 / 0.781
Rhyme P/R/F@0.6 = 0.732 / 0.834 / 0.780
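
The TRAIN/VALID/TEST summary lines follow a fixed layout, so they can be pulled into records for plotting or comparison. This is a minimal sketch based only on the line format visible in this log; `SUMMARY_RE` and `parse_summary` are hypothetical names, and the pattern is not taken from the training code.

```python
import re

# Matches lines such as:
#   VALID 341/341: lm ppl = 134.4; pm loss = 2.04; rm loss = 0.20; batch/sec = 23.2
SUMMARY_RE = re.compile(
    r"^(TRAIN|VALID|TEST)\s+(\d+)/(\d+):\s+"
    r"lm ppl = ([\d.]+); pm loss = ([\d.]+); rm loss = ([\d.]+); "
    r"batch/sec = ([\d.]+)$"
)


def parse_summary(line):
    """Parse one per-split summary line into a dict, or return None."""
    m = SUMMARY_RE.match(line.strip())
    if m is None:
        return None
    split, _done, total, ppl, pm, rm, speed = m.groups()
    return {
        "split": split,
        "batches": int(total),
        "lm_ppl": float(ppl),
        "pm_loss": float(pm),
        "rm_loss": float(rm),
        "batch_per_sec": float(speed),
    }


example = "VALID 341/341: lm ppl = 134.4; pm loss = 2.04; rm loss = 0.20; batch/sec = 23.2"
print(parse_summary(example))
```

Lines that do not match, such as `Epoch = 2` or the stress and rhyme rows, return `None` and can be handled by separate patterns.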

Epoch = 3
TRAIN 2761/2761: lm ppl = 136.5; pm loss = 1.93; rm loss = 0.20; batch/sec = 9.9
VALID 341/341: lm ppl = 116.1; pm loss = 1.72; rm loss = 0.16; batch/sec = 23.2
Stress acc [0] = 0.714 (23219)
Stress acc [1] = 0.745 (11572)
Stress acc [2] = 0.809 (643)
Stress acc [3] = 0.726 (35434)
Rhyme P/R/F@0.9 = 0.864 / 0.750 / 0.803
Rhyme P/R/F@0.8 = 0.842 / 0.810 / 0.826
Rhyme P/R/F@0.7 = 0.828 / 0.833 / 0.830
Rhyme P/R/F@0.6 = 0.812 / 0.841 / 0.826
TEST 341/341: lm ppl = 116.2; pm loss = 1.70; rm loss = 0.15; batch/sec = 21.7
Stress acc [0] = 0.717 (23440)
Stress acc [1] = 0.750 (11501)
Stress acc [2] = 0.806 (619)
Stress acc [3] = 0.729 (35560)
Rhyme P/R/F@0.9 = 0.889 / 0.751 / 0.814
Rhyme P/R/F@0.8 = 0.860 / 0.811 / 0.835
Rhyme P/R/F@0.7 = 0.837 / 0.833 / 0.835
Rhyme P/R/F@0.6 = 0.822 / 0.846 / 0.834

Epoch = 4
TRAIN 2761/2761: lm ppl = 115.7; pm loss = 1.81; rm loss = 0.19; batch/sec = 10.0
VALID 341/341: lm ppl = 105.0; pm loss = 1.65; rm loss = 0.14; batch/sec = 22.1
Stress acc [0] = 0.712 (23219)
Stress acc [1] = 0.745 (11572)
Stress acc [2] = 0.809 (643)
Stress acc [3] = 0.724 (35434)
Rhyme P/R/F@0.9 = 0.896 / 0.753 / 0.819
Rhyme P/R/F@0.8 = 0.876 / 0.812 / 0.843
Rhyme P/R/F@0.7 = 0.857 / 0.852 / 0.854
Rhyme P/R/F@0.6 = 0.838 / 0.877 / 0.857
TEST 341/341: lm ppl = 104.9; pm loss = 1.64; rm loss = 0.14; batch/sec = 23.2
Stress acc [0] = 0.714 (23440)
Stress acc [1] = 0.751 (11501)
Stress acc [2] = 0.811 (619)
Stress acc [3] = 0.727 (35560)
Rhyme P/R/F@0.9 = 0.896 / 0.756 / 0.820
Rhyme P/R/F@0.8 = 0.873 / 0.811 / 0.841
Rhyme P/R/F@0.7 = 0.857 / 0.855 / 0.856
Rhyme P/R/F@0.6 = 0.848 / 0.875 / 0.861

Epoch = 5
TRAIN 2761/2761: lm ppl = 101.2; pm loss = 1.68; rm loss = 0.17; batch/sec = 10.0
VALID 341/341: lm ppl = 97.2; pm loss = 1.55; rm loss = 0.13; batch/sec = 22.2
Stress acc [0] = 0.715 (23219)
Stress acc [1] = 0.759 (11572)
Stress acc [2] = 0.802 (643)
Stress acc [3] = 0.731 (35434)
Rhyme P/R/F@0.9 = 0.904 / 0.745 / 0.817
Rhyme P/R/F@0.8 = 0.892 / 0.855 / 0.873
Rhyme P/R/F@0.7 = 0.873 / 0.888 / 0.880
Rhyme P/R/F@0.6 = 0.859 / 0.900 / 0.879
TEST 341/341: lm ppl = 98.5; pm loss = 1.52; rm loss = 0.12; batch/sec = 23.2
Stress acc [0] = 0.717 (23440)
Stress acc [1] = 0.760 (11501)
Stress acc [2] = 0.816 (619)
Stress acc [3] = 0.733 (35560)
Rhyme P/R/F@0.9 = 0.897 / 0.752 / 0.818
Rhyme P/R/F@0.8 = 0.885 / 0.855 / 0.869
Rhyme P/R/F@0.7 = 0.867 / 0.887 / 0.877
Rhyme P/R/F@0.6 = 0.857 / 0.900 / 0.878

Epoch = 6
TRAIN 2761/2761: lm ppl = 90.2; pm loss = 1.59; rm loss = 0.16; batch/sec = 10.0
VALID 341/341: lm ppl = 92.7; pm loss = 1.46; rm loss = 0.11; batch/sec = 22.2
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.767 (11572)
Stress acc [2] = 0.821 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.904 / 0.774 / 0.834
Rhyme P/R/F@0.8 = 0.886 / 0.869 / 0.878
Rhyme P/R/F@0.7 = 0.872 / 0.918 / 0.894
Rhyme P/R/F@0.6 = 0.857 / 0.940 / 0.897
TEST 341/341: lm ppl = 94.5; pm loss = 1.45; rm loss = 0.12; batch/sec = 23.1
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.768 (11501)
Stress acc [2] = 0.834 (619)
Stress acc [3] = 0.741 (35560)
Rhyme P/R/F@0.9 = 0.900 / 0.777 / 0.834
Rhyme P/R/F@0.8 = 0.885 / 0.867 / 0.876
Rhyme P/R/F@0.7 = 0.872 / 0.919 / 0.894
Rhyme P/R/F@0.6 = 0.856 / 0.939 / 0.896

Epoch = 7
TRAIN 2761/2761: lm ppl = 81.4; pm loss = 1.57; rm loss = 0.15; batch/sec = 10.0
VALID 341/341: lm ppl = 89.0; pm loss = 1.46; rm loss = 0.11; batch/sec = 23.2
Stress acc [0] = 0.724 (23219)
Stress acc [1] = 0.775 (11572)
Stress acc [2] = 0.829 (643)
Stress acc [3] = 0.743 (35434)
Rhyme P/R/F@0.9 = 0.901 / 0.782 / 0.837
Rhyme P/R/F@0.8 = 0.890 / 0.907 / 0.898
Rhyme P/R/F@0.7 = 0.877 / 0.935 / 0.905
Rhyme P/R/F@0.6 = 0.865 / 0.948 / 0.905
TEST 341/341: lm ppl = 91.3; pm loss = 1.45; rm loss = 0.12; batch/sec = 23.2
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.776 (11501)
Stress acc [2] = 0.827 (619)
Stress acc [3] = 0.743 (35560)
Rhyme P/R/F@0.9 = 0.900 / 0.778 / 0.835
Rhyme P/R/F@0.8 = 0.878 / 0.907 / 0.892
Rhyme P/R/F@0.7 = 0.871 / 0.935 / 0.902
Rhyme P/R/F@0.6 = 0.857 / 0.944 / 0.898

Epoch = 8
TRAIN 2761/2761: lm ppl = 73.8; pm loss = 1.53; rm loss = 0.15; batch/sec = 10.0
VALID 341/341: lm ppl = 85.1; pm loss = 1.43; rm loss = 0.12; batch/sec = 22.6
Stress acc [0] = 0.722 (23219)
Stress acc [1] = 0.768 (11572)
Stress acc [2] = 0.826 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.898 / 0.801 / 0.847
Rhyme P/R/F@0.8 = 0.887 / 0.911 / 0.898
Rhyme P/R/F@0.7 = 0.873 / 0.941 / 0.906
Rhyme P/R/F@0.6 = 0.868 / 0.946 / 0.905
TEST 341/341: lm ppl = 87.7; pm loss = 1.42; rm loss = 0.12; batch/sec = 21.0
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.772 (11501)
Stress acc [2] = 0.837 (619)
Stress acc [3] = 0.742 (35560)
Rhyme P/R/F@0.9 = 0.898 / 0.800 / 0.846
Rhyme P/R/F@0.8 = 0.881 / 0.910 / 0.895
Rhyme P/R/F@0.7 = 0.868 / 0.936 / 0.901
Rhyme P/R/F@0.6 = 0.860 / 0.945 / 0.901

Epoch = 9
TRAIN 2761/2761: lm ppl = 67.3; pm loss = 1.50; rm loss = 0.15; batch/sec = 9.9
VALID 341/341: lm ppl = 83.2; pm loss = 1.44; rm loss = 0.11; batch/sec = 22.7
Stress acc [0] = 0.719 (23219)
Stress acc [1] = 0.761 (11572)
Stress acc [2] = 0.820 (643)
Stress acc [3] = 0.735 (35434)
Rhyme P/R/F@0.9 = 0.899 / 0.805 / 0.850
Rhyme P/R/F@0.8 = 0.889 / 0.918 / 0.903
Rhyme P/R/F@0.7 = 0.876 / 0.938 / 0.906
Rhyme P/R/F@0.6 = 0.867 / 0.949 / 0.906
TEST 341/341: lm ppl = 86.4; pm loss = 1.42; rm loss = 0.11; batch/sec = 21.6
Stress acc [0] = 0.722 (23440)
Stress acc [1] = 0.764 (11501)
Stress acc [2] = 0.808 (619)
Stress acc [3] = 0.737 (35560)
Rhyme P/R/F@0.9 = 0.903 / 0.804 / 0.851
Rhyme P/R/F@0.8 = 0.881 / 0.911 / 0.896
Rhyme P/R/F@0.7 = 0.866 / 0.939 / 0.901
Rhyme P/R/F@0.6 = 0.864 / 0.949 / 0.905

Epoch = 10
TRAIN 2761/2761: lm ppl = 61.2; pm loss = 1.49; rm loss = 0.14; batch/sec = 9.9
VALID 341/341: lm ppl = 80.3; pm loss = 1.44; rm loss = 0.11; batch/sec = 21.6
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.769 (11572)
Stress acc [2] = 0.840 (643)
Stress acc [3] = 0.740 (35434)
Rhyme P/R/F@0.9 = 0.906 / 0.831 / 0.867
Rhyme P/R/F@0.8 = 0.890 / 0.922 / 0.906
Rhyme P/R/F@0.7 = 0.883 / 0.941 / 0.911
Rhyme P/R/F@0.6 = 0.874 / 0.947 / 0.909
TEST 341/341: lm ppl = 83.9; pm loss = 1.43; rm loss = 0.11; batch/sec = 22.7
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.774 (11501)
Stress acc [2] = 0.830 (619)
Stress acc [3] = 0.742 (35560)
Rhyme P/R/F@0.9 = 0.903 / 0.827 / 0.863
Rhyme P/R/F@0.8 = 0.889 / 0.912 / 0.900
Rhyme P/R/F@0.7 = 0.876 / 0.941 / 0.907
Rhyme P/R/F@0.6 = 0.867 / 0.950 / 0.907

Epoch = 11
TRAIN 2761/2761: lm ppl = 56.5; pm loss = 1.46; rm loss = 0.14; batch/sec = 9.9
VALID 341/341: lm ppl = 79.1; pm loss = 1.38; rm loss = 0.11; batch/sec = 21.6
Stress acc [0] = 0.722 (23219)
Stress acc [1] = 0.763 (11572)
Stress acc [2] = 0.840 (643)
Stress acc [3] = 0.738 (35434)
Rhyme P/R/F@0.9 = 0.904 / 0.841 / 0.871
Rhyme P/R/F@0.8 = 0.889 / 0.924 / 0.906
Rhyme P/R/F@0.7 = 0.883 / 0.943 / 0.912
Rhyme P/R/F@0.6 = 0.878 / 0.954 / 0.914
TEST 341/341: lm ppl = 83.1; pm loss = 1.38; rm loss = 0.11; batch/sec = 22.3
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.768 (11501)
Stress acc [2] = 0.830 (619)
Stress acc [3] = 0.741 (35560)
Rhyme P/R/F@0.9 = 0.900 / 0.831 / 0.864
Rhyme P/R/F@0.8 = 0.885 / 0.922 / 0.903
Rhyme P/R/F@0.7 = 0.875 / 0.940 / 0.906
Rhyme P/R/F@0.6 = 0.870 / 0.957 / 0.911

Epoch = 12
TRAIN 2761/2761: lm ppl = 52.0; pm loss = 1.45; rm loss = 0.14; batch/sec = 9.5
VALID 341/341: lm ppl = 78.3; pm loss = 1.37; rm loss = 0.11; batch/sec = 19.6
Stress acc [0] = 0.722 (23219)
Stress acc [1] = 0.772 (11572)
Stress acc [2] = 0.835 (643)
Stress acc [3] = 0.740 (35434)
Rhyme P/R/F@0.9 = 0.905 / 0.842 / 0.872
Rhyme P/R/F@0.8 = 0.891 / 0.926 / 0.908
Rhyme P/R/F@0.7 = 0.882 / 0.939 / 0.910
Rhyme P/R/F@0.6 = 0.878 / 0.960 / 0.917
TEST 341/341: lm ppl = 82.1; pm loss = 1.37; rm loss = 0.10; batch/sec = 20.1
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.772 (11501)
Stress acc [2] = 0.830 (619)
Stress acc [3] = 0.742 (35560)
Rhyme P/R/F@0.9 = 0.904 / 0.834 / 0.867
Rhyme P/R/F@0.8 = 0.885 / 0.922 / 0.903
Rhyme P/R/F@0.7 = 0.874 / 0.946 / 0.908
Rhyme P/R/F@0.6 = 0.871 / 0.959 / 0.913

Epoch = 13
TRAIN 2761/2761: lm ppl = 48.2; pm loss = 1.44; rm loss = 0.13; batch/sec = 9.1
VALID 341/341: lm ppl = 76.9; pm loss = 1.37; rm loss = 0.10; batch/sec = 18.9
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.764 (11572)
Stress acc [2] = 0.832 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.905 / 0.844 / 0.873
Rhyme P/R/F@0.8 = 0.892 / 0.929 / 0.910
Rhyme P/R/F@0.7 = 0.884 / 0.956 / 0.919
Rhyme P/R/F@0.6 = 0.879 / 0.970 / 0.923
TEST 341/341: lm ppl = 81.0; pm loss = 1.38; rm loss = 0.10; batch/sec = 20.4
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.766 (11501)
Stress acc [2] = 0.824 (619)
Stress acc [3] = 0.740 (35560)
Rhyme P/R/F@0.9 = 0.908 / 0.845 / 0.875
Rhyme P/R/F@0.8 = 0.886 / 0.923 / 0.904
Rhyme P/R/F@0.7 = 0.878 / 0.969 / 0.921
Rhyme P/R/F@0.6 = 0.873 / 0.978 / 0.923

Epoch = 14
TRAIN 2761/2761: lm ppl = 44.7; pm loss = 1.42; rm loss = 0.13; batch/sec = 9.1
VALID 341/341: lm ppl = 75.9; pm loss = 1.37; rm loss = 0.10; batch/sec = 18.6
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.767 (11572)
Stress acc [2] = 0.829 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.904 / 0.852 / 0.878
Rhyme P/R/F@0.8 = 0.894 / 0.938 / 0.916
Rhyme P/R/F@0.7 = 0.885 / 0.964 / 0.922
Rhyme P/R/F@0.6 = 0.879 / 0.972 / 0.923
TEST 341/341: lm ppl = 80.5; pm loss = 1.37; rm loss = 0.10; batch/sec = 19.2
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.769 (11501)
Stress acc [2] = 0.826 (619)
Stress acc [3] = 0.740 (35560)
Rhyme P/R/F@0.9 = 0.906 / 0.853 / 0.879
Rhyme P/R/F@0.8 = 0.887 / 0.940 / 0.913
Rhyme P/R/F@0.7 = 0.878 / 0.977 / 0.925
Rhyme P/R/F@0.6 = 0.876 / 0.982 / 0.926

Epoch = 15
TRAIN 2761/2761: lm ppl = 41.5; pm loss = 1.42; rm loss = 0.13; batch/sec = 9.2
VALID 341/341: lm ppl = 75.3; pm loss = 1.36; rm loss = 0.10; batch/sec = 20.0
Stress acc [0] = 0.718 (23219)
Stress acc [1] = 0.756 (11572)
Stress acc [2] = 0.815 (643)
Stress acc [3] = 0.732 (35434)
Rhyme P/R/F@0.9 = 0.912 / 0.868 / 0.890
Rhyme P/R/F@0.8 = 0.894 / 0.940 / 0.916
Rhyme P/R/F@0.7 = 0.887 / 0.965 / 0.924
Rhyme P/R/F@0.6 = 0.883 / 0.978 / 0.928
TEST 341/341: lm ppl = 80.0; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.5
Stress acc [0] = 0.721 (23440)
Stress acc [1] = 0.761 (11501)
Stress acc [2] = 0.821 (619)
Stress acc [3] = 0.735 (35560)
Rhyme P/R/F@0.9 = 0.914 / 0.860 / 0.886
Rhyme P/R/F@0.8 = 0.893 / 0.940 / 0.916
Rhyme P/R/F@0.7 = 0.885 / 0.976 / 0.928
Rhyme P/R/F@0.6 = 0.881 / 0.986 / 0.931

Epoch = 16
TRAIN 2761/2761: lm ppl = 38.6; pm loss = 1.43; rm loss = 0.13; batch/sec = 9.2
VALID 341/341: lm ppl = 74.7; pm loss = 1.35; rm loss = 0.10; batch/sec = 20.4
Stress acc [0] = 0.721 (23219)
Stress acc [1] = 0.757 (11572)
Stress acc [2] = 0.804 (643)
Stress acc [3] = 0.734 (35434)
Rhyme P/R/F@0.9 = 0.908 / 0.871 / 0.889
Rhyme P/R/F@0.8 = 0.895 / 0.939 / 0.917
Rhyme P/R/F@0.7 = 0.888 / 0.967 / 0.926
Rhyme P/R/F@0.6 = 0.883 / 0.984 / 0.931
TEST 341/341: lm ppl = 79.5; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.9
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.760 (11501)
Stress acc [2] = 0.814 (619)
Stress acc [3] = 0.737 (35560)
Rhyme P/R/F@0.9 = 0.909 / 0.876 / 0.892
Rhyme P/R/F@0.8 = 0.895 / 0.948 / 0.921
Rhyme P/R/F@0.7 = 0.885 / 0.975 / 0.928
Rhyme P/R/F@0.6 = 0.876 / 0.987 / 0.928

Epoch = 17
TRAIN 2761/2761: lm ppl = 36.2; pm loss = 1.39; rm loss = 0.13; batch/sec = 9.3
VALID 341/341: lm ppl = 74.8; pm loss = 1.35; rm loss = 0.10; batch/sec = 20.6
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.763 (11572)
Stress acc [2] = 0.827 (643)
Stress acc [3] = 0.738 (35434)
Rhyme P/R/F@0.9 = 0.908 / 0.889 / 0.899
Rhyme P/R/F@0.8 = 0.893 / 0.944 / 0.918
Rhyme P/R/F@0.7 = 0.887 / 0.973 / 0.928
Rhyme P/R/F@0.6 = 0.882 / 0.985 / 0.931
TEST 341/341: lm ppl = 79.9; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.5
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.765 (11501)
Stress acc [2] = 0.829 (619)
Stress acc [3] = 0.740 (35560)
Rhyme P/R/F@0.9 = 0.910 / 0.874 / 0.892
Rhyme P/R/F@0.8 = 0.895 / 0.960 / 0.926
Rhyme P/R/F@0.7 = 0.886 / 0.982 / 0.931
Rhyme P/R/F@0.6 = 0.875 / 0.989 / 0.929
New valid performance is worse; restoring previous parameters...
lm loss: 2704.55650 --> 2705.69082
pm loss: 1.34837 --> 1.34557
rm loss: 0.10020 --> 0.10008

Epoch = 18
TRAIN 2761/2761: lm ppl = 36.1; pm loss = 1.40; rm loss = 0.13; batch/sec = 9.3
VALID 341/341: lm ppl = 74.9; pm loss = 1.35; rm loss = 0.10; batch/sec = 19.4
Stress acc [0] = 0.717 (23219)
Stress acc [1] = 0.754 (11572)
Stress acc [2] = 0.804 (643)
Stress acc [3] = 0.731 (35434)
Rhyme P/R/F@0.9 = 0.909 / 0.878 / 0.893
Rhyme P/R/F@0.8 = 0.895 / 0.945 / 0.919
Rhyme P/R/F@0.7 = 0.888 / 0.966 / 0.925
Rhyme P/R/F@0.6 = 0.883 / 0.983 / 0.930
TEST 341/341: lm ppl = 80.1; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.8
Stress acc [0] = 0.720 (23440)
Stress acc [1] = 0.756 (11501)
Stress acc [2] = 0.806 (619)
Stress acc [3] = 0.733 (35560)
Rhyme P/R/F@0.9 = 0.915 / 0.874 / 0.894
Rhyme P/R/F@0.8 = 0.895 / 0.949 / 0.921
Rhyme P/R/F@0.7 = 0.890 / 0.981 / 0.934
Rhyme P/R/F@0.6 = 0.883 / 0.987 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2704.55650 --> 2706.42760
pm loss: 1.34837 --> 1.35268
rm loss: 0.10020 --> 0.10048

Epoch = 19
TRAIN 2761/2761: lm ppl = 36.0; pm loss = 1.40; rm loss = 0.12; batch/sec = 9.4
VALID 341/341: lm ppl = 75.3; pm loss = 1.35; rm loss = 0.10; batch/sec = 20.6
Stress acc [0] = 0.716 (23219)
Stress acc [1] = 0.747 (11572)
Stress acc [2] = 0.815 (643)
Stress acc [3] = 0.728 (35434)
Rhyme P/R/F@0.9 = 0.906 / 0.878 / 0.892
Rhyme P/R/F@0.8 = 0.895 / 0.943 / 0.919
Rhyme P/R/F@0.7 = 0.890 / 0.975 / 0.931
Rhyme P/R/F@0.6 = 0.883 / 0.985 / 0.931
TEST 341/341: lm ppl = 80.6; pm loss = 1.35; rm loss = 0.10; batch/sec = 20.4
Stress acc [0] = 0.717 (23440)
Stress acc [1] = 0.749 (11501)
Stress acc [2] = 0.817 (619)
Stress acc [3] = 0.729 (35560)
Rhyme P/R/F@0.9 = 0.912 / 0.870 / 0.890
Rhyme P/R/F@0.8 = 0.893 / 0.956 / 0.923
Rhyme P/R/F@0.7 = 0.886 / 0.981 / 0.931
Rhyme P/R/F@0.6 = 0.877 / 0.989 / 0.930
New valid performance is worse; restoring previous parameters...
lm loss: 2704.55650 --> 2709.46002
pm loss: 1.34837 --> 1.35380
rm loss: 0.10020 --> 0.10147

Epoch = 20
TRAIN 2761/2761: lm ppl = 36.1; pm loss = 1.40; rm loss = 0.12; batch/sec = 9.3
VALID 341/341: lm ppl = 74.9; pm loss = 1.33; rm loss = 0.10; batch/sec = 21.0
Stress acc [0] = 0.724 (23219)
Stress acc [1] = 0.762 (11572)
Stress acc [2] = 0.840 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.906 / 0.878 / 0.892
Rhyme P/R/F@0.8 = 0.896 / 0.943 / 0.918
Rhyme P/R/F@0.7 = 0.887 / 0.973 / 0.928
Rhyme P/R/F@0.6 = 0.882 / 0.983 / 0.930
TEST 341/341: lm ppl = 80.2; pm loss = 1.31; rm loss = 0.10; batch/sec = 19.5
Stress acc [0] = 0.726 (23440)
Stress acc [1] = 0.763 (11501)
Stress acc [2] = 0.834 (619)
Stress acc [3] = 0.740 (35560)
Rhyme P/R/F@0.9 = 0.909 / 0.879 / 0.894
Rhyme P/R/F@0.8 = 0.893 / 0.953 / 0.922
Rhyme P/R/F@0.7 = 0.885 / 0.980 / 0.930
Rhyme P/R/F@0.6 = 0.878 / 0.988 / 0.930
New valid performance is worse; restoring previous parameters...
lm loss: 2704.55650 --> 2706.34836
pm loss: 1.34837 --> 1.33365
rm loss: 0.10020 --> 0.10030

Epoch = 21
TRAIN 2761/2761: lm ppl = 36.2; pm loss = 1.40; rm loss = 0.13; batch/sec = 9.5
VALID 341/341: lm ppl = 74.3; pm loss = 1.37; rm loss = 0.10; batch/sec = 19.7
Stress acc [0] = 0.722 (23219)
Stress acc [1] = 0.761 (11572)
Stress acc [2] = 0.813 (643)
Stress acc [3] = 0.736 (35434)
Rhyme P/R/F@0.9 = 0.910 / 0.879 / 0.895
Rhyme P/R/F@0.8 = 0.895 / 0.940 / 0.917
Rhyme P/R/F@0.7 = 0.888 / 0.972 / 0.928
Rhyme P/R/F@0.6 = 0.884 / 0.983 / 0.931
TEST 341/341: lm ppl = 79.1; pm loss = 1.36; rm loss = 0.10; batch/sec = 19.9
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.766 (11501)
Stress acc [2] = 0.827 (619)
Stress acc [3] = 0.739 (35560)
Rhyme P/R/F@0.9 = 0.915 / 0.879 / 0.897
Rhyme P/R/F@0.8 = 0.898 / 0.954 / 0.925
Rhyme P/R/F@0.7 = 0.886 / 0.978 / 0.930
Rhyme P/R/F@0.6 = 0.882 / 0.987 / 0.932

Epoch = 22
TRAIN 2761/2761: lm ppl = 33.8; pm loss = 1.40; rm loss = 0.12; batch/sec = 9.1
VALID 341/341: lm ppl = 75.2; pm loss = 1.33; rm loss = 0.10; batch/sec = 18.0
Stress acc [0] = 0.724 (23219)
Stress acc [1] = 0.763 (11572)
Stress acc [2] = 0.824 (643)
Stress acc [3] = 0.739 (35434)
Rhyme P/R/F@0.9 = 0.905 / 0.873 / 0.889
Rhyme P/R/F@0.8 = 0.895 / 0.947 / 0.920
Rhyme P/R/F@0.7 = 0.889 / 0.968 / 0.927
Rhyme P/R/F@0.6 = 0.884 / 0.985 / 0.932
TEST 341/341: lm ppl = 81.2; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.5
Stress acc [0] = 0.727 (23440)
Stress acc [1] = 0.766 (11501)
Stress acc [2] = 0.821 (619)
Stress acc [3] = 0.741 (35560)
Rhyme P/R/F@0.9 = 0.911 / 0.872 / 0.891
Rhyme P/R/F@0.8 = 0.896 / 0.957 / 0.925
Rhyme P/R/F@0.7 = 0.889 / 0.980 / 0.933
Rhyme P/R/F@0.6 = 0.882 / 0.989 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2709.06372
pm loss: 1.37264 --> 1.33357
rm loss: 0.10036 --> 0.10158

Epoch = 23
TRAIN 2761/2761: lm ppl = 33.8; pm loss = 1.39; rm loss = 0.12; batch/sec = 8.8
VALID 341/341: lm ppl = 74.7; pm loss = 1.35; rm loss = 0.10; batch/sec = 19.6
Stress acc [0] = 0.720 (23219)
Stress acc [1] = 0.759 (11572)
Stress acc [2] = 0.821 (643)
Stress acc [3] = 0.734 (35434)
Rhyme P/R/F@0.9 = 0.907 / 0.876 / 0.892
Rhyme P/R/F@0.8 = 0.895 / 0.940 / 0.917
Rhyme P/R/F@0.7 = 0.889 / 0.970 / 0.928
Rhyme P/R/F@0.6 = 0.884 / 0.977 / 0.928
TEST 341/341: lm ppl = 80.0; pm loss = 1.34; rm loss = 0.10; batch/sec = 15.4
Stress acc [0] = 0.723 (23440)
Stress acc [1] = 0.761 (11501)
Stress acc [2] = 0.814 (619)
Stress acc [3] = 0.737 (35560)
Rhyme P/R/F@0.9 = 0.913 / 0.878 / 0.895
Rhyme P/R/F@0.8 = 0.896 / 0.950 / 0.922
Rhyme P/R/F@0.7 = 0.888 / 0.980 / 0.932
Rhyme P/R/F@0.6 = 0.885 / 0.987 / 0.933
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2704.35247
pm loss: 1.37264 --> 1.34982
rm loss: 0.10036 --> 0.09850

Epoch = 24
TRAIN 2761/2761: lm ppl = 33.9; pm loss = 1.41; rm loss = 0.12; batch/sec = 9.0
VALID 341/341: lm ppl = 75.3; pm loss = 1.38; rm loss = 0.10; batch/sec = 19.6
Stress acc [0] = 0.719 (23219)
Stress acc [1] = 0.761 (11572)
Stress acc [2] = 0.832 (643)
Stress acc [3] = 0.735 (35434)
Rhyme P/R/F@0.9 = 0.911 / 0.896 / 0.904
Rhyme P/R/F@0.8 = 0.897 / 0.948 / 0.922
Rhyme P/R/F@0.7 = 0.889 / 0.975 / 0.930
Rhyme P/R/F@0.6 = 0.884 / 0.983 / 0.931
TEST 341/341: lm ppl = 80.2; pm loss = 1.36; rm loss = 0.10; batch/sec = 20.4
Stress acc [0] = 0.723 (23440)
Stress acc [1] = 0.767 (11501)
Stress acc [2] = 0.819 (619)
Stress acc [3] = 0.739 (35560)
Rhyme P/R/F@0.9 = 0.920 / 0.893 / 0.906
Rhyme P/R/F@0.8 = 0.897 / 0.955 / 0.925
Rhyme P/R/F@0.7 = 0.887 / 0.984 / 0.933
Rhyme P/R/F@0.6 = 0.882 / 0.987 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2709.34427
pm loss: 1.37264 --> 1.37761
rm loss: 0.10036 --> 0.09951

Epoch = 25
TRAIN 2761/2761: lm ppl = 33.9; pm loss = 1.41; rm loss = 0.12; batch/sec = 9.3
VALID 341/341: lm ppl = 74.7; pm loss = 1.45; rm loss = 0.10; batch/sec = 19.6
Stress acc [0] = 0.705 (23219)
Stress acc [1] = 0.737 (11572)
Stress acc [2] = 0.792 (643)
Stress acc [3] = 0.717 (35434)
Rhyme P/R/F@0.9 = 0.907 / 0.882 / 0.894
Rhyme P/R/F@0.8 = 0.896 / 0.942 / 0.918
Rhyme P/R/F@0.7 = 0.890 / 0.975 / 0.930
Rhyme P/R/F@0.6 = 0.883 / 0.984 / 0.931
TEST 341/341: lm ppl = 80.7; pm loss = 1.41; rm loss = 0.10; batch/sec = 20.5
Stress acc [0] = 0.707 (23440)
Stress acc [1] = 0.742 (11501)
Stress acc [2] = 0.803 (619)
Stress acc [3] = 0.720 (35560)
Rhyme P/R/F@0.9 = 0.909 / 0.877 / 0.893
Rhyme P/R/F@0.8 = 0.893 / 0.951 / 0.921
Rhyme P/R/F@0.7 = 0.887 / 0.981 / 0.932
Rhyme P/R/F@0.6 = 0.882 / 0.988 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2704.33872
pm loss: 1.37264 --> 1.44634
rm loss: 0.10036 --> 0.10208

Epoch = 26
TRAIN 2761/2761: lm ppl = 33.7; pm loss = 1.40; rm loss = 0.12; batch/sec = 9.5
VALID 341/341: lm ppl = 74.6; pm loss = 1.33; rm loss = 0.10; batch/sec = 21.4
Stress acc [0] = 0.721 (23219)
Stress acc [1] = 0.762 (11572)
Stress acc [2] = 0.816 (643)
Stress acc [3] = 0.736 (35434)
Rhyme P/R/F@0.9 = 0.912 / 0.881 / 0.896
Rhyme P/R/F@0.8 = 0.896 / 0.939 / 0.917
Rhyme P/R/F@0.7 = 0.890 / 0.969 / 0.928
Rhyme P/R/F@0.6 = 0.884 / 0.984 / 0.931
TEST 341/341: lm ppl = 79.8; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.5
Stress acc [0] = 0.722 (23440)
Stress acc [1] = 0.764 (11501)
Stress acc [2] = 0.821 (619)
Stress acc [3] = 0.737 (35560)
Rhyme P/R/F@0.9 = 0.916 / 0.875 / 0.895
Rhyme P/R/F@0.8 = 0.898 / 0.952 / 0.924
Rhyme P/R/F@0.7 = 0.886 / 0.978 / 0.930
Rhyme P/R/F@0.6 = 0.884 / 0.987 / 0.933
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2703.97577
pm loss: 1.37264 --> 1.33469
rm loss: 0.10036 --> 0.10033

Epoch = 27
TRAIN 2761/2761: lm ppl = 33.8; pm loss = 1.39; rm loss = 0.12; batch/sec = 9.0
VALID 341/341: lm ppl = 75.2; pm loss = 1.34; rm loss = 0.10; batch/sec = 18.1
Stress acc [0] = 0.721 (23219)
Stress acc [1] = 0.761 (11572)
Stress acc [2] = 0.824 (643)
Stress acc [3] = 0.736 (35434)
Rhyme P/R/F@0.9 = 0.906 / 0.880 / 0.893
Rhyme P/R/F@0.8 = 0.895 / 0.939 / 0.917
Rhyme P/R/F@0.7 = 0.886 / 0.964 / 0.924
Rhyme P/R/F@0.6 = 0.883 / 0.978 / 0.928
TEST 341/341: lm ppl = 80.9; pm loss = 1.33; rm loss = 0.10; batch/sec = 18.6
Stress acc [0] = 0.721 (23440)
Stress acc [1] = 0.765 (11501)
Stress acc [2] = 0.821 (619)
Stress acc [3] = 0.737 (35560)
Rhyme P/R/F@0.9 = 0.913 / 0.872 / 0.892
Rhyme P/R/F@0.8 = 0.896 / 0.943 / 0.919
Rhyme P/R/F@0.7 = 0.888 / 0.980 / 0.932
Rhyme P/R/F@0.6 = 0.883 / 0.987 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2708.98422
pm loss: 1.37264 --> 1.33804
rm loss: 0.10036 --> 0.09945

Epoch = 28
TRAIN 2761/2761: lm ppl = 34.0; pm loss = 1.40; rm loss = 0.12; batch/sec = 9.1
VALID 341/341: lm ppl = 74.7; pm loss = 1.32; rm loss = 0.10; batch/sec = 18.8
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.759 (11572)
Stress acc [2] = 0.813 (643)
Stress acc [3] = 0.737 (35434)
Rhyme P/R/F@0.9 = 0.908 / 0.879 / 0.893
Rhyme P/R/F@0.8 = 0.895 / 0.945 / 0.920
Rhyme P/R/F@0.7 = 0.888 / 0.971 / 0.928
Rhyme P/R/F@0.6 = 0.883 / 0.985 / 0.931
TEST 341/341: lm ppl = 80.1; pm loss = 1.32; rm loss = 0.10; batch/sec = 20.2
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.759 (11501)
Stress acc [2] = 0.817 (619)
Stress acc [3] = 0.738 (35560)
Rhyme P/R/F@0.9 = 0.912 / 0.877 / 0.894
Rhyme P/R/F@0.8 = 0.895 / 0.954 / 0.924
Rhyme P/R/F@0.7 = 0.885 / 0.977 / 0.929
Rhyme P/R/F@0.6 = 0.882 / 0.987 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2704.98423
pm loss: 1.37264 --> 1.32496
rm loss: 0.10036 --> 0.10296

Epoch = 29
TRAIN 2761/2761: lm ppl = 33.9; pm loss = 1.39; rm loss = 0.12; batch/sec = 9.0
VALID 341/341: lm ppl = 74.3; pm loss = 1.33; rm loss = 0.10; batch/sec = 20.4
Stress acc [0] = 0.723 (23219)
Stress acc [1] = 0.758 (11572)
Stress acc [2] = 0.829 (643)
Stress acc [3] = 0.736 (35434)
Rhyme P/R/F@0.9 = 0.908 / 0.873 / 0.890
Rhyme P/R/F@0.8 = 0.895 / 0.948 / 0.921
Rhyme P/R/F@0.7 = 0.887 / 0.973 / 0.928
Rhyme P/R/F@0.6 = 0.883 / 0.982 / 0.930
TEST 341/341: lm ppl = 79.7; pm loss = 1.33; rm loss = 0.10; batch/sec = 19.9
Stress acc [0] = 0.724 (23440)
Stress acc [1] = 0.762 (11501)
Stress acc [2] = 0.817 (619)
Stress acc [3] = 0.738 (35560)
Rhyme P/R/F@0.9 = 0.914 / 0.879 / 0.896
Rhyme P/R/F@0.8 = 0.895 / 0.956 / 0.924
Rhyme P/R/F@0.7 = 0.884 / 0.981 / 0.930
Rhyme P/R/F@0.6 = 0.881 / 0.988 / 0.931
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2701.72453
pm loss: 1.37264 --> 1.33325
rm loss: 0.10036 --> 0.10359

Epoch = 30
TRAIN 2761/2761: lm ppl = 33.8; pm loss = 1.41; rm loss = 0.12; batch/sec = 9.2
VALID 341/341: lm ppl = 74.7; pm loss = 1.36; rm loss = 0.10; batch/sec = 20.6
Stress acc [0] = 0.721 (23219)
Stress acc [1] = 0.762 (11572)
Stress acc [2] = 0.818 (643)
Stress acc [3] = 0.736 (35434)
Rhyme P/R/F@0.9 = 0.907 / 0.884 / 0.895
Rhyme P/R/F@0.8 = 0.897 / 0.945 / 0.920
Rhyme P/R/F@0.7 = 0.886 / 0.971 / 0.927
Rhyme P/R/F@0.6 = 0.885 / 0.983 / 0.931
TEST 341/341: lm ppl = 80.3; pm loss = 1.34; rm loss = 0.10; batch/sec = 19.7
Stress acc [0] = 0.725 (23440)
Stress acc [1] = 0.766 (11501)
Stress acc [2] = 0.817 (619)
Stress acc [3] = 0.740 (35560)
Rhyme P/R/F@0.9 = 0.913 / 0.881 / 0.896
Rhyme P/R/F@0.8 = 0.897 / 0.954 / 0.924
Rhyme P/R/F@0.7 = 0.887 / 0.983 / 0.932
Rhyme P/R/F@0.6 = 0.883 / 0.987 / 0.932
New valid performance is worse; restoring previous parameters...
lm loss: 2701.49113 --> 2704.36582
pm loss: 1.37264 --> 1.35676
rm loss: 0.10036 --> 0.10250
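
The "restoring previous parameters" messages from epoch 17 onward are consistent with a simple keep-or-restore rule on the validation lm loss. In several epochs (20 and 22, for example) the pm loss improves and the parameters are still restored, which suggests the lm loss alone decides. That reading is an inference from this log, not confirmed by the training code. The sketch below replays such a rule over a few logged values; `replay_restores` is a hypothetical helper.

```python
def replay_restores(valid_lm_losses):
    """Replay a keep-or-restore rule over per-epoch validation lm losses.

    valid_lm_losses: list of (epoch, lm_loss) in epoch order; the first
    entry is taken as the initial best. Returns (epoch, action, best) tuples.
    """
    (first_epoch, best), rest = valid_lm_losses[0], valid_lm_losses[1:]
    decisions = [(first_epoch, "keep", best)]
    for epoch, loss in rest:
        if loss < best:
            best = loss  # new best: keep the updated parameters
            decisions.append((epoch, "keep", best))
        else:
            decisions.append((epoch, "restore", best))  # worse: roll back
    return decisions


# lm losses copied from the "lm loss: best --> new" lines. Epochs 16 and 21
# print no such line, so their values are read off the left-hand side of
# the lines that follow them.
logged = [(16, 2704.55650), (17, 2705.69082), (18, 2706.42760),
          (19, 2709.46002), (20, 2706.34836), (21, 2701.49113),
          (22, 2709.06372), (23, 2704.35247)]
for epoch, action, best in replay_restores(logged):
    print(epoch, action, best)
```

Replaying the values reproduces the pattern in the log: restores at epochs 17 to 20, a new best at epoch 21, and restores again afterwards. Epoch 23 is restored even though its loss is below the epoch 16 value, because the reference is the epoch 21 best.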

Aggregated Rhyme Pattern:

Quatrain 0 :
Line 00 = -2.00 0.21 0.17 0.18
Line 01 = 0.21 -2.00 0.18 0.16
Line 02 = 0.16 0.18 -2.00 0.21
Line 03 = 0.18 0.16 0.21 -2.00

Quatrain 1 :
Line 04 = -2.00 0.22 0.15 0.18
Line 05 = 0.22 -2.00 0.18 0.15
Line 06 = 0.15 0.18 -2.00 0.22
Line 07 = 0.17 0.15 0.22 -2.00

Quatrain 2 :
Line 08 = -2.00 0.23 0.22 0.09
Line 09 = 0.23 -2.00 0.05 0.23
Line 10 = 0.22 0.05 -2.00 0.23
Line 11 = 0.09 0.22 0.23 -2.00
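
Each quatrain block above reads as a 4x4 matrix over its lines, with -2.00 on the diagonal. A minimal sketch for inspecting it, assuming that larger off-diagonal values mean a stronger rhyme association between two lines and that -2.00 marks the excluded self-pair (both are assumptions; the log does not define the values). `strongest_partners` is a hypothetical helper.

```python
def strongest_partners(matrix):
    """For each row, return the column index with the largest score.

    Ties resolve to the lowest column index.
    """
    return [max(range(len(row)), key=row.__getitem__) for row in matrix]


# Quatrain 0, copied from the log (rows = Line 00..03).
quatrain_0 = [
    [-2.00, 0.21, 0.17, 0.18],
    [0.21, -2.00, 0.18, 0.16],
    [0.16, 0.18, -2.00, 0.21],
    [0.18, 0.16, 0.21, -2.00],
]
print(strongest_partners(quatrain_0))
```

The margins between the top scores are small (0.21 against 0.18 in Quatrain 0) and Line 09 has a tie in Quatrain 2, so a per-row argmax is only a rough summary of the pattern.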
