| Epoch | Training Loss | Validation Loss | Entropy | Num Tokens | Mean Token Accuracy |
|---|---|---|---|---|---|
| 1 | 0.800300 | 0.648739 | 0.590625 | 3241.000000 | 0.888866 |
| 2 | 0.269600 | 0.430649 | 0.310939 | 6482.000000 | 0.937447 |
| 3 | 0.179600 | 0.442410 | 0.217868 | 9723.000000 | 0.933030 |
| 4 | 0.104800 | 0.469016 | 0.196222 | 12964.000000 | 0.933815 |
| 5 | 0.064500 | 0.436476 | 0.183889 | 16205.000000 | 0.937461 |
| 6 | 0.054400 | 0.489788 | 0.119700 | 19446.000000 | 0.940793 |
| 7 | 0.035100 | 0.498824 | 0.083078 | 22687.000000 | 0.944682 |
| 8 | 0.036200 | 0.502200 | 0.093177 | 25928.000000 | 0.944957 |