I am wondering if I implemented my Encoder-Attention-Decoder model in the correct way
I have a school project where I am researching with a small group of people if it is possible to train a RNN model to decrypt text which is encrypted by the Vigenère cipher. The main research question is if we can train a model which can quite accurately decrypt text for every chosen key, without having been necessarily trained on that key. In other words, can the model generalise to all keys when trained on a lot of different keys.