My Projects

Image captioning using a CNN and a Transformer

Based on an extensive literature survey, and consistent with our own findings, the multi-head attention (MHA) Transformer performs better than the CNN-RNN (LSTM) model. The MHA Transformer also takes less time to train than the CNN-RNN (LSTM) model.

The BLEU score target was achieved: 54.8% against a target of 50%. To the best of our knowledge, this is better than most published BLEU scores.
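
For readers unfamiliar with the metric, here is a minimal sketch of how a corpus-level BLEU score can be computed: clipped n-gram precision combined with a brevity penalty, without smoothing. It is not the project's evaluation script. The names `ngrams` and `corpus_bleu` are hypothetical, and the project summary does not say which n-gram order or tool was used, so `max_n` is left as a parameter.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(candidates, references, max_n=4):
    """Unsmoothed corpus-level BLEU (illustrative, hypothetical helper).

    candidates: list of token lists, one per generated caption.
    references: list of lists of token lists, one group per caption.
    """
    matches = [0] * max_n  # clipped n-gram matches, per order
    totals = [0] * max_n   # candidate n-gram counts, per order
    cand_len = 0
    ref_len = 0
    for cand, refs in zip(candidates, references):
        cand_len += len(cand)
        # Use the reference length closest to the candidate length
        # (ties resolved toward the shorter reference).
        ref_len += min((abs(len(r) - len(cand)), len(r)) for r in refs)[1]
        for n in range(1, max_n + 1):
            cand_counts = ngrams(cand, n)
            # Highest count of each n-gram across all references.
            max_ref = Counter()
            for r in refs:
                for gram, count in ngrams(r, n).items():
                    max_ref[gram] = max(max_ref[gram], count)
            matches[n - 1] += sum(
                min(count, max_ref[gram]) for gram, count in cand_counts.items()
            )
            totals[n - 1] += sum(cand_counts.values())
    # Without smoothing, any order with zero matches gives a score of zero.
    if cand_len == 0 or min(matches) == 0:
        return 0.0
    # Geometric mean of the per-order precisions.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Brevity penalty for candidates shorter than their references.
    bp = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return bp * math.exp(log_precision)


candidate = "the cat is on the mat".split()
reference = "the cat sat on the mat".split()
score = corpus_bleu([candidate], [[reference]], max_n=2)
print(f"BLEU-2: {score:.4f}")
```

A score reported as a percentage, such as the 54.8% above, corresponds to this value multiplied by 100.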

About Program