This is an online demo with explanation and tutorial on Visual Question Answering. This is not a naive or hello-world model, this model returns close to state-of-the-art without using any attention models, memory netorks (other than LSTM) and fine-tuning, which are essential recipe for current best results.
I have tried to explain different parts, and reasoning behind their choices. This is meant to be an interactive tutorial, feel free to change the model parameters and experiment. If you have latest graphics card execution time should be within a minute.
All the files required to run this ipython notebook can be obtained from
Table of Contents
转载本站任何文章请注明：转载至神刀安全网，谢谢神刀安全网 » Deep Learning for visual question answering: demo with Keras code included