|
I am interested in knowing if there are models and metrics for measuring the success of the textual captions in a search engine. For example let's say I have two models that generate captions for my search engine. How can I know that model 1 is producing better captions than model 2. The closest metric that I have found is using quick back/click backs. I am wondering if there is any scholarly work in this area? |