What are the main types of feature extraction methods for images?
How may transformers be leveraged to extract features from images?
How to use this item?
The paper in general covers methods used in image captioning, with feature extraction, and transformers being the primary topics. As a high level overview of what's happening with those items, it's a good article.