Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran
چکیده
There have been several attempts so far to use topic models like LDA for simultaneous image classification and annotation. Recently, other models based on probabilistic neural networks, e.g. SupDocNADE, have been introduced with promising results in modeling multinomial data. In these models, annotation words along with visual words are embedded in the same framework and are considered as the feature vector. The imbalance between the number of visual words and annotation words may cause some problems. For instance, the contribution of the annotation words to the hidden representation is so smaller than the contribution of the visual words that the network might easily ignore it. To address this issue, we propose a weighting scheme for the annotation words. We also use LLC to generate the visual words in our proposed model. Experimental results show a 5% increase in the F-measure in the proposed annotation words and a 1% improvement in classification accuracy compared to existing models on UIUC_Sports and LabelMe datasets.
Mohammadi Foumani,Seyed Navid و Nickabadi,Ahmad . (1395). Simultaneous Image Classification and Annotation Using Probabilistic Topic Models and LLC Encoding of Visual Words. (e206873). علوم رایانش و فناوری اطلاعات, 14(2), e206873
MLA
Mohammadi Foumani,Seyed Navid , و Nickabadi,Ahmad . "Simultaneous Image Classification and Annotation Using Probabilistic Topic Models and LLC Encoding of Visual Words" .e206873 , علوم رایانش و فناوری اطلاعات, 14, 2, 1395, e206873.
HARVARD
Mohammadi Foumani Seyed Navid, Nickabadi Ahmad. (1395). 'Simultaneous Image Classification and Annotation Using Probabilistic Topic Models and LLC Encoding of Visual Words', علوم رایانش و فناوری اطلاعات, 14(2), e206873.
CHICAGO
Seyed Navid Mohammadi Foumani و Ahmad Nickabadi, "Simultaneous Image Classification and Annotation Using Probabilistic Topic Models and LLC Encoding of Visual Words," علوم رایانش و فناوری اطلاعات, 14 2 (1395): e206873,
VANCOUVER
Mohammadi Foumani Seyed Navid, Nickabadi Ahmad. Simultaneous Image Classification and Annotation Using Probabilistic Topic Models and LLC Encoding of Visual Words. علوم رایانش و فناوری اطلاعات, 1395; 14(2): e206873.