What Helps Where - and Why? Semantic Relatedness For Knowledge Transfer
Marcus Rohrbach
Department of Computer Science, TU Darmstadt

Abstract. Recognition of object classes has advanced to master difficult scenarios such as background clutter or multiple objects per image. Scaling to large numbers of classes, however, remains a challenge for state-of-the-art recognition approaches. In order to exploit similarities between classes, knowledge transfer between classes has been advocated. In this paper we examine the special case of transferring knowledge from known to unseen classes (zero-shot recognition). In previous work the decision which knowledge to transfer has mostly been provided by supervision in the form of manual associations between known and unseen classes or a few training examples, limiting the scalability of these approaches. We promote semantic relatedness to replace supervision in order to provide the missing link between the sources (known classes) and targets (unseen classes) of knowledge transfer. We provide a rigorous experimental evaluation of several state-of-the-art semantic relatedness measures and language resources on the challenging Animals with Attributes image dataset. This extensive evaluation provides insights into the qualities and applicability of the different measures and resources.
Introduction
While remarkable recognition performance has been reported on a wide variety of object classes, scaling recognition to large numbers of classes remains a key challenge for state-of-the-art recognition approaches. The limiting factors are mainly the large training sets required to learn each object class and the restricted number of different classes systems are able to handle. It is generally believed that knowledge transfer between image classes is a promising way to tackle both problems. However, approaches so far require manual supervision [1, 2] or rely on a few bootstrap training examples [3, 4]. In this work we reduce the amount of manual supervision by using linguistic knowledge bases as an additional source of information. In the special case of zero-shot recognition we transfer knowledge from known object classes (during training) to unseen object classes (during testing) using two different models. The first model, suggested by [1], uses attributes as a level of indirection to generalize over classes. Fig. 1(a) visualizes the model and shows that one attribute (e.g. paw) is associated with several training classes (tiger, gorilla) and test classes (leopard, giant panda). The binary object-attribute associations are mined from language resources, such as WordNet [5], Wikipedia, Flickr tags and descriptions, or the World Wide Web queried by Yahoo. The second model uses inter-object class relatedness to transfer knowledge. As shown in Fig. 1(b), unseen classes are directly associated with the most similar training classes, e.g. seal with walrus and dolphin. Again, the associations are mined from language. The main contributions of this work are as follows. First, we provide a thorough evaluation of a variety of language resources and semantic relatedness (SR) measures
Fig. 1. Visual knowledge transfer from training classes to unseen test class images (zero-shot recognition). The required associations are provided by semantic relatedness measures based on different language resources (blue-colored lines). Two models are distinguished: (a) transfer from attributes to unseen classes using class-attribute relatedness; (b) transfer from most similar classes using inter-object class relatedness.
which allow significantly reducing supervision in zero-shot object detection. Second, we compare two different models for zero-shot detection, including several levels of automation for the attribute model to further reduce manual intervention. Third, we discuss the major differences between language resources and give insights which might also be applicable to other vision tasks. Large parts of this work have been published in [6]. The remainder of the paper is structured as follows. After a review of related work (Sect. 2) we present the models and approaches used for zero-shot knowledge transfer (Sect. 3) and introduce the different language resources and relatedness measures (Sect. 4). Then we discuss our experimental results in Sect. 5 and conclude with an outlook in Sect. 6.
Related work
Sharing and transferring knowledge between classes has recently become an important research direction for scalable recognition. We distinguish between two broad types of knowledge transfer: first, sharing knowledge across classes on a separate layer of attributes, and, second, directly transferring knowledge between classes using inter-class similarity. We conclude this section by discussing approaches which use vision and language resources in combination. Attributes have been used in a wide variety of vision applications. Approaches range from elementary visual properties such as colors or geometric patterns [7-9] to high-level attributes such as gender for face verification [10], background scenes [11], or parts [2]. As attribute activations can characterize object classes without using reference exemplars, they can be used for zero-shot classification of previously unseen object classes: [12] formally discusses zero-shot learning and compares the performance of a
[Figure 2 graphics lost in extraction. Panel (a): the attribute-based model of [1], connecting an image x via attribute posteriors p(a_1|x), ..., p(a_M|x) and attributes a_1, ..., a_M to the training classes y_1, ..., y_K and unseen classes z_1, ..., z_L. Panel (b): the direct similarity-based model, connecting x via class posteriors p(y_1|x), ..., p(y_K|x) and classes y_1, ..., y_K to the unseen classes z_1, ..., z_L.]
Fig. 2. Two models for zero-shot object classification. See Sect. 3 for discussions.
linguistic knowledge base (Google Trillion-Word-Corpus) to manual labels. However, they apply it to a very different domain, namely neural decoding of thoughts. In the domain of computer vision, [1] presents zero-shot classification schemes based on attributes, where the associations between attributes and object classes are obtained using manual supervision by human subjects. [13] advocates a paradigm shift from naming (by object classes) to describing (by attributes), distinguishing among common, discriminating, unusual, and unexpected attributes for object classes. While we use the model proposed by [1] for attribute-based knowledge transfer, we replace manual supervision with information extracted automatically from language resources. In particular, our attribute-based approach is the first that is able to transfer knowledge to unseen visual object classes without any additional supervision. In contrast to attribute-based approaches, which employ an additional layer of indirection, direct approaches use inter-class similarities to model relationships. One direction is to model inter-class similarities in a semantic hierarchy [14-17]. A second direction is to directly transfer knowledge from known to unseen classes by transferring class priors [2, 18] or by using one or few example images of these new classes [4, 19, 3]. Our second model for knowledge transfer is also based on such direct similarities, but we extend it to zero-shot classification by using information obtained from language resources instead. Literature combining vision and language resources ranges from estimating the visualness of words [20, 21], over creating visual ontologies [16, 17], to joint models of images and accompanying text [22, 23]. Similar to our work, search engine hit counts have been used to determine semantic relatedness between objects and background scenes [11] or to find the color of objects [7].
Although these approaches are still limited in the use of language resources and semantic relatedness measures applied, they show the benefit of using visual and language information together. This work extends several of these ideas and provides an in-depth study of a range of SR measures applied to different language resources for zero-shot object class detection.
Zero-shot knowledge transfer

We examine knowledge transfer in a zero-shot scenario, i.e. we assume that we have a set of known classes y_1, ..., y_K and a set of unseen classes z_1, ..., z_L. For this we build upon two different kinds of models, namely attribute-based classification [1] with
an intermediate level of attributes a_1, ..., a_M (Fig. 2(a)) and direct similarity-based classification [3], transferring knowledge directly from the known classes y_1, ..., y_K to the unseen classes z_1, ..., z_L (see Fig. 2(b)).

3.1 Attribute-based classification
The attribute-based model aims to learn common visual features across classes defined by attributes. As the trained attribute classifiers generalize over the set of training classes, they can be used to classify new unseen classes. This is visualized in Fig. 1(a), where the attribute paw generalizes over the training classes tiger and gorilla and can then be transferred to the test classes leopard and giant panda. To formally model these associations we use the direct attribute prediction (DAP) model suggested by [1]. This is shown in Fig. 2(a): an attribute $a_m$ is associated with training ($y_1, \ldots, y_K$) and test classes ($z_1, \ldots, z_L$); class-attribute associations are shown with dashed lines. The class-attribute associations ($a^{y_k}_m$ / $a^{z_l}_m$) can either be active or inactive, resulting in a group of classes associated with each attribute. E.g., for giant panda in Fig. 1(a), attributes are either active (belly and paw) or inactive (flipper). Following the probabilistic formulation of DAP in [1], let $a^y = (a^y_1, \ldots, a^y_M)$ be a vector of binary associations $a^y_m \in \{0,1\}$ between attributes $a_m$ and training object classes $y$. A classifier for attribute $a_m$ is trained by labeling all images of all classes for which $a^y_m = 1$ as positive and the rest as negative training examples. Now this classifier can estimate the posterior probability $p(a_m|x)$ of that attribute being present in image $x$. Mutual independence yields $p(a|x) = \prod_{m=1}^{M} p(a_m|x)$ for multiple attributes. In order to transfer attribute knowledge to an unseen class $z$, we again assume a binary vector $a^z$ with $p(a|z) = 1$ for $a = a^z$ and $p(a|z) = 0$ otherwise. The posterior probability of class $z$ being present in image $x$ is then obtained by marginalizing over all possible attribute associations $a$, using Bayes' rule $p(z|a^z) = \frac{p(a^z|z)\,p(z)}{p(a^z)} = \frac{p(z)}{p(a^z)}$:

$$p(z|x) = \sum_{a \in \{0,1\}^M} p(z|a)\,p(a|x) = \frac{p(z)}{p(a^z)} \prod_{m=1}^{M} p(a^z_m|x). \qquad (1)$$
Attribute priors can be approximated by empirical means over the training classes, $p(a_m) = \frac{1}{K}\sum_{k=1}^{K} a^{y_k}_m$, or set to $\frac{1}{2}$ [1]. Classifying an image $x$ according to the test classes $z_1, \ldots, z_L$ uses MAP prediction: $\mathrm{argmax}_{l=1,\ldots,L}\; p(z_l|x)$. This leaves us with estimating the class-attribute associations $a^z_m$ for the unseen classes and $a^y_m$ for the known classes. [1] based the associations on judgments of 10 test subjects [24, 25]. Although [1] achieved promising performance with these associations, a large amount of human involvement is required when moving to a new domain or set of classes. In the following we suggest three approaches to decrease this required supervision. First, we reduce the human involvement by mining the associations automatically using semantic relatedness measures. Second, the attributes are also mined automatically to achieve a fully unsupervised setting. Finally, our third approach uses the class terms themselves as objectness attributes.
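As a concrete illustration, the MAP prediction of Eq. 1 can be sketched in a few lines of Python. This is a minimal sketch with hypothetical variable names; it assumes a uniform class prior $p(z)$, which then drops out of the argmax.

```python
import numpy as np

def dap_predict(attr_probs, assoc, attr_priors):
    """MAP zero-shot prediction with the DAP model (sketch, names are ours).

    attr_probs  : (M,) attribute posteriors p(a_m | x) for one image x
    assoc       : (L, M) binary associations a^{z_l}_m for the unseen classes
    attr_priors : (M,) attribute priors p(a_m)
    """
    # p(a^z_m | x) is p(a_m|x) if a^z_m = 1, else 1 - p(a_m|x); the same
    # convention applies to the priors. Work in log space for stability.
    log_post = np.where(assoc == 1, np.log(attr_probs), np.log(1.0 - attr_probs))
    log_prior = np.where(assoc == 1, np.log(attr_priors), np.log(1.0 - attr_priors))
    scores = (log_post - log_prior).sum(axis=1)  # log p(z_l|x) up to a constant
    return int(np.argmax(scores))
```

With binary associations and a uniform prior, each unseen class is thus scored by how well the predicted attribute vector matches its association vector.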
Mining class-attribute associations. To mine class-attribute associations one can use widely and freely available language resources paired with semantic relatedness (SR) measures. In contrast to domain-specific knowledge bases, which might require specific queries and measures, SR measures are generally applicable to a large range of textual resources such as the World Wide Web and thus have a very broad coverage and availability (see Sect. 4). When the class-attribute associations were determined manually by [24], human judges were provided with full-text attribute descriptions. One example is the property living in the New World (North and South America), which was abbreviated by [24] to newworld. Neither the description nor the abbreviation is appropriate as input for SR measures. We thus map the attribute descriptions/abbreviations to a concise term; in this example we used the term america. In this process a large amount of information is obviously lost or changed, and the attributes lose precision and accuracy, which increases the amount of noise (see results in Sect. 5).

Mining attributes. The amount of supervision is reduced significantly by mining the class-attribute associations, but it still requires the definition of a set of attributes when moving to another domain. Important qualities of the attributes are the ability to discriminate between object classes and to be visually distinctive, but at the same time shared among several classes to enable knowledge transfer. To meet these criteria we suggest using part attributes (e.g. flipper for animals, wheel for vehicles): part-based approaches have successfully been used in vision in different domains, and most objects consist of parts. We mine WordNet for the explicitly encoded parts of all 50 class terms and recursively all sub- and super-concepts. In total we found 74 part terms^1, see e.g. Fig. 1(a).
Objectness as attributes. Our third approach is based on the idea that each attribute is represented by a subset of all classes which contain similar visual image features. Thus such groups can be formed by grouping similar classes. More specifically, we use the class names themselves as attributes, for instance beaver in Fig. 3 groups object classes which are similar to a beaver. This attribute could thus be interpreted as the beaverness of an object. We denominate this approach as objectness. Regarding the underlying model there is no change, but SR measures are now required to measure inter-object class similarities rather than object-attribute relatedness. In line with this change, objectness attributes frequently group very similar classes compared to the more generic AwA or part attributes, e.g. the attribute grizzly bearness in Fig. 3 groups several bears (grizzly bear, gorilla, and giant panda) in contrast to the attribute belly (see Fig. 1(a)), which groups the very diverse classes sheep, polar bear, seal, and giant panda.

Fig. 3. Objectness: objectness attributes (groups of training classes, e.g. polar bearness, grizzly bearness, beaverness) are associated with unseen test class images (e.g. leopard, seal, giant panda) via inter-object class relatedness mined from language resources.
^1 All software for computing object class-attribute associations from linguistic language resources and the obtained intermediate results (lists of mined attributes, object class-attribute associations) will be made publicly available on our web pages prior to DAGM 2010.
3.2 Direct similarity-based classification
Rather than using the indirection of an attribute layer, we can classify the unseen classes $z_1, \ldots, z_L$ by using the learned models of the most similar training classes $y_1, \ldots, y_K$, as illustrated in Fig. 2(b). Again, the similarities $\gamma^z_{y_k}$ between the test class $z$ and the training classes $y_k$ can be mined from language resources (Fig. 1(b)). The direct similarity model can be interpreted as the DAP model with $M = K$ attributes, where each attribute corresponds exactly to one training class $y_k$. We thus train classifiers for each class $y_k$ to provide estimates of $p(y_k|x)$ for a test image $x$. Now we can estimate the posterior of a test image $x$ by applying these changes to Equation 1:

$$p(z|x) \propto \prod_{k=1}^{K} \left(\frac{p(y_k|x)}{p(y_k)}\right)^{\gamma^z_{y_k}}.$$

Instead of using only binary associations $\gamma^z_{y_k}$, we found empirically that continuous weights improve performance. We weight the most similar classes using the continuous similarities $w^z_{y_k}$ between $z$ and $y_k$ as normalized weights $\gamma^z_{y_k} = \frac{w^z_{y_k}}{\sum_{i=1}^{K} w^z_{y_i}}$.
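The weighted scoring just described can be sketched as follows. This is a hypothetical, self-contained rendering with variable names of our choosing; `w` holds the mined similarities, with only the most similar training classes per unseen class kept non-zero.

```python
import numpy as np

def direct_similarity_predict(class_probs, class_priors, w):
    """Zero-shot scoring via direct similarity (sketch, names are ours).

    class_probs  : (K,) posteriors p(y_k | x) of the known-class classifiers
    class_priors : (K,) class priors p(y_k)
    w            : (L, K) continuous similarities w^z_{y_k}; zero entries for
                   all but the most similar training classes of each row
    """
    gamma = w / w.sum(axis=1, keepdims=True)          # normalized weights
    # log p(z|x) up to a constant: sum_k gamma_k (log p(y_k|x) - log p(y_k))
    scores = gamma @ (np.log(class_probs) - np.log(class_priors))
    return int(np.argmax(scores))
```

Taking logs turns the weighted product into a matrix-vector product, so all unseen classes are scored in one step.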
Language resources and semantic relatedness measures

In this section we introduce several semantic relatedness (SR) measures to extract similarity information from the most widely used language resources.

WordNet [5] is the largest machine-readable, expert-created language ontology. It contains over 100,000 concepts which are organized in a hierarchical graph structure. We employ the measure proposed by Lin [26], which estimates SR between two concepts by comparing their individual depths in the hierarchy with the depth of their lowest common subsumer.

Wikipedia is the largest community-built online encyclopedia. The state-of-the-art Explicit Semantic Analysis (ESA) measure on Wikipedia [27] represents each term as a vector of frequencies over all articles. The similarity of two terms $t_1, t_2$ is computed by the cosine between the two respective vectors $c_1, c_2$: $\mathrm{sim}_{ESA}(t_1, t_2) = \frac{c_1 \cdot c_2}{\|c_1\|\,\|c_2\|}$.
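The cosine step of ESA is straightforward; the hard part, which we do not show, is building the concept vectors from an indexed Wikipedia dump. A minimal sketch:

```python
import numpy as np

def esa_similarity(c1, c2):
    """Cosine between two ESA concept vectors (sketch).

    c1, c2 : term-frequency vectors over all Wikipedia articles; obtaining
    them requires an inverted index of Wikipedia, assumed to exist here.
    """
    c1 = np.asarray(c1, dtype=float)
    c2 = np.asarray(c2, dtype=float)
    denom = np.linalg.norm(c1) * np.linalg.norm(c2)
    return float(c1 @ c2 / denom) if denom > 0 else 0.0
```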
Yahoo Web. The World Wide Web is presumably the largest publicly available source of textual information. Due to this fact it has been extensively used in diverse Natural Language Processing applications, mainly by using the hit counts of search engines to determine co-occurrence information [28]. In our study we use Yahoo to gather hit count (HC) information and the Dice coefficient to measure the similarity of term pairs: $\mathrm{sim}_{DICE}(t_1, t_2) = \frac{2 \cdot HC(t_1, t_2)}{HC(t_1) + HC(t_2)}$.

Yahoo Holonyms. When using part attributes we can explicitly use part-whole holonym relations. This can be achieved by querying the web for holonym patterns, such as "spots of Dalmatians" or "cat's paw". We use nine holonym patterns^2 suggested by [29], excluding the "in" patterns as they tend to denote non-visible parts. The measure is based on DICE, setting $HC(t_1, t_2) := \sum_{i=1}^{9} HC(\mathrm{pattern}_i(t_1, t_2))$.

Yahoo Img / Flickr Img. The web tends to be very noisy as unrelated terms might appear on the same web page, leading to noisy hit count statistics. Using web image search instead allows reducing this noise as the evaluated text refers to the same entity, the image. Additionally we hope to get more visually relevant hit count results. We use Yahoo image search and search Flickr's image tags and descriptions.

^2 Nine holonym patterns: (1-2) whole's part[s], (3-4) wholes' part[s], (5-6) part[s] of a whole, (7-8) part[s] of the whole, (9) parts of wholes.
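To make the two hit-count measures concrete, here is a small sketch. `hc` stands in for an actual search-engine query function, which we do not implement, and the pattern strings reflect our reading of footnote 2.

```python
def dice_similarity(hc, t1, t2):
    """Dice coefficient over hit counts (sketch). hc(q) is a hypothetical
    function returning the number of search hits for query string q; the
    conjunction of two terms is queried as a single joined string here."""
    denom = hc(t1) + hc(t2)
    return 2.0 * hc('%s %s' % (t1, t2)) / denom if denom > 0 else 0.0

# Holonym patterns as we read footnote 2; {w} = whole (class), {p} = part.
HOLONYM_PATTERNS = [
    "{w}'s {p}", "{w}'s {p}s", "{w}s' {p}", "{w}s' {p}s",
    "{p} of a {w}", "{p}s of a {w}", "{p} of the {w}", "{p}s of the {w}",
    "{p}s of {w}s",
]

def holonym_hit_count(hc, whole, part):
    """Summed hit counts over the nine holonym patterns (sketch)."""
    return sum(hc(pat.format(w=whole, p=part)) for pat in HOLONYM_PATTERNS)
```

Plugging `holonym_hit_count` into the numerator of `dice_similarity` yields the Yahoo Holonyms measure.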
Experiments
In this section we evaluate the different language resources and SR measures (Sect. 4) for the various approaches to zero-shot classification (Sect. 3) on the Animals with Attributes (AwA) dataset [1]. The AwA dataset consists of 50 mammal object classes paired with an inventory of 85 attributes and corresponding object class-attribute associations [25, 24]. We follow the experimental protocol of [1], using the provided split into 40 training and 10 test classes (24,295 training, 6,180 test images). We also use the provided pre-computed feature descriptors, namely RGB color histograms, SIFT, rgSIFT, PHOG, SURF, and local self-similarity histograms. In contrast to [1], we concatenate all features to a single vector instead of training independent SVMs. For computational reasons we depart slightly from [1] in our main experiments by down-sampling all training images to the minimum of 92 available images per class and using histogram intersection kernel SVMs instead of χ² kernel SVMs. For each attribute in the attribute-based model and each class in the direct model we train an SVM with an intersection kernel over the concatenated feature vectors. We use libSVM with the built-in probability estimates and a fixed cost parameter C=10, which was found on a subset of the training classes by grid search and cross-validation. To binarize the continuous SR values we threshold with the mean over all matrix values. We normalize the matrix values by dividing by column and row sums prior to binarization.

Reproduction of the results in [1]. When using all available training images and an SVM with χ²-kernel, our implementation achieves 80.3% mean AUC and a multi-class classification accuracy of 40.3%, which is very similar to the 80.7% and 40.5% reported in [1]. Using the intersection kernel and only 92 training images per class results in slightly lower performance with a mean AUC of 78.5% (Table 1, first row & first column) and an accuracy of 34.7%.
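The normalization and thresholding step just described can be sketched as follows; the exact order of the column/row normalization is our reading of the text.

```python
import numpy as np

def binarize_sr_matrix(S):
    """Binarize a class-attribute semantic-relatedness matrix (sketch).

    S : (num_classes, num_attributes) raw SR scores. The values are first
    normalized by column sums, then by row sums, and finally thresholded
    at the mean over all matrix values.
    """
    S = np.asarray(S, dtype=float)
    S = S / S.sum(axis=0, keepdims=True)   # divide by column sums
    S = S / S.sum(axis=1, keepdims=True)   # then by row sums
    return (S > S.mean()).astype(int)      # threshold at the global mean
```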
All following results are produced with this computationally more manageable setting.

1. AwA attributes - mined object class-attribute associations. We start by replacing the manual object class-attribute associations for the AwA attributes [1] with mined associations, shown in the first row of Table 1. Comparing the different language resources and SR measures introduced in Sect. 4, we find the image-based measures (Yahoo Img and Flickr Img) and Wikipedia to perform best with a mean AUC of 71.0%, 70.1%, and 69.7%, respectively. This is expected as the image-based measures are based on image-related text and thus inherently capture important correlations between terms and visual attributes. Wikipedia has been shown to provide an almost noise-free resource for computing semantic relatedness [27], which apparently transfers to our task. WordNet (60.5%) and Yahoo Web (60.4%) perform worst with a significant drop in performance of about 10% compared to the first three language resources. We explain this drop for Yahoo Web by the increased level of noise, mainly incidental co-occurrences on web pages, compared to image search and Wikipedia. While WordNet is noise-free, the problem originates from the employed measure, which is based on
[Table 1 layout lost in extraction. Columns: Manual, WordNet, Wikipedia, Yahoo Web, Yahoo Img, Flickr Img, Yahoo Holonyms; rows correspond to the experiments discussed below. Recoverable entry: manual associations achieve 78.5% mean AUC; the remaining values are quoted in the text.]
Table 1. Mean area under ROC curve (AUC) of the ten test classes in % for zero-shot classification on the AwA dataset. The respective best result of all knowledge bases is shown in bold. *See Sect. 5 for discussion.
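All results are reported as mean AUC over the ten unseen test classes. As a reminder of the metric, the per-class AUC equals the Mann-Whitney rank statistic and can be computed directly (a minimal sketch):

```python
import numpy as np

def auc(scores, labels):
    """Area under the ROC curve via the rank statistic (sketch).

    scores : classifier scores for a set of test images
    labels : 1 for images of the class under test, 0 otherwise
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    # Fraction of (positive, negative) pairs ranked correctly; ties count 1/2.
    correct = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (correct + 0.5 * ties) / (len(pos) * len(neg))
```

The mean AUC in Table 1 is then simply the average of this quantity over the ten test classes.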
proximity in WordNet's hypernym hierarchy: object classes and attributes are inherently different in nature and tend to lie in different subtrees of the hypernym hierarchy. For a deeper understanding we examine the mined class-attribute associations which are the basis for the knowledge transfer. Judging whether the associations are meaningful is not always easy; nevertheless we provide examples for the visual attribute striped to give an impression of the quality of the mined similarities. In the following we list the four top-ranked mammal classes for striped in decreasing order: Manual: zebra, tiger, skunk, raccoon; WordNet: elephant, seal, mouse, bat; Wikipedia: zebra, skunk, tiger, Chihuahua; Yahoo Web: zebra, collie, Dalmatian, polar bear; Yahoo Img: zebra, skunk, tiger, Persian cat; Flickr Img: skunk, tiger, zebra, leopard. Overall we found that mined associations (at best 71.0%) perform considerably worse (by 7.5%) than manually defined associations (78.5%). Although we acknowledge this drop as significant, we want to emphasize that the information given to the human judges was more descriptive than the simplified terms used for querying the language resources, e.g. nest abbreviates the attribute keeping their young in a designated, enclosed area. In connection with the significant reduction in manual supervision we consider the results a promising contribution to visual knowledge transfer.

2. Mined attributes (and associations). In contrast to the manually defined AwA attributes used in the first experiment, we use the mined (part) attributes which do not require any supervision (Table 1, 2nd row). Best is Yahoo Holonyms with a mean AUC of 69.9%, reaching performance close to manual attributes. This variant of Yahoo Web is specifically targeted to part-whole relations by searching for specific patterns, which reduces incidental co-occurrences compared to Yahoo Web. Next are Wikipedia, Yahoo Img, and Flickr Img (66.0%, 65.2%, 64.6%).
Last are, with a significant drop, WordNet (59.8%) and Yahoo Web (57.4%). Disregarding Yahoo Holonyms, all language resources drop in performance compared to the AwA attributes in the first experiment, but the relative performance is in line with our previous observations. We explain this drop by the smaller number of attributes (74 instead of 85) and the decreased diversity of attributes (only part attributes).

3. Objectness as attributes. In our third experiment we use all 50 class names as attributes (Table 1, 3rd row). Yahoo Img is again best (74.1%), but in contrast to the first two experiments, WordNet follows closely as second best (71.2%): as objectness
requires the SR measures to compare the similarity of objects rather than object-attribute associations, the WordNet hierarchy provides an adequate model for similarity. Next are Yahoo Web (66.7%) and Wikipedia (66.4%). Although the average performance of Flickr Img is slightly above these (68.5%), one test class was not associated with any attributes, resulting in chance-level performance for this class. We explain this by insufficient statistics in the user-provided text for co-occurring object class terms.

4. Direct similarity. Our fourth experiment evaluates our second, direct model (Sect. 3.2), which uses the five most similar training classes. Disregarding WordNet (73.4%), all language resources perform very similarly and sometimes even on par with manually defined object class-attribute associations (Table 1, 4th row: Yahoo Img 78.8%, Flickr Img 77.8%, Yahoo Web 77.7%, Wikipedia 76.6%). This can be explained by the observation that the five most similar classes for a given class are very similar among the language resources and quite reliable. The direct similarity model additionally eliminates the need for an attribute layer and thus uses appropriate training data.

Images of known training classes in the test set. In all experiments so far the training and test class sets have been disjoint, as suggested by [1]. This means that a zero-shot classifier under test never has to distinguish known classes from the unseen ones. We expect this to be more difficult as it requires classifying as negatives those classes that were used as positives during training. To test this effect we add all images not used for training (due to down-sampling) to the test set as negatives and report results for the best knowledge base for each of the above experiments. The performance drops for objectness (74.1% to 67.7%) and direct similarity (78.8% to 76.0%) with added negatives.
In contrast to this, the performance is stable for manual associations (78.5% to 78.9%) and increases for mined associations on AwA attributes (71.0% to 73.2%) and mined attributes (69.9% to 70.7%). We attribute this to the more general character of the AwA and mined attributes: while the inter-object class relatedness based approaches (objectness and direct similarity) group very similar classes (Fig. 3), the object-attribute relatedness approaches tend to generalize over more diverse classes (Fig. 1(a)), which reduces the influence of specific (positive) training classes as negatives during testing.
Conclusion

Reducing supervision is vital for enabling knowledge transfer for a large number of classes. In this work we propose several approaches to fully replace manual intervention by tapping into linguistic knowledge bases^1. In particular, overall best performance is achieved by the hit-count based measure on Yahoo image search, which outperforms most other measures for the attribute-, objectness-, and direct-similarity based approaches and reaches a performance on par with manually defined associations for the direct-similarity based approach. Due to its smaller coverage, Flickr Img is always slightly inferior. While Wikipedia performs similarly to image search, Yahoo Web and WordNet are especially inferior for attribute-based associations. As part of future work we plan to apply our findings to datasets with thousands of classes where manual supervision is infeasible.
References
1. Lampert, C.H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. In: CVPR (2009)
2. Stark, M., Goesele, M., Schiele, B.: A shape-based object class model for knowledge transfer. In: ICCV (2009)
3. Bart, E., Ullman, S.: Single-example learning of novel classes using representation by similarity. In: BMVC (2005)
4. Fink, M.: Object classification from a single example utilizing class relevance pseudo-metrics. In: NIPS (2004)
5. Fellbaum, C.: WordNet: An Electronic Lexical Database. The MIT Press (1998)
6. Rohrbach, M., Stark, M., Szarvas, G., Gurevych, I., Schiele, B.: What helps where - and why? Semantic relatedness for knowledge transfer. In: CVPR (2010)
7. Millet, C., Grefenstette, G., Bloch, I., Moëllic, P.A., Hède, P.: Automatically populating an image ontology and semantic color filtering. In: OntoImage (2006)
8. Ferrari, V., Zisserman, A.: Learning visual attributes. In: NIPS (2007)
9. Wang, G., Forsyth, D.: Joint learning of visual attributes, object classes and visual saliency. In: ICCV (2009)
10. Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: ICCV (2009)
11. Delezoide, B., Pitel, G., Borgne, H.L., Greffenstette, G., Moëllic, P.A., Millet, C.: Object/background scene classification in photographs using linguistic statistics from the web. In: OntoImage (2008)
12. Palatucci, M., Pomerleau, D., Hinton, G., Mitchell, T.: Zero-shot learning with semantic output codes. In: NIPS (2009)
13. Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.A.: Describing objects by their attributes. In: CVPR (2009)
14. Zweig, A., Weinshall, D.: Exploiting object hierarchy: Combining models from different category levels. In: ICCV (2007)
15. Marszałek, M., Schmid, C.: Semantic hierarchies for visual object recognition. In: CVPR (2007)
16. Popescu, A., Millet, C., Moëllic, P.A.: Ontology driven content based image retrieval. In: CIVR, ACM (2007)
17. Wang, H., Jiang, X., Chia, L.T., Tan, A.H.: Ontology enhanced web image retrieval: aided by wikipedia & spreading activation theory. In: MIR (2008)
18. Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. PAMI (2006)
19. Thrun, S.: Is learning the n-th thing any easier than learning the first? In: NIPS (1996)
20. Barnard, K., Yanai, K.: Mutual information of words and pictures. In: ITA (2006)
21. Boiy, E., Deschacht, K., Moens, M.F.: Learning visual entities and their visual attributes from text corpora. In: DEXA (2008)
22. Barnard, K., Duygulu, P., Forsyth, D.A., de Freitas, N., Blei, D.M., Jordan, M.I.: Matching words and pictures. JMLR (2003)
23. Li, L.J., Socher, R., Fei-Fei, L.: Towards total scene understanding: Classification, annotation and segmentation in an automatic framework. In: CVPR (2009) 2036-2043
24. Osherson, D.N., Stern, J., Wilkie, O., Stob, M., Smith, E.E.: Default probability. Cognitive Science (1991)
25. Kemp, C., Tenenbaum, J.B., Griffiths, T.L., Yamada, T., Ueda, N.: Learning systems of concepts with an infinite relational model. In: AAAI (2006)
26. Lin, D.: An information-theoretic definition of similarity. In: ICML (1998)
27. Zesch, T., Gurevych, I.: Wisdom of crowds versus wisdom of linguists - measuring the semantic relatedness of words. JNLE (2010)
28. Kilgarriff, A., Grefenstette, G.: Introduction to the special issue on the web as corpus. Computational Linguistics (2003)
29. Berland, M., Charniak, E.: Finding parts in very large corpora. In: ACL (1999)