Convolutional Patch Representations for Image Retrieval An unsupervised approach

1. The document presents an unsupervised approach using convolutional neural networks to generate patch-level descriptors for image retrieval.
2. It trains a convolutional kernel network on unlabeled image patches to learn feature representations in a kernel space without requiring manual labels.
3. Experiments show the convolutional kernel descriptors achieve similar or better performance than supervised convolutional neural networks on standard patch and image retrieval datasets while requiring less training time.

Convolutional Patch Representations for Image
Retrieval: an Unsupervised Approach
29th Mar 2016
Original slides by Eva Mohedano
Insight Centre for Data Analytics (Dublin City University)
Mattis Paulin, Julien Mairal, Matthijs Douze, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

Overview
Published at ICCV 2015 (a.k.a. "Local Convolutional Features with Unsupervised Training for Image Retrieval")
• Deep convolutional architecture to produce patch-level descriptors
• Unsupervised framework
• Comparison on patch and image retrieval datasets
• “RomePatches” dataset

Related Work
• Shallow patch descriptors
• Deep learning for image retrieval
• Deep patch descriptors

Related Work
• Shallow patch descriptors
SIFT – Scale-Invariant Feature Transform
- stereo matching
- retrieval
- classification
SURF, BRIEF, LIOP, (…)
Hand-crafted → relatively small number of parameters.
Note: a patch is a region extracted from an image.

Related Work
• Deep learning for image retrieval
A CNN trained on a sufficiently large labeled dataset (ImageNet) produces intermediate-layer activations that can be used as image descriptors.
Those descriptors work for a wide variety of tasks, including image retrieval.

Related Work
• Deep learning for image retrieval
source image: http://pubs.sciepub.com/ajme/2/7/9/
Fully connected layers → Global Image Descriptors
● Compact representation
● Lack of geometric invariance
Below state of the art in image retrieval
Compute at different scales
(Babenko, Razavian)

Related Work
• Deep learning for image retrieval
source image: http://pubs.sciepub.com/ajme/2/7/9/
Convolutional layers

Related Work
• Deep patch descriptors
Three different kinds of supervision:
1. Category labels of ImageNet. [Long et al, 2014]
2. Surrogate patch labels: each class is a given patch under different transformations. [Fischer et al, 2014]
3. Matching/non-matching pairs. [Simo-Serra et al, 2015]
These works focused on patch-level metrics, not image retrieval.
All approaches required some kind of supervision.

Image Retrieval Pipeline
• Interest point detection
Hessian-Affine detector.
Rotation invariance.
• Interest point description
Feature representation in a Euclidean space
• Patch Matching
VLAD encoding.
Power normalization with exponent 0.5 + L2-norm.
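
The encoding step above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the function name `vlad_encode` is hypothetical, and the centroids are assumed to come from a codebook (for example k-means) trained elsewhere. It sums the residuals of each local descriptor to its nearest centroid, then applies power normalization with exponent 0.5 followed by L2 normalization.

```python
import numpy as np

def vlad_encode(descriptors, centroids):
    """Aggregate local descriptors (n, d) into one VLAD vector of size k*d.

    `centroids` (k, d) is an assumed, pre-trained codebook.
    """
    # Squared distance from every descriptor to every centroid: shape (n, k).
    d2 = ((descriptors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)

    k, d = centroids.shape
    vlad = np.zeros((k, d))
    for i in range(k):
        members = descriptors[nearest == i]
        if len(members):
            # Sum of residuals to the assigned centroid.
            vlad[i] = (members - centroids[i]).sum(axis=0)

    vlad = vlad.ravel()
    # Power normalization with exponent 0.5 (signed square root).
    vlad = np.sign(vlad) * np.abs(vlad) ** 0.5
    # L2 normalization.
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad
```

Two images can then be compared with a dot product of their VLAD vectors, since both are unit-norm.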

Convolutional Descriptors
Patch size = 51x51 – Optimal for SIFT on Oxford dataset.
CNN extended to retrieval by:
• Encoding local descriptors with model trained with an unrelated
classification task
• Devising a surrogate classification problem that is as related as
possible to image retrieval:
• Using unsupervised learning: Convolutional Kernel Network

Convolutional Descriptors
• Using unsupervised learning: Convolutional Kernel Network
Feature representation based on a kernel (feature) map -- data independent

Convolutional Descriptors
• Using unsupervised learning: Convolutional Kernel Network
Projection in Hilbert space
Explicit kernel map can be computed to approximate it for computational efficiency.
- Sub-sample of patches
- Stochastic Gradient Optimization
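
To make the idea of an explicit kernel map concrete, here is a small sketch under stated assumptions. It assumes a Gaussian kernel and a finite map of the form psi(x)_l = sqrt(eta_l) * exp(-||x - w_l||^2 / sigma^2), so that the dot product psi(x)·psi(y) approximates exp(-||x - y||^2 / (2 sigma^2)). In the slides the parameters are learned on a sub-sample of patches with stochastic gradient optimization; that step is swapped here for a fixed 1-D grid of `W` with uniform weights `eta`, which is only meant to show that such a finite map can reproduce the kernel. The names `explicit_map`, `W` and `eta` are illustrative.

```python
import numpy as np

def explicit_map(x, W, eta, sigma):
    """Finite-dimensional map: psi(x)_l = sqrt(eta_l) * exp(-||x - w_l||^2 / sigma^2)."""
    x = np.atleast_1d(x).astype(float)
    sq_dist = ((W - x) ** 2).sum(axis=1)
    return np.sqrt(eta) * np.exp(-sq_dist / sigma ** 2)

def gaussian_kernel(x, y, sigma):
    """Exact Gaussian kernel exp(-||x - y||^2 / (2 sigma^2))."""
    diff = np.atleast_1d(x).astype(float) - np.atleast_1d(y).astype(float)
    return np.exp(-(diff ** 2).sum() / (2 * sigma ** 2))

# Assumption: a dense 1-D grid stands in for the learned parameters.
sigma = 1.0
step = 0.01
W = np.arange(-10.0, 10.0, step).reshape(-1, 1)
eta = np.full(len(W), np.sqrt(2.0 / (np.pi * sigma ** 2)) * step)

x, y = 0.3, -0.5
approx = explicit_map(x, W, eta, sigma) @ explicit_map(y, W, eta, sigma)
exact = gaussian_kernel(x, y, sigma)
print(approx, exact)
```

A grid is only workable in very low dimension; for real patches a much smaller set of parameters would have to be fitted to data, which is where the sub-sampling and stochastic optimization come in.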

Convolutional Descriptors
• Using unsupervised learning: Convolutional Kernel Network
4 possible inputs
From left to right: CKN-raw, CKN-mean subs, CKN-white (mean subs + PCA-whitening), CKN-grad
(fully invariant to color)
Only CKN-raw, CKN-white and CKN-grad are evaluated.
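
The mean subtraction and PCA-whitening used for the CKN-white input can be sketched as follows. This is a generic illustration rather than the paper's exact preprocessing: it assumes flattened sub-patches stacked as rows, centers each feature across the sample (the slides do not say whether the mean is taken per patch or per feature), and the function name `pca_whiten` and the `eps` regularizer are hypothetical.

```python
import numpy as np

def pca_whiten(X, eps=1e-8):
    """Center the rows of X (n, d) and PCA-whiten them.

    After whitening, the sample covariance is approximately the identity.
    """
    Xc = X - X.mean(axis=0)                   # mean subtraction (per feature)
    cov = Xc.T @ Xc / len(Xc)                 # sample covariance, shape (d, d)
    eigvals, eigvecs = np.linalg.eigh(cov)    # symmetric eigendecomposition
    eigvals = np.maximum(eigvals, 0.0)        # guard against tiny negative values
    # Rotate onto principal axes and rescale each axis to unit variance.
    return Xc @ (eigvecs / np.sqrt(eigvals + eps))
```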

Experiments
Datasets:
1. Rome Patches-Image
2. Oxford
3. UKbench and Holidays
CKN trained on 1M sub-patches for 300K iterations, with mini-batches of size 1000.

Conclusions
• CKNs offer similar and sometimes better performance than CNNs in the
context of patch description.
• Good patch retrieval translates into good image retrieval.
• CKNs are orders of magnitude faster to train than CNNs (10 min vs 2-3 days
on a modern GPU)
• Fully unsupervised – no labels.

Resources
RomePatches+Code (Although code is not accessible!)
Discriminative Unsupervised Feature Learning with Exemplar Convolutional
Neural Networks
- Code with augmentations in matlab
- Code for training models.
- Models already trained :-)
Triplet’s net + Code !!
- Greyscale local patches of 32x32. Tested on matching datasets


CompVis 101 - Computer Vision Bootcamp - GDGoC IBAHibaMallick1

Mnist report pptRaghunandanJairam

This document is an internship report submitted by Raghunandan J to Eckovation about a project on classifying handwritten digits using a convolutional neural network. It provides an introduction to convolutional neural networks and explains each layer of a CNN including the input, convolutional layer, pooling layer, and fully connected layer. It also gives examples of real-world applications that use artificial neural networks like Google Maps, Google Images, and voice assistants.

1 introduction.pptxmustafa sarac

This document summarizes a tutorial on large-scale visual recognition. It begins by outlining common visual recognition tasks like image retrieval and classification. It then describes several benchmark datasets used for these tasks and how they have increased in scale over time. The document discusses how approaches to collecting large classification datasets have evolved from relying on manual post-processing to using automated methods or crowdsourcing. It notes a convergence between techniques originally developed for image retrieval and classification. The goals of the tutorial are described as providing tools to handle large-scale datasets and showing this convergence between retrieval and classification approaches.

Introduction to computer visionMarcin Jedyk

This document provides an introduction to computer vision with convoluted neural networks. It discusses what computer vision aims to address, provides a brief overview of neural networks and their basic building blocks. It then covers the history and evolution of convolutional neural networks, how and why they work on digital images, their limitations, and applications like object detection. Examples are provided of early CNNs from the 1980s and 1990s and recent advancements through the 2010s that improved accuracy, including deeper networks, inception modules, residual connections, and efforts to increase performance like MobileNets. Training deep CNNs requires large datasets and may take weeks, but pre-trained networks can be fine-tuned for new tasks.

Computer Vision Bootcamp: First WorshopMohammedArbi

On-the-fly Visual Category Search in Web-scale Image CollectionsKen Chatfield

Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018Universitat Politècnica de Catalunya

conv_nets.pptxssuser80a05c

Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Universitat Politècnica de Catalunya

convnets.pptxMohamedAliHabib3

Computer vision for transportationWanjin Yu

Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang

Faire de la reconnaissance d'images avec le Deep Learning - Cristina & Pierre...Jedha Bootcamp

Overview of Convolutional Neural Networksananth

L7_finetuning on tamil technologies.pptxMeganath7

AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)UMBC

Cs231n 2017 lecture12 Visualizing and UnderstandingYanbin Kong

Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...Universitat Politècnica de Catalunya

Batik image retrieval using convolutional neural networkTELKOMNIKA JOURNAL

CompVis 101 - Computer Vision Bootcamp - GDGoC IBAHibaMallick1

Mnist report pptRaghunandanJairam

1 introduction.pptxmustafa sarac

Introduction to computer visionMarcin Jedyk

Computer Vision Bootcamp: First WorshopMohammedArbi

Convolutional Patch Representations for Image Retrieval An unsupervised approach

1. Convolutional Patch Representations for Image Retrieval: an Unsupervised Approach. 29th Mar 2016. Original slides by Eva Mohedano, Insight Centre for Data Analytics (Dublin City University). Paper authors: Mattis Paulin, Julien Mairal, Matthijs Douze, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

2. Overview. Published at ICCV 2015 (a.k.a. "Local Convolutional Features With Unsupervised Training for Image Retrieval"). Deep convolutional architecture to produce patch-level descriptors • Unsupervised framework • Comparison on patch and retrieval datasets • “RomePatches” dataset

3. Related Work • Shallow patch descriptors • Deep learning for image retrieval • Deep patch descriptors

4. Related Work • Shallow patch descriptors: SIFT (Scale-Invariant Feature Transform), used for stereo matching, retrieval, and classification; also SURF, BRIEF, LIOP, (…). Hand-crafted → relatively small number of parameters. Note: a patch is a region extracted from an image.

5. Related Work • Deep learning for image retrieval: a CNN trained on a sufficiently large labeled dataset (ImageNet) produces intermediate-layer activations that can be used as image descriptors. Those descriptors work for a wide variety of tasks, including image retrieval.

6. Related Work • Deep learning for image retrieval source image: http://pubs.sciepub.com/ajme/2/7/9/

7. Related Work • Deep learning for image retrieval source image: http://pubs.sciepub.com/ajme/2/7/9/ Fully connected layers → global image descriptors ● Compact representation ● Lack of geometric invariance. Below the state of the art in image retrieval. Descriptors can be computed at different scales (Babenko, Razavian).

8. Related Work • Deep learning for image retrieval source image: http://pubs.sciepub.com/ajme/2/7/9/ Convolutional layers

9. Related Work • Deep patch descriptors. Three different kinds of supervision: 1. Category labels of ImageNet [Long et al, 2014]. 2. Surrogate patch labels: each class is a given patch under different transformations [Fischer et al, 2014]. 3. Matching/non-matching pairs [Simo-Serra et al, 2015]. These works focused on patch-level metrics, not image retrieval. All approaches required some kind of supervision.

10. Image Retrieval Pipeline • Interest point detection Hessian-Affine detector. Rotation invariance. • Interest point description Feature representation in a Euclidean space • Patch Matching VLAD encoding. Power normalization with exponent 0.5 + L2-norm.

11. Image Retrieval Pipeline • Interest point detection Hessian-Affine detector. Rotation invariance. • Interest point description Feature representation in a Euclidean space • Patch Matching VLAD encoding. Power normalization with exponent 0.5 + L2-norm.
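The patch-matching step of the pipeline above can be sketched in a few lines. This is a minimal pure-Python illustration, not the authors' code: it assumes the codebook centroids are already available (in practice they would typically come from k-means over training descriptors), and the function name `vlad_encode` is hypothetical.

```python
import math


def vlad_encode(descriptors, centroids):
    """Aggregate local patch descriptors into a single VLAD vector.

    Assumption: `centroids` is a precomputed codebook (list of vectors).
    """
    k, d = len(centroids), len(centroids[0])
    residuals = [[0.0] * d for _ in range(k)]
    for desc in descriptors:
        # Assign the descriptor to its nearest centroid (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(desc, centroids[j])),
        )
        # Accumulate the residual (descriptor minus centroid) for that centroid.
        for i in range(d):
            residuals[nearest][i] += desc[i] - centroids[nearest][i]
    # Concatenate the per-centroid residual sums into one vector.
    vlad = [v for row in residuals for v in row]
    # Power normalization with exponent 0.5 (signed square root).
    vlad = [math.copysign(abs(v) ** 0.5, v) for v in vlad]
    # L2 normalization, so images can be compared with a dot product.
    norm = math.sqrt(sum(v * v for v in vlad))
    return [v / norm for v in vlad] if norm > 0 else vlad
```

Two images encoded this way can then be compared with a plain dot product of their VLAD vectors, which is what makes a descriptor living in a Euclidean space convenient for retrieval.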

12. Convolutional Descriptors. Patch size = 51x51, optimal for SIFT on the Oxford dataset. CNNs are extended to retrieval by: • Encoding local descriptors with a model trained on an unrelated classification task • Devising a surrogate classification problem that is as related as possible to image retrieval • Using unsupervised learning: Convolutional Kernel Network

13. Convolutional Descriptors • Using unsupervised learning: Convolutional Kernel Network. Feature representation based on a kernel (feature) map; data independent.

14. Convolutional Descriptors • Using unsupervised learning: Convolutional Kernel Network. Projection into a Hilbert space. An explicit kernel map can be computed to approximate it for computational efficiency: - Sub-sample of patches - Stochastic gradient optimization
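To make the idea of an explicit, finite-dimensional kernel map concrete, here is a toy one-dimensional sketch. It assumes the kernel is Gaussian and uses a fixed grid of anchor points with uniform weights; the CKN described on the slide instead learns its anchors and weights by stochastic gradient optimization on sub-sampled patches. All names and values (`SIGMA`, `ANCHORS`, `ETA`, `feature_map`) are hypothetical.

```python
import math

SIGMA = 1.0  # assumed kernel bandwidth (illustrative value)
STEP = 0.1   # spacing of the fixed anchor grid (illustrative value)
# Fixed anchors on [-8, 8]; a CKN would learn these from data instead.
ANCHORS = [i * STEP for i in range(-80, 81)]
# Uniform weight chosen so the finite sum approximates the integral form
# exp(-(x-y)^2 / (2 s^2)) = sqrt(2 / (pi s^2)) * Int exp(-(x-w)^2/s^2) exp(-(y-w)^2/s^2) dw
ETA = math.sqrt(2.0 / (math.pi * SIGMA ** 2)) * STEP


def gaussian_kernel(x, y, sigma=SIGMA):
    """Exact Gaussian kernel between two scalars."""
    return math.exp(-((x - y) ** 2) / (2.0 * sigma ** 2))


def feature_map(x, anchors=ANCHORS, eta=ETA, sigma=SIGMA):
    """Explicit finite-dimensional map whose dot product approximates the kernel."""
    return [math.sqrt(eta) * math.exp(-((x - w) ** 2) / sigma ** 2) for w in anchors]


def approx_kernel(x, y):
    """Dot product of the two explicit feature vectors."""
    return sum(a * b for a, b in zip(feature_map(x), feature_map(y)))
```

For inputs well inside the anchor range, `approx_kernel(x, y)` and `gaussian_kernel(x, y)` agree closely, which is the property that lets descriptors be compared with ordinary dot products instead of kernel evaluations.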

15. Convolutional Descriptors • Using unsupervised learning: Convolutional Kernel Network. 4 possible inputs, from left to right: CKN-raw, CKN-mean subs (mean subtraction), CKN-white (mean subtraction + PCA whitening), CKN-grad (fully invariant to color). Only CKN-raw, CKN-white and CKN-grad are evaluated.

16. Experiments. Datasets: 1. RomePatches-Image 2. Oxford 3. UKbench and Holidays. CKN trained on 1M sub-patches for 300K iterations with a mini-batch size of 1000.

17. Experiments

18. Conclusions • CKNs offer performance similar to, and sometimes better than, CNNs in the context of patch description. • Good patch retrieval translates into good image retrieval. • CKNs are orders of magnitude faster to train than CNNs (10 min vs 2-3 days on a modern GPU). • Fully unsupervised: no labels.

19. Resources • RomePatches + code (although the code is not accessible!) • Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks - Code with augmentations in MATLAB - Code for training models - Models already trained :-) • Triplet net + code - Greyscale local patches of 32x32, tested on matching datasets
