CS6501: Deep Learning for Visual Recognition. CNN Architectures

Similar documents
Development of intelligent systems (RInS) Object recognition with Convolutional Neural Networks

[Boston March for Science 2017 photo Hendrik Strobelt]

Week 42: Siamese Network: Architecture and Applications in Visual Object Tracking. Yuanwei Wu

Where Is My Puppy? Retrieving Lost Dogs by Facial Features

Cats and Dogs. Omkar M Parkhi 1,2 Andrea Vedaldi 1 Andrew Zisserman 1 C. V. Jawahar 2. Abstract. 1. Introduction

IMAGE CAPTIONING USING PHRASE-BASED HIERARCHICAL LSTM MODEL

The Kaggle Competitions: An Introduction to CAMCOS Fall 2015

Multiclass and Multi-label Classification

Recurrent neural network grammars. Slide credits: Chris Dyer, Adhiguna Kuncoro


PIGEON DISCRIMINATION OF PAINTINGS 1

Moving toward formalisation COMP62342

Nathan A. Thompson, Ph.D. Adjunct Faculty, University of Cincinnati Vice President, Assessment Systems Corporation

Available online at ScienceDirect. Procedia Computer Science 102 (2016 )

STUDY BEHAVIOR OF CERTAIN PARAMETERS AFFECTING ASSESSMENT OF THE QUALITY OF QUAIL EGGS BY COMPUTER VISION SYSTEM

Moving towards formalisation COMP62342

The integration of dogs into collaborative humanrobot. - An applied ethological approach - PhD Thesis. Linda Gerencsér Supervisor: Ádám Miklósi

Boosting Biomedical Entity Extraction by Using Syntactic Patterns for Semantic Relation Discovery

Reversing Category Exclusivities in Infant Perceptual Categorization: Simulations and Data

The River of Life Lower Key Stage 2

Genera&on of Image Descrip&ons. Tambet Ma&isen

CAPABILITIES AND RESTRICTIONS OF ORTHOPHOTO PROCUCTION SYSTEMS FOR TERRESTRIAL ARCHAEOLOGICAL SURVEYS

Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Semantically-driven Automatic Creation of Training Sets for Object Recognition

In the first half of the 20th century, Dr. Guido Fanconi published detailed clinical descriptions of several heritable human diseases.

We recommend you cite the published version. The publisher s URL is

Answers to Questions about Smarter Balanced 2017 Test Results. March 27, 2018

Econometric Analysis Dr. Sobel

DACHSHUND BREED COUNCIL Judging List Application Form

Course: Canine Massage and Bodywork Certification Course Part A Cranial Trunk and Thoracic Appendicular System. Movers of the Forelimb, Neck, and Head

6. 1 Leaping Lizards!

Package PetfindeR. R topics documented: May 22, Type Package Title 'Petfinder' API Wrapper Version Author Aaron Schlegel

DICOM Correction Proposal

Modeling and Control of Trawl Systems

Data Mining Classification: Alternative Techniques. Lecture Notes for Chapter 4. Rule-Based. Introduction to Data Mining, 2 nd Edition

Junior Sheep. Market Lamb. Premium Schedule

TOURISM A TOOL FOR CONSERVATION- A PARTNERSHIP APPROACH. Empowering the Community

COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

Muppet Genetics Lab. Due: Introduction

CS108L Computer Science for All Module 7: Algorithms

Body Parts and Products (Sessions I and II) BROWARD COUNTY ELEMENTARY SCIENCE BENCHMARK PLAN

Content Delivery Network Interconnection (CDNI) Request Routing: CDNI Footprint and Capabilities Advertisement using ALTO

Machine Learning.! A completely different way to have an. agent acquire the appropriate abilities to solve a particular goal is via machine learning.

Millie. Millie is an American Staffordshire Terrier, German Shepherd Dog, Weimaraner Mix. Millie. Dog's name: DR. NEALE FRETWELL.

Figure 1 Background Information to the phylum Arthropoda and appears to not have changed for

Multilevel Script. Teacher s Guide. Animals, Animals. Level E Level H Level K. Levels: E, H, and K Word Count: 460. Story Summary: Cast of Characters:

Long, blue, spiky-edged shadows crept out across the snowfields, while a rosy glow, at first scarce discernible, gradually deepened and suffused

Dynamic Programming for Linear Time Incremental Parsing

Pierre-Louis Toutain, Ecole Nationale Vétérinaire National veterinary School of Toulouse, France Wuhan 12/10/2015

Biology 164 Laboratory

Lecture 4: Controllability and observability

1. How many dogs were stolen in the area covered by your force during the following time periods.

Max WHAT BREEDS MAKE UP MAX? German Shepherd Dog Mix crossed with Cocker Spaniel / Maltese Cross

National Wildlife Disease Surveillance Systems: an European perspective

Site Selection and Environmental Assessment for Terrestrial Invertebrates, Amphibians and Reptiles

MODELING THE CAUSES OF LEG DISORDERS IN FINISHER HERDS

What is Classification?

Your web browser (Safari 7) is out of date. For more security, comfort and the best experience on this site: Update your browser Ignore

Bermuda Domestic Animal Registry Counts of Live Dogs and Cats, in Bermuda, by Breed

Application of Fuzzy Logic in Automated Cow Status Monitoring

Why individually weigh broilers from days onwards?

OPPORTUNITIES FOR GENETIC IMPROVEMENT OF DAIRY SHEEP IN NORTH AMERICA. David L. Thomas


A survey of spatial distribution and population size of feral cat colonies in RI Summary of Findings

Noise reduction and normalization of microblogging messages

ì<(sk$m)=bddfib< +^-Ä-U-Ä-U

Winter Adaptations. Why do animals need shelters, especially during the winter? WORD BANK: Woodchuck Grey Squirrel

Ancestry Report. Lotje. W hat b re eds make u p Lotj e? Mixed breed Ancestor. See next page for more details...

Happy hens. Teacher guidance - 1. Introduction. Project overview

BRITISH SHORTHAIR GROUP COMMITTEE

The Amazingly Adapted Arctic Fox By Guy Belleranti

The genetic factors under consideration in the present study include black (+) vs. red (y), a sex-linked pair of alternatives manifesting

5 Killer Dog Training Mindmaps to Help You Effectively Train Your Dog in 30 Days

Adaptations: Changes Through Time

Food & Allied. Poultry Industry. Industry Profile Industry Structure Industry Performance Regulatory Structure Key Challenges

Welcome to Texas. What is this? 2018 American Society of Health-System Pharmacists Page 1 of 13

North Star K9 Training Association. Trailing Certification Testing

MAIL ORDER HATCHERIES: OPERATIONAL AND DISTRIBUTION LOGISTICS, SALMONELLA INTERVENTION ACTIVITIES AIMED AT PREVENTION OF HUMAN SALMONELLOSIS

Table of contents. DNA Dog food

European public MRL assessment report (EPMAR)

Visual Communication in Science

Chapter 18: Categorical data

Writing Simple Procedures Drawing a Pentagon Copying a Procedure Commanding PenUp and PenDown Drawing a Broken Line...

DICOM Correction Proposal

Australian dogs trained to sniff out endangered species

Subdomain Entry Vocabulary Modules Evaluation

Overview of Findings. Slide 1

Flatworms Flatworms Platyhelminthes dorsoventrally free-living planarian parasitic fluke tapeworm label three body layers ectoderm mesoderm

NUECES COUNTY JUNIOR LIVESTOCK SHOW LAMB DIVISION. Market Lambs. Kenneth McElroy...Superintendent Kevin Kaplan...Assistant Superintendent

Note: The following article is used with permission of Dr. Sonia Altizer.

EARLINET validation of CATS L2 product

North Star K9 Training Association Trailing Urban Certification Testing

RMS Empress Of Ireland: Pride Of The Canadian Pacific's Atlantic Fleet By Derek Grout

Functions Introduction to Functions 7.2 One-to-One, Onto, Inverse functions. mjarrar Watch this lecture and download the slides

NUECES COUNTY JUNIOR LIVESTOCK SHOW LAMB DIVISION. Market Lambs. Tim Bowen...Superintendent Kenneth McElroy...Assistant Superintendent

September Population analysis of the French Bulldog breed

Purpose and focus of the module: Poultry Definition Domestication Classification. Basic Anatomy & Physiology

Distance and the presentation of visual stimuli to birds

Please include the dog breed and whether the dog was recovered for each case.

Compliance Can Be Ruff A Dog s Approach

Transcription:

CS6501: Deep Learning for Visual Recognition CNN Architectures

ILSVRC: ImagenetLarge Scale Visual Recognition Challenge [Russakovsky et al 2014]

The Problem: Classification Classify an image into 1000 possible classes: e.g. Abyssinian cat, Bulldog, French Terrier, Cormorant, Chickadee, red fox, banjo, barbell, hourglass, knot, maze, viaduct, etc. cat, tabby cat (0.71) Egyptian cat (0.22) red fox (0.11)..

The Data: ILSVRC Imagenet Large Scale Visual Recognition Challenge (ILSVRC): Annual Competition 1000 Categories ~1000 training images per Category ~1 million images in total for training ~50k images for validation Only images released for the test set but no annotations, evaluation is performed centrally by the organizers (max 2 per week)

The Evaluation Metric: Top K-error Top-1 error: 1.0 Top-1 accuracy: 0.0 True label: Abyssinian cat Top-2 error: 1.0 Top-2 accuracy: 0.0 Top-3 error: 1.0 Top-3 accuracy: 0.0 Top-4 error: 0.0 Top-4 accuracy: 1.0 Top-5 error: 0.0 Top-5 accuracy: 1.0 cat, tabby cat (0.61) Egyptian cat (0.22) red fox (0.11) Abyssinian cat (0.10) French terrier (0.03)..

Top-5 error on this competition (2012)

Alexnet (Krizhevsky et al NIPS 2012)

Alexnet https://www.saagie.com/fr/blog/object-detection-part1

Pytorch Code for Alexnet In-class analysis https://github.com/pytorch/vision/blob/master/torchvision/models/alexnet.py

Dropout Layer model.train() model.eval() Srivastava et al 2014

Preprocessing and Data Augmentation

Preprocessing and Data Augmentation 256 256

Preprocessing and Data Augmentation 224x224

Preprocessing and Data Augmentation 224x224

True label: Abyssinian cat

Some Important Aspects Using ReLUs instead of Sigmoid or Tanh Momentum + Weight Decay Dropout (Randomly sets Unit outputs to zero during training) GPU Computation!

What is happening? https://www.saagie.com/fr/blog/object-detection-part1

SIFT + FV + SVM (or softmax) Feature extraction (SIFT) Feature encoding (Fisher vectors) Classification (SVM or softmax) Deep Learning Convolutional Network (includes both feature extraction and classifier)

VGG Network Top-5: https://github.com/pytorch/vision/blob/master/torchvision/models/vgg.py Simonyan and Zisserman, 2014. https://arxiv.org/pdf/1409.1556.pdf

BatchNormalization Layer https://arxiv.org/abs/1502.03167

GoogLeNet https://github.com/kuangliu/pytorch-cifar/blob/master/models/googlenet.py Szegedy et al. 2014 https://www.cs.unc.edu/~wliu/papers/googlenet.pdf

Further Refinements Inception v3, e.g. GoogLeNet (Inceptionv1) Inception v3

ResNet (He et al CVPR 2016) Sorry, does not fit in slide. http://felixlaumon.github.io/assets/kaggle-right-whale/resnet.png https://github.com/pytorch/vision/blob/master/torchvision/models/resnet.py

Slide by Mohammad Rastegari

https://arxiv.org/pdf/1608.06993.pdf

https://arxiv.org/pdf/1608.06993.pdf

Object Detection deer cat

Object Detection as Classification CNN deer? cat? background?

Object Detection as Classification CNN deer? cat? background?

Object Detection as Classification CNN deer? cat? background?

Object Detection as Classification with Sliding Window CNN deer? cat? background?

Object Detection as Classification with Box Proposals

Box Proposal Method SS: Selective Search Segmentation As Selective Search for Object Recognition. van de Sande et al. ICCV 2011

RCNN https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf Rich feature hierarchies for accurate object detection and semantic segmentation. Girshicket al. CVPR 2014.

Questions? 36