On Efficient Bayesian Scene Interpretation

Jahangiri, Ehsan

On Efficient Bayesian Scene Interpretation

dc.contributor.advisor	Hermansky, Hynek
dc.contributor.committeeMember	Geman, Donald J.
dc.contributor.committeeMember	Younes, Laurent
dc.contributor.committeeMember	Yuille, Alan L.
dc.contributor.committeeMember	Tran, Trac Duy
dc.creator	Jahangiri, Ehsan
dc.creator.orcid	0000-0001-6208-5474
dc.date.accessioned	2018-10-03T02:53:41Z
dc.date.available	2018-10-03T02:53:41Z
dc.date.created	2016-05
dc.date.issued	2016-02-05
dc.date.submitted	May 2016
dc.date.updated	2018-10-03T02:53:41Z
dc.description.abstract	Scene understanding, including object recognition, is perhaps the most challenging task in computer vision. Deep convolutional neural networks (CNNs) have received a flurry of interest in the past few years due to their superior performance. However, deep networks are computationally expensive and without efficient implementation on high performance computing systems not as practical as older methods. Furthermore, CNNs do not benefit from the human's visual selective attention and top-down contextual feedback connections. The human visual system makes extensive use of contextual information to facilitate and refine object detections; object detection and recognition based only on intrinsic features of target objects is not usually sufficient for reliable inference. In this thesis, we use a model-based approach to incorporate top-down contextual information, and analyze scenes in a coarse-to-fine fashion inspired by the visual selective attention property. In addition to disambiguating object detection, the space of objects and their poses can be searched more efficiently by taking advantage of the contextual relations between different scene entities. We present a new approach to efficiently search the space of objects and their poses using a Bayesian method called ``Entropy Pursuit'', where contextual relations between object instances and other scene entities are incorporated via a prior model. Using the entropy pursuit approach we collect bits of information about the scene sequentially by greedily selecting patches whose analysis provide the most informative in an information-theoretic sense. As proof of concept we use the entropy pursuit method for multi-category object recognition in table-setting scenes. We have investigated the possibility of generating a scene interpretation by processing only a fraction of patches from an input image. Our results confirm the hypothesis that we can identify an accurate interpretation by processing only a fraction of patches if the right patches are selected in the right order. We can save computation time by processing only a fraction of patches.
dc.format.mimetype	application/pdf
dc.identifier.uri	http://jhir.library.jhu.edu/handle/1774.2/59373
dc.language	en
dc.publisher	Johns Hopkins University
dc.publisher.country	USA
dc.subject	Scene Interpretation, Object Detection, Convolutional Neural Networks, Statistical Inference, Stochastic Approximation, MCMC.
dc.title	On Efficient Bayesian Scene Interpretation
dc.type	Thesis
dc.type.material	text
thesis.degree.department	Electrical and Computer Engineering
thesis.degree.discipline	Applied Mathematics & Statistics
thesis.degree.grantor	Johns Hopkins University
thesis.degree.grantor	Whiting School of Engineering
thesis.degree.level	Doctoral
thesis.degree.name	Ph.D.

Files

Original bundle

Now showing 1 - 1 of 1

Name:: JAHANGIRI-DISSERTATION-2016.pdf
Size:: 26.3 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: LICENSE.txt
Size:: 2.68 KB
Format:: Plain Text
Description:

Download

Collections

ETD -- Doctoral Dissertations