navigation bar
 
Center for Neuromorphic Systems Engineering
Research: Learning and Algorithms
Click on full report to go to detailed report; click on author name to go to home page (or email).
 

Optimization and Generalization in Boosting
Ling Li, Yaser Abu-Mostafa

Abstract. The superior out-of-sample performance of AdaBoost has been attributed to the fact that it minimizes a cost function based on margin. In order to examine how the cost function, in and of itself, affects the out-of-sample performance, we apply several more sophisticated optimization techniques directly to the cost function. When the AdaBoost exponential cost function is optimized, our methods generally yield much lower cost and training error but higher test error, which implies that the exponential cost is vulnerable to overfitting. With the optimization power gained, we can adopt more "regularized" cost functions that have better out-of-sample performance but are difficult to optimize. Our experiments demonstrate that with suitable cost functions, our methods can have better out-of-sample performance. (full report)


Monotonic Bernoulli Trials
Amrit Pratap, Yaser Abu-Mostafa, Pietro Perona

Abstract. When estimating a number of bernoulli variables which have a certain monotonicity constraint, if the number of samples for each variable is small, then the estimates will not satisfy the monotonicity constraint. Better performance is achieved by endorcing the monotonicity constraint on the estimation procedure. (full report)


Path-Planning for Feature-Recognition and Classification using Information Theoretic Methods
Tim Chung, Joel Burdick, Richard Murray

Abstract.This project investigates the role of information-theoretic techniques in cooperative multi-agent systems. These techniques are used to govern the path planning of agents to optimally classify features of interest by improving the quality of the measurements. Sensor measurements are assumed to be in the presence of noise. We consider issues associated with distributed systems such as sensor fusion of information and formation control of relative vehicle locations. The objective is to articulate the theory underlying the relationship between sensing tasks and cooperative control. (full report)


Networks, Evolution, Science & Neural Systems
Alex Bäcker

Abstract. Recent times have seen the advent of large amounts of data on networks of diverse kinds, from the WWW and citation networks to protein and gene expression networks. Part of my work has been aimed at extracting insight out of these massive collections of data. We show, for example, that recent years have seen an expansion in the memory of science and a homogenization of citation distributions. In parallel, I have been developing mathematical methods to extract information from multi-neuron recordings of brain activity. More generally, I am addressing a variety of open questions at the interface of biology, math and computation. (full report)


Fast Bayesian Support Vector Machine Parameter Tuning with the Nystrom Method
Carl Gold, Alex Holub

Abstract. We experiment with speeding up a Bayesian method for tuning the hyperparameters of a Support Vector Machine (SVM) classifier. The Bayesian approach gives the gradients of the evidence as averages over the posterior, which can be approximated using Hybrid Monte Carlo simulation (HMC). By using the Nystrom approximation to the SVM kernel, our method significantly reduces the dimensionality of the space to be simulated in the HMC. We show that this speeds up the running time of the HMC simulation from O(n^2) (with a large prefactor) to effectively O(n), where n is the number of training samples. We conclude that the Nystrom approximation has an almost insignificant effect on the performance of the algorithm when compared to the full Bayesian method, and gives excellent performance in comparison with other approaches to hyperparameter tuning.


Computational Modeling of Feature Inheritance
Whee Ky (Wei Ji) Ma

The proposition that reentrant interactions into the early visual system are necessary for visual awareness has lately been under close scrutiny. We examine this proposition in the context of a neuronal model which explains the phenomenology of feature inheritance. (full report)


Suppressive Effect of Sustained Low-Contrast Adaptation followed by Transient High-Contrast on Peripheral Target Detection
Farshad Moradi, Shinsuke Shimojo, Christof Koch

Filling-in can be induced by high-contrast edge adaptation, or after prolonged adaptation to a peripheral low-contrast object (Troxler fading). Adaptation to sustained low-contrast vs. adaptation to transient high-contrast suggests synergy between contrast and edge adaptation, but the possible interactions are not well understood. We observed that briefly increasing the contrast of a peripheral low-contrast object after a few seconds of strict fixation elicits disappearance of the object, resulting in perceptual filling-in of the location with the surround (Figure 1a). After a short time usually around one second the object reappears. Hence, following sustained adaptation to a low-contrast target, transient high-contrast stimulation can induce perceptual disappearance. (full report)


Computational Modeling of Visual Attention Systems
Robert J. Peters, Asha Iyer, Christof Koch
Nathan Mundhenk, Laurent Itti

Abstract. We have continued to extend our biological model of bottom-up visual attention with several recently characterized retinal and cortical interactions that are known to govern human performance in certain visual tasks. We are testing the behavioral importance of these interactions by comparing our model's predictions against human eye movement data recorded with our infrared eyetracker. In the last year we have worked with three new model components: (1) short-range orientation interactions (for clutter reduction), (2) long-range orientation interactions (for contour facilitation), and (3) retinal filtering (for fovea vs. periphery effects). (full report)



Attentional Selection for Learning and Recognition of Objects in Cluttered Scenes
Ueli Rutishauser, Dirk Walther, Christof Koch, and Pietro Perona

The problem of serial processing of highly complex visual stimuli containing multiple objects is not only faced by humans and other primates, but also by machine vision systems. Advanced object recognition algorithms are capable of achieving very good recognition performance with objects learned from a single image (one-shot learning). These algorithms perform well as long as they are trained on images in which a major part of the image is occupied by the object to be learned and recognized. As soon as major parts of an image are occupied by clutter it becomes impossible to learn from such images without manual pre-labeling. These approaches are thus not suitable in an unsupervised environment, as they would mainly learn background clutter instead of the actual objects. (full report)


Automated Event Detection in Underwater Video
Dirk Walther, Duane Edgington, Karen A. Salamy, Michael Risi, R. E. Sherlock, and Christof Koch

Remotely operated underwater vehicles (ROVs) become increasingly important as a tool for obtaining quantitative data on the distribution and abundance of oceanic animals. Using video cameras, it is possible to make quantitative video transects (QVT) through the water, providing high-resolution data at the scale of the individual animals and their natural aggregation patterns. The current manual method of analyzing QVT video by trained scientists is very labor intensive and poses a serious limitation to the amount of data that can be obtained from ROV dives. (full report)


Attentional Selection for Object Recognition in Visual Cortex
Dirk Walther, Laurent Itti, Maximilian Riesenhuber, Tomaso Poggio, Christof Koch

Most models of object recognition assume the isolated occurrence of objects in the field of view. However, in our everyday experience we are usually confronted with scenes that are cluttered with a variety of objects – some relevant for our actions, some not. Our brain’s response to this overwhelming flood of visual information is serializing the processing of the objects by mechanisms of visual attention. Attentional selection of objects is often modeled using all-or-nothing switching of neuronal connection pathways from the attended region of the retinal input to the recognition units. However, there is little physiological evidence for such all-or-none modulation in early areas. We have developed a combined model for spatial attention and object recognition in which the recognition system monitors the entire visual field, but attentional modulation by as little as 20% at a high level is sufficient to recognize multiple objects. (full report)


Evolutionary Design Synthesis – From Sensors to Controllers
Yizhen Zhang, Alcherio Martinoli, Erik Antonsson
Collaborators: Jonathan Litt, Edmond Wong (NASA Glenn Center

Abstract. In this project, an automated engineering design synthesis methodology based on evolutionary methodology is being explored, with special interest on design and optimization of distributed embodied systems. Two case studies have been considered so far; the first one concerns the design of a collective sensory system for traffic monitoring purposes, while the second one deals with the development of neural-based robot controllers for turbine blades inspection. It has been shown that the evolutionary methodology is able to address the engineering design challenges present in the case studies as well as other complex design problems, and synthesize novel design solutions of good quality. Moreover, the fitness function can be formulated as an aggregation of fuzzy design preferences with different weights and trade-off strategies leading to an automatic generation of the complete Pareto-optimal frontier.. (full report)


Spike Based Saliency Detection
Ulrik Beierholm, Pietro Perona

Abstract. Trying to quickly ascertain which parts of a visual scene is most relevant for a recognition task and then focusing on each of these areas, is an economical use of processing power known to be employed in the human visual system. Most models for saliency detection however are too slow to explain the performance of the biological system. We are currently working on implementing a fast neuronal spike based saliency detector model based on rank order coding. (full report)


Decomposition of Human Motion into Dynamics Based
Primitives with Application to Drawing Tasks

Domatilla Del Vecchio, Richard Murray, Pietro Perona

Abstract. Using tools from dynamical systems and systems identification we develop a framework for the study of primitives for human motion, which we refer to as movemes. The objective is understanding human motion by decomposing it into a sequence of elementary building blocks that belong to a known alphabet of dynamical systems. We develop a segmentation and classification algorithm in order to reduce a complex activity into the sequence of movemes that have generated it. We test our ideas on data sampled from five human subjects who were drawing figures using a computer mouse. Our experiments show that we are able to distinguish between movemes and recognize them even when they take place in activities containing an unspecified number of movemes. (full report)


Human Motion Detection and Classification
Claudio Fanti, Pietro Perona

Abstract. We foresee a future in which machines autonomously interact with Humans in the surrounding environment. So far, very good results have been achieved in detecting the presence of Humans and labeling their body parts by means of graphical-models based algorithms. We unavoidably have to deal with uncertainty and reasoning in absence of complete information. To that extent, we explore and enhance the state of the art in probabilistic inference and sampling techniques having the machines understanding human actions as a primary application. (full report)


Dynamic Recurrent Neural Networks for Pattern Recognition
Alex Holub, Gilles Laurent, Pietro Perona

We are investigating the computational properties of recurrent neural networks of binary artificial neurons. Our investigations are guided by recent work performed in the laboratory of Gilles Laurent which involves elucidating the underlying processing mechanisms in early olfactory processing. These physiological investigations indicate that the initial olfactory processing layer (in the locust the antennal lobe) consists of a dynamic recurrent neural network of excitatory and inhibitory units. The presentation of stimuli to the network results in stereotyped spatio-temporal neural firing patterns, with each unique stimulus presentation invoking a unique temporally-varying pattern of activity within the population of neurons. We have approximated the biological networks using recurrent networks with discrete binary neural elements. These non-linear networks exhibit chaotic behavior such that similar input patterns obtain very dissimilar network representations through the network dynamics. Similar pattern spreading characteristics have been observed in the initial processing networks of fish by members of the Laurent laboratory and it has been hypothesized that pattern spreading may be one computational benefit which the initial processing layer provides. (full report)


Rapid Natural Scene Categorization without Attention
Fei Fei Li, Rufin VanRullen, Christof Koch, Pietro Perona

Abstract. What can we see when we do not pay attention? While attention is not necessary for some detection tasks on simple synthetic stimuli, without it we are “blind” even to major aspects of a natural complex scene. It would thus appear that only visual tasks that have an explanation in the early stages of the visual system may be carried out without attention. We report on a complex visual task that requires no attention. Our subjects can rapidly detect animals in briefly presented natural scenes while simultaneously performing another visual task that demands full attention. By comparison, they are unable to discriminate large ‘T’s from ‘L’s in the same conditions. We conclude that attention may not be necessary for some visual tasks that are associated with ‘high level’ cortical areas. (full report)


Object Categorization: Unsupervised One-Shot Learning
Fei-Fei Li, Rob Fergus, Pietro Perona

Abstract. Learning visual models of object categories notoriously requires thousands of training examples; this is due to the diversity and richness of object appearance which requires models containing hundreds of parameters. We present a method for learning object categories from just a few images (1 - 5). It is based on incorporating "generic'' knowledge which may be obtained from previously learnt models of unrelated categories. We operate in a variational Bayesian framework: object categories are represented by probabilistic models, and "prior'' knowledge is represented as a probability density function on the parameters of these models. The "posterior'' model for an object category is obtained by updating the prior in the light of one or more observations. Our ideas are demonstrated on four diverse categories (human faces, airplanes, motorcycles, spotted cats). Initially three categories are learnt from hundreds of training examples, and a "prior'' is estimated from these. Then the model of the fourth category is learnt from 1 to 5 training examples, and is used for detecting new exemplars a set of test images. (full report)


Object Recognition by Probabilistic Hypothesis Construction
Pierre Moreels, Michael Maire, Pietro Perona

Abstract. We present a probabilistic framework for recognizing objects in images of cluttered scenes. Hundreds of objects may be considered and searched in parallel. Each object is learned from a single training image and is modeled by the visual appearance of a set of features, as well as their position with respect to a common reference frame. The recognition process computes both the identity and position of objects in the scene by computing the best interpretation (or hypothesis) of the scene in the light of a database of known objects. A hypothesis pairs features in an input image either with features in the database or marks them as clutters. Each hypothesis may be scored in a principled way using a generative model of the image which is defined using the learned objects as well as a model for clutter. While the space of all possible hypotheses is enormously large, one may find the best hypothesis efficiently—we explore a couple of heuristics to do so. In our initial experiments our algorithm compares favorably with state-of-the-art recognition systems. (full report)


Perception of Mirror Surfaces
Silvio Savarese, Fei Fei Li, Pietro Perona

Abstract. The aim of our work is to investigate how the human visual system perceives specular surfaces and which cues can be used to recover the shape of such class of objects. Our experiments show that mirror reflections are a weak cue for most human observers when additional information is not available. (full report)


3D Reconstruction of Specular Surfaces
Silvio Savarese, Min Chen and Pietro Perona

Abstract. Specular reflections carry valuable information on surface shapes. A curved mirror surface produces "distorted" images of the surrounding world. For example a straight line reflected by a curved mirror is in general a curve. It is clear that such distortions are systematically related to the shape of the surface. Our goal is to explore the geometry linking the shape of a curved mirror surface to the distortions produced on a scene it reflects. To this effect, we assume a simple known (calibrated) scene composed of lines passing through a point. We demonstrate that local shape geometry of the surface may be recovered from local deformation of the reflected images of at least three intersecting lines. (full report)




[top]

 
   

Home | Overview | Research | News | People | Contacts
Calendar of Events | Education and Outreach | Industrial Interactions | Strategic Plan
| NESS

last modified: 2/22/07